Gene Cag_0075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0075 
Symbol 
ID3746409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp83182 
End bp85578 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content48% 
IMG OID637772601 
ProductDNA topoisomerase I 
Protein accessionYP_378397 
Protein GI78188059 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCAT CAGTAGCAGC ACTCTCTGCC AAAAATAAAA CCCTCATTGT GGTTGAGTCG 
CCGTCAAAGG CAAAAACCAT TAACAAGTAT CTCGGCAGCA ACTATACGGT TTTCGCCTCC
GTCGGGCACA TTAAAGATTT ACCGAAAAAA GAGATTGGGC TCGACTTTGA ACATAACTAC
TCCCCTCGTT ACGAAATTAT TCCCGGTAAA GAAAAGGTAG TAAAGCAGCT TAAAAAACTT
GCGACTGAAG CCAGCAACAT TTTGATTGCT ACTGACCCTG ACCGCGAGGG TGAAGCTATT
GCATGGCACA TTGCGAACGA AATAGAACAT GCCAAAGCAC CCGTTGCCCG CGTGCTCTTT
AACGAAGTTA CCAAAAAAGC CATTCTTGAG GCAATTGAAA AACCTCGCCA TATTGACTTG
CGCCTTGTGC ATTCCCAGCA AACAAGGCAG GGGCTCGATA AAATTGTAGG TTATAAAATC
AGCCCATTTT TGTGGAAAGT GGTGCTGCGT GGATTATCGG CAGGGCGCGT ACAATCGGTG
GCGTTACGCC TTATTTGTGA GCGTGAAGAG GAGATTGAGC GCTTTGTTAT TCAAGAATAT
TGGACTATTG CTGCCGATTT TCTAACGGCA AACAAAGAAT CCTTCCGCGC CCGCTTAGTG
CGTTTGGATG GCGATAAACC TGAAATCACG AATGTAGAGC AAGCAGAAGC TATTGCCGCT
ATTGCCAAAA AAGGCAACTA CAGCGTTCGC GAAATTACTC CTCGCATTCA ACAACGAAAA
CAGCCGCTAC CGTTTACGAC ATCGTTGTTG CAGCAAGCCG CCTCTAACCA GCTTGGTTTT
GGTGCACAGC GCACCATGCG CACCGCCCAG CAACTTTACG AAGGTATTGA ACTTGGTGCT
GAAGGTGCTA TGGGTCTTAT TACCTACATG CGTACCGACT CCACACGCAT TAGCCCCGAA
GCAGTTGGCG AAGCACGCAA CTACATTGAG CGCAATTTTG GCAAAGATTA CGTTGGAGCT
GGTAGCAGCG GCAAACCCGG CAAAAATGCT CAAGATGCCC ACGAAGCCAT TCGCCCCACC
TCGCTCCTTA AAACGCCTGA ACAGGTAAAA CCCTACCTCT CCGCCGATCA ATTCAAGCTC
TATGAATTAA TTTGGAAGCG CTTTCTTGCT GCAATGATGG CTCCCGCTAA AATTGAGCAA
ACAAAGGTTG ATGTTGAGGA GCAAAGCGGC AAATTTCTTT TTCGAGCAAA TGGAAGCCGC
GTACTCTTCC CTGGCTTTAT GCGCGTTTAT GACGATCAGC AAGAGCTTGC ATATGAGGCA
CAAACCTCCA CAAAAGAAGA GGTGGAAAAT GAAATGGTGG TAAAACTTCC AGAAAAACTT
GCCGTTAATG ACCCGCTTGG GCTTGGCGCA TTAGAACAAA AACAAAGCTT TACCCGTCCA
CCTGCTCGTT ACAGCGAAGC AAGCTTAGTA AAAGACCTCG ACCACTTTGG CATTGGGCGC
CCATCAACCT ACGCCTCTAT TTTTTCAACT TTGCAAGATC GCCGCTATGT GGCACTTGAA
AAGCGCAAAA TTATGCCCAC CGATCTTGGG CGTGATGTTG CTAAAATTCT TGTAGCCAAC
TTCCCCGAAC TCTTTAACGT TGGCTTTACC GCCTTTATGG AGGACGAGCT TGATAAAGTT
GCAAGTGGCG ACGATGCGTA TGAAAAAGTG CTCGATAGCT TTTACAAGCC TCTCACATCA
GCTTTAGCGC TTCGCAGTGC CACGCCACTC ATTCCGCAAA ACAACGAAGC CGAAACATGC
GACAAATGCG GCACAGGCAA AATGATTTTA AAATGGACAG CAAGCGGCAA ATTTCTTGGA
TGCTCCAACT ATCCCAAGTG CAAAAACATT CGCACCATTA GTAGCAATCG CGAAAAACCC
GCATCAACGG GCGTGCACTG CCCCTCATGC GAAGATGGTG AAATGGTGCT CCGCAAAGGA
CGACTTGGAC CATTTCTTGC CTGCTCCAAC TATCCCAAAT GCAATACGCT GCTTAACCTA
AACAAGCAAC GCCACATTGA GCCACCGAAA ACACCACCCG TTGTTACCGA TATGGCGTGC
CCAAAATGCG GTGCCCCACT TTACCTCCGC AGTGGCAAAC GAGGGCTATG GCTTGGCTGC
TCAAAATTCC CCAAATGTAG AGGACGCCTT GCATGGACAG CGCTTGAGCC TGCTGCTCAA
GAGCGATGGG AACGCGTTAT GGCTGCACAT CAAAAAGCAC ATCCCCCCGT TACCTTAAAA
ATGGTTGATG GTAGTACTGT TTCAATGACC AGCTCAATTG ATGATATTAT TATGAAAGCT
GACGCCGCAG GATTAATTGC TCCCGCAATG GATTTAGTGC CTGAAGCGGA GGGATAG
 
Protein sequence
MASSVAALSA KNKTLIVVES PSKAKTINKY LGSNYTVFAS VGHIKDLPKK EIGLDFEHNY 
SPRYEIIPGK EKVVKQLKKL ATEASNILIA TDPDREGEAI AWHIANEIEH AKAPVARVLF
NEVTKKAILE AIEKPRHIDL RLVHSQQTRQ GLDKIVGYKI SPFLWKVVLR GLSAGRVQSV
ALRLICEREE EIERFVIQEY WTIAADFLTA NKESFRARLV RLDGDKPEIT NVEQAEAIAA
IAKKGNYSVR EITPRIQQRK QPLPFTTSLL QQAASNQLGF GAQRTMRTAQ QLYEGIELGA
EGAMGLITYM RTDSTRISPE AVGEARNYIE RNFGKDYVGA GSSGKPGKNA QDAHEAIRPT
SLLKTPEQVK PYLSADQFKL YELIWKRFLA AMMAPAKIEQ TKVDVEEQSG KFLFRANGSR
VLFPGFMRVY DDQQELAYEA QTSTKEEVEN EMVVKLPEKL AVNDPLGLGA LEQKQSFTRP
PARYSEASLV KDLDHFGIGR PSTYASIFST LQDRRYVALE KRKIMPTDLG RDVAKILVAN
FPELFNVGFT AFMEDELDKV ASGDDAYEKV LDSFYKPLTS ALALRSATPL IPQNNEAETC
DKCGTGKMIL KWTASGKFLG CSNYPKCKNI RTISSNREKP ASTGVHCPSC EDGEMVLRKG
RLGPFLACSN YPKCNTLLNL NKQRHIEPPK TPPVVTDMAC PKCGAPLYLR SGKRGLWLGC
SKFPKCRGRL AWTALEPAAQ ERWERVMAAH QKAHPPVTLK MVDGSTVSMT SSIDDIIMKA
DAAGLIAPAM DLVPEAEG