Gene Cag_0179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0179 
Symbol 
ID3747705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp200023 
End bp202797 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content48% 
IMG OID637772706 
ProductDNA polymerase A 
Protein accessionYP_378500 
Protein GI78188162 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATGC TTTACCGTGC TTTTTTTGCG TTGCAGCGCA CAGGCATGAG TAGCCCTTCG 
GGGTTGCCAA CGGGTGCGCT CTACGGCTTT ACCACAGCGT TGCTTAAAAT TTTTGAGAAT
TATCATCCTC ACTACTTAGT TGCGGCATTT GATAGCCGCG AAAAAACCTT TCGCCACCAT
TTGCTTGAGA GCTATAAGGC AAATCGTGCA GCTCCACCTG AAGAGCTGTT ACAGCAACTT
GAAAAGTTGT TTGAGTTGTT GAAAGCTTTT GGAGTGCCTG TTATTAAGCA AGCGGGTTAT
GAAGCTGATG ATCTTATTGG CGCGATGGTT ACTCAGTTTG CGGATGTTTG CCGCATTGGC
ATTGTTACGC CCGATAAAGA TTTAGCGCAG CTTGTGCGCG AAGGTGTGCA AATTTTAAAG
CCGGGGAAAA ATCAGCATGA GTTAGAGCCG CTTGGTTGCA ATGAGGTGAA AGCTCACTTT
GGCGTTCCTC CCAAACAATT CACCAATTTT TTAACCTTAA CCGGTGATAC GTCGGATAAC
ATTGTGGGCG CTAAAGGCAT TGGTCCAAAA ACCGCCGCAA CCTTGCTTGA AAAATATCAA
ACCTTAGATA AGCTTTACCA ACACTTGGAT GAGTTAACGC CAAAGGTGCG GAAAAGCCTT
GAGGATTTTG CACCGAATCG GGAGTTGGTG CTGCAACTTG TTACCATTTG CTGCGATGCG
CCGCTCCATG TTACGTTAGA GGAACTTGCT TGCAAAAATC CCGCACGAGA TGTTGTGCTG
CCGCTCTTGC AAGAGTTGGG CTTCCGTACC ATTGCTGCTC GTTTACAAGC TGCGTCCGTG
GCACTTACAT GCGCTTGTAA TGATGGGGGG GAAAGTGCTC CACCAATGCA AAGCGATCCT
AATAGTTCCA ACCTTTTAAA CGGAAGTGAT GGCAATACTT CGGCAACCGA TACCGCTCCC
CCACCATCAT TCCCAGACGT TCCTCGCCAT TACACCCTTG TAGAAACAAG AGAGCAATTG
CAGGCGTTGC TTGAGGAGTT GCAACAGGTT ACGCATATAG CGGTTGATAC CGAAACCACA
AGCCTTGATG TTTTTGAAGC TGAGCTGGCA GGAATTTCGC TTTGTGCTGA AGCGGGTAAA
GCATTTTTTA TTGCCACTAC GCCCGATGCT CTTGAGAGAA AAGAGGTTGT CAAGCAACTC
AAACCACTGC TTGAAAATCC CGCAATTACG AAAAGCGGGC AGAATTTGAA GTACGATATG
CTGGTGCTGA AAAAGTATGG CATTGAACTT GCACCCATCA GCTTTGATAC CATGCTTGCA
AGTTATGTGC TTAACCCCGA TGAGCACCAC AATCTCGACG ACATGGCACT GCGTTACCTT
GGGCGCACCA CCACCAAGTA TGATGAGCTT ACGGGCACAG GCAAACAGCG CCGCCATATT
TTTGAGGTGG AAAAAGAGGC ACTCACCAAC TACGCCTGCC AAGATGCCGA TGTGGCTTTT
CAACTGGAAG AGGTGCTGCA AGCCCAACTG CAAGCCGAGC CGCAACTGCT GGCACTTTGC
ACCACTATGG AGTTCCCGCT TGTGCGCGTG TTAGCAACAA TGGAGTATGC TGGTATTGCT
ATTGATACCG AGCATCTTGC CCGTGTAGCC GAAACCACCG AGCTGGAACT TCAATCCTTA
ACAGACAACA TTTACGCGGC GGCTGGTAGC TCTTTTAATA TTGATTCACC CAAACAGCTT
TCGCACGTAC TCTTTACCGA TCTTAGCTTG CCAACAGGTA AATCCACCAA AACAGGCTTT
TCAACCGATG TTGGCGTTTT GGAGGAGTTG GCTGCAACCT ACCCCATCGC AAGCGATTTA
CTGAGCTACC GCACGTTGCA AAAGTTAAAA GGAACCTACA TTGAGGCGCT GCCAAAAATA
ATCAATCCAC GCACAGGACG CATTCATACC TCCTTTAACC AGCACATTAC CGCAACGGGC
AGGCTCTCAT CCTCAAATCC CAACCTGCAA AACATTCCCG TTCGCACGGC GCTTGGTAAG
GAAATTCGCC GCGCCTTTAT TCCTTCAACC CCCGAACATT GGCTGCTTTC GGCTGATTAC
TCGCAAATTG AGCTGCGCAT TGCCGCTGAG CTTTCGGGCG ATGAGCGCTT GATTGCTGCT
TTCCGCAACG GCGAGGATAT TCACACCGCA ACGGCACAAG TGATTTTTGG AACGGAGGAA
ATTAGTAGCG ATATGCGCCG CAAAGCTAAA GAGGTGAACT TTGGCGTGCT CTACGGCATT
CAGCCTTTTG GGTTAGCAAA GCGCTTGAAC ATTCCCCAAA AAGAGGCAAA AGTTATTATT
GAAACCTATA AAGCTAAATA TCCACAGCTC TTTAATGTGT TGCGCCATAT TATTGAGGAG
GGAAAAGAAA AAGGCTACGT TACCACCCTT TTGGGGCGAC GACGCTACAT TGCTGACCTT
AACAGCCGCA ATGGCACCGT ACAAAAAGCT GCCGAACGCG CCGCTATGAA TACGCCCATT
CAAGGTACAG CGGCAGATAT TATTAAGTGC GCTATGAACC TTTGTTATCA GCAAATGCAA
GCGTCAGGCA TGGCTTCCGA AATGCTCTTG CAAGTGCATG ATGAATTGCT TTTTGAAACC
ACTGATAGCG AAAAAGAGGC ACTAACAAAG CTTGTAGAAA ATGCCATGAA AGAGGCTGCG
GTGCTTTGCG GCATGAAGCA AGTGCCGGTG GAGGTTGATT GCGGAGTTGG AAAAAATTGG
CTTGAAGCCC ATTGA
 
Protein sequence
MAMLYRAFFA LQRTGMSSPS GLPTGALYGF TTALLKIFEN YHPHYLVAAF DSREKTFRHH 
LLESYKANRA APPEELLQQL EKLFELLKAF GVPVIKQAGY EADDLIGAMV TQFADVCRIG
IVTPDKDLAQ LVREGVQILK PGKNQHELEP LGCNEVKAHF GVPPKQFTNF LTLTGDTSDN
IVGAKGIGPK TAATLLEKYQ TLDKLYQHLD ELTPKVRKSL EDFAPNRELV LQLVTICCDA
PLHVTLEELA CKNPARDVVL PLLQELGFRT IAARLQAASV ALTCACNDGG ESAPPMQSDP
NSSNLLNGSD GNTSATDTAP PPSFPDVPRH YTLVETREQL QALLEELQQV THIAVDTETT
SLDVFEAELA GISLCAEAGK AFFIATTPDA LERKEVVKQL KPLLENPAIT KSGQNLKYDM
LVLKKYGIEL APISFDTMLA SYVLNPDEHH NLDDMALRYL GRTTTKYDEL TGTGKQRRHI
FEVEKEALTN YACQDADVAF QLEEVLQAQL QAEPQLLALC TTMEFPLVRV LATMEYAGIA
IDTEHLARVA ETTELELQSL TDNIYAAAGS SFNIDSPKQL SHVLFTDLSL PTGKSTKTGF
STDVGVLEEL AATYPIASDL LSYRTLQKLK GTYIEALPKI INPRTGRIHT SFNQHITATG
RLSSSNPNLQ NIPVRTALGK EIRRAFIPST PEHWLLSADY SQIELRIAAE LSGDERLIAA
FRNGEDIHTA TAQVIFGTEE ISSDMRRKAK EVNFGVLYGI QPFGLAKRLN IPQKEAKVII
ETYKAKYPQL FNVLRHIIEE GKEKGYVTTL LGRRRYIADL NSRNGTVQKA AERAAMNTPI
QGTAADIIKC AMNLCYQQMQ ASGMASEMLL QVHDELLFET TDSEKEALTK LVENAMKEAA
VLCGMKQVPV EVDCGVGKNW LEAH