Gene Cag_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_2007 
Symbol 
ID3747117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2544884 
End bp2547250 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content42% 
IMG OID637774544 
ProductDNA-directed DNA polymerase B 
Protein accessionYP_380298 
Protein GI78189960 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATC TTATCAACCA TATTGTTACC AACAACTTAT TGTTCGGTAA AGATAAAGAA 
GAGCGCATTG TTGGTGCTTA CCAACTCTCC GACACCCACA TTCGCCTTTT TAACCGTAAC
GGCGATACGG TAACGTTTCA CGATGAACCA TTTTATCCCT ACTTTTTTCT CTCCGATAGC
TCTTTGCTGG AAACCTTCGT ACCAGAAAAT CAAGAGAAGT TTTGGCTCGT GCCGCTTGCG
GGTAGCAACT ACTATACGGC GCTTGCAATT TTTAAAAGCA GTCGTAACCA TAAAAATGCC
GTTGATTTTC TTAACCGAAA GTGGAATGGC AATCAAGCCG CACAAGGGGA AGCTGCTGGT
AAAAACAGCA TGGAGAGCAA TCCCTTTATG TACAATAAAG GAGACACCAT TACGCAATAT
CTGATGCAAA GCGGTAAAAC CATGTTTAAA GGCATGTTGT TTGACGACAT TTACCGAATG
CAGCTTGATA TTGAAACAAA TTATAATGGA GAGAAAAAAG GGTTCTATGA TGACGAAATC
ATCATTATTT CGCTTTCTGA TAATAGGGGA TGGGAACAGC CGTTGCATTC AAAAGGGCGC
AATGAAAAAG AGCTGCTTCA AGAGCTAATT GCCGTTATCC AAGAAAAAGA TCCCGACGTT
ATTGAAGGGC ACAACATTTT TAATTTCGAC CTACCCTACA TTCAGCGCCG CTGCGAACGC
CATTCCATTC CATTTACCAT AGGGCGCAAT CAAACAATCC CTCGCACCTA TCCATCAAGC
ATACGTTTTG GCGAGCGCAC CATTGACTTC CCCTATTGCG ACATTCCTGG ACGCCATGTT
ATAGACACCC TCTTTCTCGT GCAAGGCTAC GATGTGGCAA AACGCTCTAT TGAAAGCTAT
GGGTTGAAAA ATGTAGCACG CCACTTTGGC TTTGCCTCTG CTAACCGCAC CTACATAGAG
TATAAAGATA TTGCCCGTTT ATGGCAGGAG GAGCCGAATA CCTTATTAGC TTATGCGCTT
GATGATGTAC GCGAAACGCA AGCGCTCTCC TCACTGCTAT CGGGGAGCAA TTTTTACATG
ACGCAAATGC TACCCTACAG TTATGCTATG ACGGCACGGC TTGGGCAGGC TGCAAAAATT
GAGGCTCTTT TTGTGCGCGA ATACCTCCGC GAAAAGCATT CTCTACCAAA ACCAACCTCA
GGGCAGCAGC AAAGCGGAGG CTACACTGAA GTCTTCTTAA AAGGCATTCT TGGACCAATT
GTCTATGCCG ATGTTGAATC GCTTTACCCC TCAATCATGC TCTCTTATAA CGTCTGCCCA
AAAAGCGATG CGTTACGAGT GTTTCCTAAC GTTTTGCGGA GCCTTAAAGA GTTACGTTTT
AAAGCAAAAG ATCAAGCACA GCAAGAGCTG CAAGCAGGCA ACAAACGCAA TGCCGATAAC
TTTGATGCCA TGCAAGCCTC CTTTAAAATT ATTATTAATG CCATGTATGG CTACCTTGGC
TATAGTGGAG GCATTTTTAA CGATTACGGC GAAGCTGACC GTGTAACCAC AACGGGACAG
GGCATTGCAA GAAAAATGAT TGCTGAATTT GAAAAACGGG GCTGCAAAAT TATAGAGGTT
GATACTGATG GAATTTTCTT TATACCTCCA GCATCCATTG CAAGCGAGCA AGAGGAGAAA
GCGCTTGTAG AAGAGGTATC GCAACAAATG CCTGATGGCA TTAATATTGG CTTTGATGGG
CGTTTTAAGA AGATGATTTC TTACATGAAA AAGAATTACG CCTTGCTAAG CTACAATAAT
GTTATGAAGC TTAAAGGCTC ATCGCTTAAT AGCCGAAGTG CTGAAAAGTT TGGACGGGAA
TTTATTAGAA GAGGCTTTCA AATGCTCTTA GCTGAAGATA TTAAAGGCTT GCATCTTCTT
TTTGCGGAAT ACAAAGAAAA AATTCTTAAC CATCAGCTTT CCATCGAAGA GTTTTCTCGT
AGTGAAAGCT TAAAACAAAC CAAAGAACAA TATCTTGAAG ATGTAGCTTC CGCTAAACGC
TCAAAGTCCA TTACCTATGA ATTAGCTATT CGCAAAGGAA TGGAAATTCG CAAAGGCGAT
AAAATTAGCT ACTACATTAC AGGAAGCGGC TCGAGCAATT TTTCGTGGGA TAAAGGCAAA
CTTGCCGCCG AATGGGATCC CAACAAACCC GATGAAAACA GCGCTTTTTA CTTAAAACGT
CTTGATGAGT ACAGTCAAAA ATTTTTACCC TTTTTTAAGC CGCAAGATTA CAGTATGATT
TTTTCCACAG GCTCGCTCTT TGCTTTTAGC GAAGAGGGAA TTGAATTACT TAAAGAAATT
CCCAATACTG ATAGTCAAAC AGAATAA
 
Protein sequence
MENLINHIVT NNLLFGKDKE ERIVGAYQLS DTHIRLFNRN GDTVTFHDEP FYPYFFLSDS 
SLLETFVPEN QEKFWLVPLA GSNYYTALAI FKSSRNHKNA VDFLNRKWNG NQAAQGEAAG
KNSMESNPFM YNKGDTITQY LMQSGKTMFK GMLFDDIYRM QLDIETNYNG EKKGFYDDEI
IIISLSDNRG WEQPLHSKGR NEKELLQELI AVIQEKDPDV IEGHNIFNFD LPYIQRRCER
HSIPFTIGRN QTIPRTYPSS IRFGERTIDF PYCDIPGRHV IDTLFLVQGY DVAKRSIESY
GLKNVARHFG FASANRTYIE YKDIARLWQE EPNTLLAYAL DDVRETQALS SLLSGSNFYM
TQMLPYSYAM TARLGQAAKI EALFVREYLR EKHSLPKPTS GQQQSGGYTE VFLKGILGPI
VYADVESLYP SIMLSYNVCP KSDALRVFPN VLRSLKELRF KAKDQAQQEL QAGNKRNADN
FDAMQASFKI IINAMYGYLG YSGGIFNDYG EADRVTTTGQ GIARKMIAEF EKRGCKIIEV
DTDGIFFIPP ASIASEQEEK ALVEEVSQQM PDGINIGFDG RFKKMISYMK KNYALLSYNN
VMKLKGSSLN SRSAEKFGRE FIRRGFQMLL AEDIKGLHLL FAEYKEKILN HQLSIEEFSR
SESLKQTKEQ YLEDVASAKR SKSITYELAI RKGMEIRKGD KISYYITGSG SSNFSWDKGK
LAAEWDPNKP DENSAFYLKR LDEYSQKFLP FFKPQDYSMI FSTGSLFAFS EEGIELLKEI
PNTDSQTE