Gene Cag_0543 details

Gene Information

Locus tag: Cag_0543
Symbol:
ID: 3747047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 646895
End bp: 648955
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 50%
IMG OID: 637773077
Product: hypothetical protein
Protein accession: YP_378859
Protein GI: 78188521
COG category: [L] Replication, recombination and repair
COG ID: [COG1555] DNA uptake protein and related DNA-binding proteins
TIGRFAM ID:
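
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive, and if the gene length counts the terminal stop codon. The following minimal Python sketch shows that arithmetic. The function names are hypothetical, and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(646895, 648955)
print(length)                     # 2061
print(protein_length_aa(length))  # 686
```

Both results match the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.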


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.792006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACTTC CCTGCATTGT GCCCTCTACC TTGCGGCGTT ATCTGCCCAC ACTCTTTGTT 
ACCTCTCTTG CACTTTTGCC ACTTACACCC ATTTCGGTAT ATGGTGAAAT GGCAGAGGAT
TTAGCGGCAC TTTCATCGTC CTCCGAATCC GATTACAATA GCGAAGTGGC GTTAGCGTTG
CTCGAAGAGT TGCGCCACCA TCCACTTTCC ATTAATCGTG CGACAGCCAA TGAGTTACGC
CAATTACCGT GGCTTAGCGC GGCGGACGTT CATGCCATTA TTAAGTACCG CACGCAAAAA
GGTGCTTTTC GTTCACTTTC TGAGCTTGAA ACAATACTCG GTAAAGAAAG AGCCACATGG
CTTTCCCCCT ACTTAACGGT TGAGGCAGCG CCCGTTCCAG CTAAAACAAC GGTGCGCCCA
AAAGCAACAT CCACGTCCAA AAAAACTAAA AAAGTAGCAA CCACAGGCTC GCTCTATAGC
CGCTACTTTA CCGAAATGCC TCCACGCAAA GGTATTCTTA CGGAGAAATA TGAAGGGGGA
AATAGTAAAA TGTACCATCG CGCACAATTT TATGCTCCCC ATGTAAGCGC CTCCGTTGTG
CAAGAAAAAG ATATTGGCGA AGCTGCAATT ACTGACTTTA CCTCGCTCAG CGTCAGTGTG
GCTGACGTGG GTATGATGGA ACGAGTGGTG TTGGGCAATT ATCGGCTGAC GCTGGGGCAG
GGATTAATGA TTGGGCAAGG ACGTTTTTTT TCGAAAGGTG CCGAAGTTGG CGGGCGACTT
ACCACCAAAA CCTTAATGCC TTACGCCTCA GCAAGCGAAG AGGGCTTTTT GCAAGGGGCG
GCGGCAACAC TGCAAATTCA GCCCATAGCG CTCACCCTTT TTTATTCGGC AAACCAGCGC
GATGCCATTA TTAATAAAGA GGGCGTTATT ACCAGCTTAA GCAGCAGCGG CTACCACCGC
ACCACACTTG AAGTAAGCCG TAAAGATAAC ATTACCGAAA ACGTTATGGG CGCTCATTTG
CGCTACCGAA CGGCGGTAGC GGGCATGGAG GCAACGCTTG GCGGCGGCAT GATGAACTAC
AGCTATCCCT ACCCATTTGA TGAGCTTGAA CCCAACGAGC CCGTTAGCAC GGTTTTAGGA
GCAACCCTTA CCAATGTTGA TGCAACCCTC TCATTCGGTA GCGGAGCACT CTTTGCCGAA
GCCGCATTCG CAAGCGATCC ACACGATATG GCATGGTTTG CAGGAGCAGA ATATGAGCCG
CTGCGGGGCG TTACCGCCGT TGCCGCCTTA CGCCGTTATG GAGAGAATTT TTATTCACCC
TTTGCCAATG CCTTTGCAGA ACGAGGTGGT GGCTCCAATG AAGAGGGCTT ATACACGGCT
GTTCAAGCGG CGTTCAGCAA AAAAGTAACT CTTGGCGCTT ATTACGACCG CTTTACCTTT
CCCCAACTTG GCAGCCACTA CCAGCAAGCC GCCGATGGCT TTGATGCACG CGCTTGGTTT
TCGTGGCAGC AATCGAGCCT ACTTTGCTGG AATGTGCAGG TGCAGCACAA AGAAAAGCCT
GAAGAAAAAA ATCAAGGCAC CACTAAAAAT CCCATATGGA CACCATTGCC GATTCTTACC
GACCGCCTAC AACTTAACTG CGAAGTAACG CCACATAAGG GCATAAGCTT ACGCACACGC
TTTGAGCTAA AAAATGTTGA TAAAGAGTAT CTCTTAGCCA CGCAATCTTT TACGGGAAAA
ATGTGGTATC AGCAAGTGGG CTACCGCACA GAAAATTTCA GCTTAAAAGG GCGCTTTACT
CGCTTTACCA CCACCGATTA CGCCGCCGCA ATCTATGCCT ATGAAGATGA TTTACCGCTA
ACCTCCAGCT TAGGCATGTA TAGCGGCGAT GGCAGCTCGC TTTTTGCCGT AGCAACGTGG
CAGCCCATGA AGCAAATGAA AGTAGCCGCA CGTTACGAAG TTACACGCTA TAACGACCGC
GACGTTTATA GCAGCGGCAA TGACGAGCGT GCAACCAATG CACCGTCATC GCTCCATGTT
GGATGTATGC TGTCGTTTTG A
 
Protein sequence
MQLPCIVPST LRRYLPTLFV TSLALLPLTP ISVYGEMAED LAALSSSSES DYNSEVALAL 
LEELRHHPLS INRATANELR QLPWLSAADV HAIIKYRTQK GAFRSLSELE TILGKERATW
LSPYLTVEAA PVPAKTTVRP KATSTSKKTK KVATTGSLYS RYFTEMPPRK GILTEKYEGG
NSKMYHRAQF YAPHVSASVV QEKDIGEAAI TDFTSLSVSV ADVGMMERVV LGNYRLTLGQ
GLMIGQGRFF SKGAEVGGRL TTKTLMPYAS ASEEGFLQGA AATLQIQPIA LTLFYSANQR
DAIINKEGVI TSLSSSGYHR TTLEVSRKDN ITENVMGAHL RYRTAVAGME ATLGGGMMNY
SYPYPFDELE PNEPVSTVLG ATLTNVDATL SFGSGALFAE AAFASDPHDM AWFAGAEYEP
LRGVTAVAAL RRYGENFYSP FANAFAERGG GSNEEGLYTA VQAAFSKKVT LGAYYDRFTF
PQLGSHYQQA ADGFDARAWF SWQQSSLLCW NVQVQHKEKP EEKNQGTTKN PIWTPLPILT
DRLQLNCEVT PHKGISLRTR FELKNVDKEY LLATQSFTGK MWYQQVGYRT ENFSLKGRFT
RFTTTDYAAA IYAYEDDLPL TSSLGMYSGD GSSLFAVATW QPMKQMKVAA RYEVTRYNDR
DVYSSGNDER ATNAPSSLHV GCMLSF
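
The sequences above are printed in space-separated blocks of 10 letters. As a rough sketch, the blocks can be joined into one contiguous string before checking length, start and stop codons, or GC content. The helper names `join_blocks` and `gc_percent` are hypothetical. For brevity, only the first and last lines of the gene sequence are used here, so the GC figure for the full gene is not recomputed.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between blocks."""
    return "".join("".join(line.split()) for line in lines)


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAACTTC CCTGCATTGT GCCCTCTACC TTGCGGCGTT ATCTGCCCAC ACTCTTTGTT "
last_line = "GGATGTATGC TGTCGTTTTG A"

fragment = join_blocks([first_line, last_line])
print(len(fragment))   # 81 (60 bases + 21 bases)
print(fragment[:3])    # ATG, the start codon
print(fragment[-3:])   # TGA, a stop codon

# Toy input to show the GC calculation; not the gene itself.
print(gc_percent("GGCCAATT"))  # 50.0
```

Passing all of the gene sequence lines to `join_blocks` should give a string of 2061 bases, matching the Gene Length field.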