Gene Cag_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1048 
Symbol 
ID3747029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1417669 
End bp1421049 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content42% 
IMG OID637773577 
Producthypothetical protein 
Protein accessionYP_379353 
Protein GI78189015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00227105 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCACAT GGGGCTCAAA TCATGGTAGC AATATCTATG GTCAGATATT CAAGAATGAT 
GGGTCAAAAG AGGGTAATCA ATTTCAGATT AACACTTATA CCTCTCAGCA ACCCGACTGG
AACCCATTTG AAAAATATGC TCCCTCGATG GCTGCCTTAG CTAACGGCGG TTTCGTGGTA
ACATGGAATA ACTATTGGCA GGATGGCGAT GGATCCGGTA TCTATGCAAG AATTTACGAT
AACAGTCCCA GCACTGGCTT GTTGAGTCTT GACTTACTTG ATACAAATTG GAGTAATGAT
AGTGATAATA TTTATAGAGC AACAGGGCGT GCCACATTGG GACTTTTGTC AGGCAATAGT
AGTATGCTGT ACATCCAAGA TGCTCAATAT ACGCTTACTG GTAATGCTCT GATTATTGAA
GGTGAATTCA GTGCAATTAT TGAAGGAACT ACTCGGTCAC TCTTTATCGG CAAAGCTGTT
TTTGATGTTA CAACAGGTTT TGCTGAACTT ACTGTTGGTA GTAACCTTCA TAATCCAGCA
GGATTGGGTT TTGAATTTAC CGCTCTTGAT CTCAGCAACA ACTATATTGA TATACATTTT
TCACCATTGG AATTACCAGC AGGAATCAGT AATACCAATA TCATTTTAGG CTCTGACTCA
TTTGTTATCA AGGAAAACTA TCCTCAACTT GGTTTTTATG GTAGCGTAAA TTTTCCTGAT
AAGACATTTG AACTTTTTGA TATGCTTACT GTTCATGCTT ATGACTGGTC AGTGAGTTAT
GACAGCCCCG ATAATGAATT GCATATACTT GGAGGCTTTG AGTTACAAAC CGGATGGGAT
AATGTTCCCG GTATCAATGC AGAACTGACG GGCGATGGTT TAGTTATTCG CAATGGTGAA
TTTGCTGATG TTGCTCTTAC AATCACCCTT GATGATTTCT CCGTCAAAGG CTGGGGATTC
AATAATGTTA GTGTTACCCT TGATACAGAG AAAAATTCCA TTGTTGGCTC GGCTGGTATA
AAGCTACCTA TGTTCGCCTC TTCTCTTGAA ACCACCATCG GATTTATCGT TAATCCTTTT
GAGCTTGACA CAGCCCTTTT TCATATCCCG TTTATCAACC CGGGTATCGC CCTTGGTACA
ACAGGATGGT TTCTAACTGC TATTGAAGGA GGTGTTTCTA ATCTGGCAAG TAGCAATAAG
GAACCGTTAC TATTTAAAGG CGGAGTAGAG CTACAGCTTC CTGAAGTTAT GGATTTGAGT
GTGAACGGTG AGCTGGATAG CAAACATATT GCAGGATTCG TAGAAGGAAC AATAATAGAT
AAAGATGCTA TAGATTTTCA AGGGCAAGTT ACCCTTAATT GGAACAAAGA TTATGTGCGA
GTGAACGGTT CGGCATCCTT TGCACAAGGA ATGATTGTCG GAGATTTTGG TTTTACAAGC
AACTTGAATC TTGATTTTAC AGCTAAAGGC AGTGCAACGG TAAAATTTGA TACAATTGAT
CAAATCCTTT CAGGCCATTA CTATCTTAAT TACAGCAATG ATCATAAAGA TAGTAATGAT
TATATAGCTG CATGGGCAGA GACTGTATTG CATATCCCAG TTTTTGGAGA CAAAACGGTA
ACTATAGGAA CCAAGTATTC CTTTGATGGA ACATGGAGAG CTTTCGGGGC AGCAGAGGTT
CCACTCTACA GCAGTTGGAT TGTTGATGAA ACGATAAGTG ATTTAATGGT TACAGTGAAT
TGGGATAATC CTGTAAACGA CGTTGAAACA CGAGTAGTAG TTTATGATGA TTTGGAGAAA
ACCAAAATCC GCCAAATTAT TAGCGAAGCG GAATATGGAG AACATGATAT TGCCATCATC
AGCGAATGGA GCGGTAGTAC AGCTAAAGTG ATTTATATCA ATACCCCAGA AGCAGGCTTG
TGGGATGTTG AGGTTATCAA CTCTGATGGA CTTGGTGAAG TAGTTTACAG CGCAACAACA
AGCCTTAAAC CGCTTGTTCT TACAGTGGGT GAACTTAATC TGAATAACGA TCAGCTCAGT
CTAAGCTATA TCGCGAATAC CCCTGAAACC GATGGCGTTA TATCTTTCTA CTTAGATGAC
AACGACAATG GTTTTGATGG TCAGATGATA GAGAGTATGG CTGATCCTGA TGGAAATGGG
CAATGGGTGT GGAATACCAC TGGATTTCAT GGCGGCACAT ACTGGCTCTA TGCAACGTTA
GCAGATGGAA AAAGTGCGCC GGTAATGTCG TATGCAGCAC AATCCATTTT TATTAATAAT
GCTCCAGTTG CGCTTGATGA TTCAATAATC ACAAATGAAG ACACCGCAGT TGTTATTGAT
GTTTTAGTGA ACGATAGCGA TTTTGATGAC AACCCGTTGC GCATAAGCAG CATTACTACT
CCATCCAACG GTACGGTTAG CGTTACTGAT GACAATAAAA TCCTTTTCGC TCCTTATGCT
GATACGTATG GTGAATCCAT ATTCACTTAC ACTATTACTG ATGGTTACGG TGGAGAAAGT
ACGGCAGTTG TAAATATTAC TATCAATAGT ATGCCTGATG CTCCGAAAGG GTCGGTGATT
ATTGAAGGGC ATCTCAAACA AGGTGAAATA CTGAGAGCTG ACGTGAGTAC TCTTAGTGAT
TCGGATGGTA TGGGAACCAT TGCTTACCAA TGGAAAGTTG ATGGCACCAA TATTGAAGGA
GCAACAAATG AAACATACAC CCTGACGGCA GCGGAGGTGG GCAACATTGT TACTGTTGAG
GTGAGCTATA CCGATGGCAA TGGCAAGCTG GAAAGCGTAG CAAGTTTAGC AACCAATGCG
GTTACACCTA TTAACGCCCC CAATCATCAC GACCTCGACG GCACCATCAC CTTCTGGCAA
ACTGGCGACG CACTTGCTGA CGTAGCCACT ACGCTCACAC AACCCAACAA CGGTGCCGCT
ACCGCCACCA CCGATGTCCA TGGATACTAC CAAATCCCCG ATGTGCAATC CGGCACCTAC
CAACTCAGCG CCATAAAAGC CACCGATACA GCAACAACAA ACGCCGTTAC AACCGATGAT
GTTCATGCAG TTTTAAAAAT AGCCGCAGGA ATTAATCCAA ATCCAGACGG CAGCGCTGTT
TCACCCTATC AATTCCTTGC CGCCGACATA AACCACGATG GCAAAGTACG CGCCGCCGAT
GCCCTTCTGC TTCTAAAAAT GATCGTTGAC TACGAAGGCG CACCAGAACC ACAATGGTAC
TTTGCACCAT ACAACATTGG TAACGAAGCC ACCATGGATC GCTCACATGT TGATTGGTCA
GTTACTAATC CACAAACCGT TACCATTAAC GACAACACAA CAGTAAATCT TATCGGCATT
CTAACCGGGG ATGTGGTGTG A
 
Protein sequence
MVTWGSNHGS NIYGQIFKND GSKEGNQFQI NTYTSQQPDW NPFEKYAPSM AALANGGFVV 
TWNNYWQDGD GSGIYARIYD NSPSTGLLSL DLLDTNWSND SDNIYRATGR ATLGLLSGNS
SMLYIQDAQY TLTGNALIIE GEFSAIIEGT TRSLFIGKAV FDVTTGFAEL TVGSNLHNPA
GLGFEFTALD LSNNYIDIHF SPLELPAGIS NTNIILGSDS FVIKENYPQL GFYGSVNFPD
KTFELFDMLT VHAYDWSVSY DSPDNELHIL GGFELQTGWD NVPGINAELT GDGLVIRNGE
FADVALTITL DDFSVKGWGF NNVSVTLDTE KNSIVGSAGI KLPMFASSLE TTIGFIVNPF
ELDTALFHIP FINPGIALGT TGWFLTAIEG GVSNLASSNK EPLLFKGGVE LQLPEVMDLS
VNGELDSKHI AGFVEGTIID KDAIDFQGQV TLNWNKDYVR VNGSASFAQG MIVGDFGFTS
NLNLDFTAKG SATVKFDTID QILSGHYYLN YSNDHKDSND YIAAWAETVL HIPVFGDKTV
TIGTKYSFDG TWRAFGAAEV PLYSSWIVDE TISDLMVTVN WDNPVNDVET RVVVYDDLEK
TKIRQIISEA EYGEHDIAII SEWSGSTAKV IYINTPEAGL WDVEVINSDG LGEVVYSATT
SLKPLVLTVG ELNLNNDQLS LSYIANTPET DGVISFYLDD NDNGFDGQMI ESMADPDGNG
QWVWNTTGFH GGTYWLYATL ADGKSAPVMS YAAQSIFINN APVALDDSII TNEDTAVVID
VLVNDSDFDD NPLRISSITT PSNGTVSVTD DNKILFAPYA DTYGESIFTY TITDGYGGES
TAVVNITINS MPDAPKGSVI IEGHLKQGEI LRADVSTLSD SDGMGTIAYQ WKVDGTNIEG
ATNETYTLTA AEVGNIVTVE VSYTDGNGKL ESVASLATNA VTPINAPNHH DLDGTITFWQ
TGDALADVAT TLTQPNNGAA TATTDVHGYY QIPDVQSGTY QLSAIKATDT ATTNAVTTDD
VHAVLKIAAG INPNPDGSAV SPYQFLAADI NHDGKVRAAD ALLLLKMIVD YEGAPEPQWY
FAPYNIGNEA TMDRSHVDWS VTNPQTVTIN DNTTVNLIGI LTGDVV