Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1048 |
Symbol | |
ID | 3747029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1417669 |
End bp | 1421049 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637773577 |
Product | hypothetical protein |
Protein accession | YP_379353 |
Protein GI | 78189015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00227105 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCACAT GGGGCTCAAA TCATGGTAGC AATATCTATG GTCAGATATT CAAGAATGAT GGGTCAAAAG AGGGTAATCA ATTTCAGATT AACACTTATA CCTCTCAGCA ACCCGACTGG AACCCATTTG AAAAATATGC TCCCTCGATG GCTGCCTTAG CTAACGGCGG TTTCGTGGTA ACATGGAATA ACTATTGGCA GGATGGCGAT GGATCCGGTA TCTATGCAAG AATTTACGAT AACAGTCCCA GCACTGGCTT GTTGAGTCTT GACTTACTTG ATACAAATTG GAGTAATGAT AGTGATAATA TTTATAGAGC AACAGGGCGT GCCACATTGG GACTTTTGTC AGGCAATAGT AGTATGCTGT ACATCCAAGA TGCTCAATAT ACGCTTACTG GTAATGCTCT GATTATTGAA GGTGAATTCA GTGCAATTAT TGAAGGAACT ACTCGGTCAC TCTTTATCGG CAAAGCTGTT TTTGATGTTA CAACAGGTTT TGCTGAACTT ACTGTTGGTA GTAACCTTCA TAATCCAGCA GGATTGGGTT TTGAATTTAC CGCTCTTGAT CTCAGCAACA ACTATATTGA TATACATTTT TCACCATTGG AATTACCAGC AGGAATCAGT AATACCAATA TCATTTTAGG CTCTGACTCA TTTGTTATCA AGGAAAACTA TCCTCAACTT GGTTTTTATG GTAGCGTAAA TTTTCCTGAT AAGACATTTG AACTTTTTGA TATGCTTACT GTTCATGCTT ATGACTGGTC AGTGAGTTAT GACAGCCCCG ATAATGAATT GCATATACTT GGAGGCTTTG AGTTACAAAC CGGATGGGAT AATGTTCCCG GTATCAATGC AGAACTGACG GGCGATGGTT TAGTTATTCG CAATGGTGAA TTTGCTGATG TTGCTCTTAC AATCACCCTT GATGATTTCT CCGTCAAAGG CTGGGGATTC AATAATGTTA GTGTTACCCT TGATACAGAG AAAAATTCCA TTGTTGGCTC GGCTGGTATA AAGCTACCTA TGTTCGCCTC TTCTCTTGAA ACCACCATCG GATTTATCGT TAATCCTTTT GAGCTTGACA CAGCCCTTTT TCATATCCCG TTTATCAACC CGGGTATCGC CCTTGGTACA ACAGGATGGT TTCTAACTGC TATTGAAGGA GGTGTTTCTA ATCTGGCAAG TAGCAATAAG GAACCGTTAC TATTTAAAGG CGGAGTAGAG CTACAGCTTC CTGAAGTTAT GGATTTGAGT GTGAACGGTG AGCTGGATAG CAAACATATT GCAGGATTCG TAGAAGGAAC AATAATAGAT AAAGATGCTA TAGATTTTCA AGGGCAAGTT ACCCTTAATT GGAACAAAGA TTATGTGCGA GTGAACGGTT CGGCATCCTT TGCACAAGGA ATGATTGTCG GAGATTTTGG TTTTACAAGC AACTTGAATC TTGATTTTAC AGCTAAAGGC AGTGCAACGG TAAAATTTGA TACAATTGAT CAAATCCTTT CAGGCCATTA CTATCTTAAT TACAGCAATG ATCATAAAGA TAGTAATGAT TATATAGCTG CATGGGCAGA GACTGTATTG CATATCCCAG TTTTTGGAGA CAAAACGGTA ACTATAGGAA CCAAGTATTC CTTTGATGGA ACATGGAGAG CTTTCGGGGC AGCAGAGGTT CCACTCTACA GCAGTTGGAT TGTTGATGAA ACGATAAGTG ATTTAATGGT TACAGTGAAT TGGGATAATC CTGTAAACGA CGTTGAAACA CGAGTAGTAG TTTATGATGA TTTGGAGAAA ACCAAAATCC GCCAAATTAT TAGCGAAGCG GAATATGGAG AACATGATAT TGCCATCATC AGCGAATGGA GCGGTAGTAC AGCTAAAGTG ATTTATATCA ATACCCCAGA AGCAGGCTTG TGGGATGTTG AGGTTATCAA CTCTGATGGA CTTGGTGAAG TAGTTTACAG CGCAACAACA AGCCTTAAAC CGCTTGTTCT TACAGTGGGT GAACTTAATC TGAATAACGA TCAGCTCAGT CTAAGCTATA TCGCGAATAC CCCTGAAACC GATGGCGTTA TATCTTTCTA CTTAGATGAC AACGACAATG GTTTTGATGG TCAGATGATA GAGAGTATGG CTGATCCTGA TGGAAATGGG CAATGGGTGT GGAATACCAC TGGATTTCAT GGCGGCACAT ACTGGCTCTA TGCAACGTTA GCAGATGGAA AAAGTGCGCC GGTAATGTCG TATGCAGCAC AATCCATTTT TATTAATAAT GCTCCAGTTG CGCTTGATGA TTCAATAATC ACAAATGAAG ACACCGCAGT TGTTATTGAT GTTTTAGTGA ACGATAGCGA TTTTGATGAC AACCCGTTGC GCATAAGCAG CATTACTACT CCATCCAACG GTACGGTTAG CGTTACTGAT GACAATAAAA TCCTTTTCGC TCCTTATGCT GATACGTATG GTGAATCCAT ATTCACTTAC ACTATTACTG ATGGTTACGG TGGAGAAAGT ACGGCAGTTG TAAATATTAC TATCAATAGT ATGCCTGATG CTCCGAAAGG GTCGGTGATT ATTGAAGGGC ATCTCAAACA AGGTGAAATA CTGAGAGCTG ACGTGAGTAC TCTTAGTGAT TCGGATGGTA TGGGAACCAT TGCTTACCAA TGGAAAGTTG ATGGCACCAA TATTGAAGGA GCAACAAATG AAACATACAC CCTGACGGCA GCGGAGGTGG GCAACATTGT TACTGTTGAG GTGAGCTATA CCGATGGCAA TGGCAAGCTG GAAAGCGTAG CAAGTTTAGC AACCAATGCG GTTACACCTA TTAACGCCCC CAATCATCAC GACCTCGACG GCACCATCAC CTTCTGGCAA ACTGGCGACG CACTTGCTGA CGTAGCCACT ACGCTCACAC AACCCAACAA CGGTGCCGCT ACCGCCACCA CCGATGTCCA TGGATACTAC CAAATCCCCG ATGTGCAATC CGGCACCTAC CAACTCAGCG CCATAAAAGC CACCGATACA GCAACAACAA ACGCCGTTAC AACCGATGAT GTTCATGCAG TTTTAAAAAT AGCCGCAGGA ATTAATCCAA ATCCAGACGG CAGCGCTGTT TCACCCTATC AATTCCTTGC CGCCGACATA AACCACGATG GCAAAGTACG CGCCGCCGAT GCCCTTCTGC TTCTAAAAAT GATCGTTGAC TACGAAGGCG CACCAGAACC ACAATGGTAC TTTGCACCAT ACAACATTGG TAACGAAGCC ACCATGGATC GCTCACATGT TGATTGGTCA GTTACTAATC CACAAACCGT TACCATTAAC GACAACACAA CAGTAAATCT TATCGGCATT CTAACCGGGG ATGTGGTGTG A
|
Protein sequence | MVTWGSNHGS NIYGQIFKND GSKEGNQFQI NTYTSQQPDW NPFEKYAPSM AALANGGFVV TWNNYWQDGD GSGIYARIYD NSPSTGLLSL DLLDTNWSND SDNIYRATGR ATLGLLSGNS SMLYIQDAQY TLTGNALIIE GEFSAIIEGT TRSLFIGKAV FDVTTGFAEL TVGSNLHNPA GLGFEFTALD LSNNYIDIHF SPLELPAGIS NTNIILGSDS FVIKENYPQL GFYGSVNFPD KTFELFDMLT VHAYDWSVSY DSPDNELHIL GGFELQTGWD NVPGINAELT GDGLVIRNGE FADVALTITL DDFSVKGWGF NNVSVTLDTE KNSIVGSAGI KLPMFASSLE TTIGFIVNPF ELDTALFHIP FINPGIALGT TGWFLTAIEG GVSNLASSNK EPLLFKGGVE LQLPEVMDLS VNGELDSKHI AGFVEGTIID KDAIDFQGQV TLNWNKDYVR VNGSASFAQG MIVGDFGFTS NLNLDFTAKG SATVKFDTID QILSGHYYLN YSNDHKDSND YIAAWAETVL HIPVFGDKTV TIGTKYSFDG TWRAFGAAEV PLYSSWIVDE TISDLMVTVN WDNPVNDVET RVVVYDDLEK TKIRQIISEA EYGEHDIAII SEWSGSTAKV IYINTPEAGL WDVEVINSDG LGEVVYSATT SLKPLVLTVG ELNLNNDQLS LSYIANTPET DGVISFYLDD NDNGFDGQMI ESMADPDGNG QWVWNTTGFH GGTYWLYATL ADGKSAPVMS YAAQSIFINN APVALDDSII TNEDTAVVID VLVNDSDFDD NPLRISSITT PSNGTVSVTD DNKILFAPYA DTYGESIFTY TITDGYGGES TAVVNITINS MPDAPKGSVI IEGHLKQGEI LRADVSTLSD SDGMGTIAYQ WKVDGTNIEG ATNETYTLTA AEVGNIVTVE VSYTDGNGKL ESVASLATNA VTPINAPNHH DLDGTITFWQ TGDALADVAT TLTQPNNGAA TATTDVHGYY QIPDVQSGTY QLSAIKATDT ATTNAVTTDD VHAVLKIAAG INPNPDGSAV SPYQFLAADI NHDGKVRAAD ALLLLKMIVD YEGAPEPQWY FAPYNIGNEA TMDRSHVDWS VTNPQTVTIN DNTTVNLIGI LTGDVV
|
| |