Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1921 |
Symbol | |
ID | 3747296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2453687 |
End bp | 2455000 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637774456 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_380212 |
Protein GI | 78189874 |
COG category | [R] General function prediction only |
COG ID | [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000266613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTTC AGTGCGTTAT GACAAATCAT CAGCTTTCAC GGCGCGATTT TGCAAAGTTG TTACTGTCGA GCACGGCAGG CGCTTTGCTT GGGGTTGGTG TGCCATCGTC GCGTACGTAT GCGGCAACCA ATCGTGTAGT TATTATTGGT GGTGGGTTTG GTGGAGCTAC GGCAGCAAAG TATCTCCGCA AGCTTGATCC GTCGGTTGCT ATTACGTTGG TTGAGCCAAA ATGCCAATTT TATACCTGCC CTATTAGCAA TTGGGTGATT GCTGGTTTAA AGCCAATGCA CGCTATTGCG CAAAATTATA ATGCGTTAAG GGTGCGTTAT GGTGTAAATG TGGTACACGC TACGGCGGTG GCTATTGATG CCTTAAAAAA CAGCGTTACC CTGCATAGTG GCAAAAAGCT CTTTTATGAT CGTTTAATTG TGTCGCCCGG TATTGATTTC CGTTGGAATG CAATTCCTGG TTATAGCCAA AAGGTGGCTG AAAGTGTTAT GCCGCATGGT TTTCAAGCGG GTGAGCAAAC GTTGCTGTTG CGCAAGCAAT TGCTGGCAAT GCCGAATGGT GGCACGGTAA TTATGTGCCC TCCCAACAAT CCGCACCGTT GCCCTGCTGC ACCTTATGAG CGTGCAAGTT TAATAGCACA TTATTTAAAG CAGCACAAGC CAAAGTCGAA GGTGCTCATT CTTGATTGCA AGGAGAAGTT TTCCAAACAA GAGCTTTTTT TGCAGGGTTG GGAGCGTTTG TATTCGGGAA TGATTGAATG GCGTGCGGCA ACGGCTGGCG GTAAAGTGGA GGCGGTGAAT AGTGCGGCTA TGACGGTTAC CACCGAGTTT GGTGATGAAA AGGGTGACCT TATTAATATT ATGCCACCGC AACAAGCGGG TCGCATTGCT TTTGAGGCTG GTTTAACCGA TGCCGCTGGT TGGTGCCCTG TGCATCCCAT TACCTTTGAA TCAACGCTGC ATCCCGGCAT TCACATTATT GGCGATGCTT GCCACGCGGG TGATATGCCA AAATCAGCAT TTGCCTCAAG TAGCCAAGGG AAGGTGGCAA GTTCGGCTAT TGCTGCATTA CTGCAAGGCA GAGTGCCTGT TGCGCCATCA TTAGTAAGCA CCTGTTACAG CTTACTTAAG CCCGATTACG CTATTTCGGT AGCTAATGTT TTTCGTTTAA CGATTGACGG TATTGTTGAT GTAAAAGGTT CGGGTGGCGT TACCCCGCTT GATGCGTCGG TGGAACATTT GCAACACGAA GCCGATTTTG CATGGGGATG GTACGAAAAC ATTACACGCG ATACGTGGGG CTAA
|
Protein sequence | MFFQCVMTNH QLSRRDFAKL LLSSTAGALL GVGVPSSRTY AATNRVVIIG GGFGGATAAK YLRKLDPSVA ITLVEPKCQF YTCPISNWVI AGLKPMHAIA QNYNALRVRY GVNVVHATAV AIDALKNSVT LHSGKKLFYD RLIVSPGIDF RWNAIPGYSQ KVAESVMPHG FQAGEQTLLL RKQLLAMPNG GTVIMCPPNN PHRCPAAPYE RASLIAHYLK QHKPKSKVLI LDCKEKFSKQ ELFLQGWERL YSGMIEWRAA TAGGKVEAVN SAAMTVTTEF GDEKGDLINI MPPQQAGRIA FEAGLTDAAG WCPVHPITFE STLHPGIHII GDACHAGDMP KSAFASSSQG KVASSAIAAL LQGRVPVAPS LVSTCYSLLK PDYAISVANV FRLTIDGIVD VKGSGGVTPL DASVEHLQHE ADFAWGWYEN ITRDTWG
|
| |