Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_2042 |
Symbol | |
ID | 3746089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 2270880 |
End bp | 2273672 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637770073 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_375927 |
Protein GI | 78187884 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.607166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAAAA CTATTTCGCG CCGCCAGTTC CTGAAAGTCG CCGGCGTTCT GGGGGGCATG TCCCTGTTGC GCCCCGTCTG GAGCCTGGGG CAGGCTGGAT CACCCGAACA AGGCGGAGCC GTCTCGGGAA CAGTTGTCTG GGTACCGAGC ATCTGCAACT TCTGTTCTTC GTTCTGTGAC ATCAAGGTTG CCACGAAGGA GATCGACGGG GTGAAGCGCG CCGTCAAGAT CGAAGGCAAT GCGGAAAGCC CCCTCAACCG CGGCAAAATC TGCGCCCGCG GTCAGGCCGG CCTCTACCAG ACCTACGATC CGGACCGGCT GAAGCAGCCG CTCATCAGGG TCGAGGGCAG CAAGCGGGGC GAATGGAACT TCAGGGCGGC GACATGGGAT GAGGCCTACC GCCACATCAT AGGCAAGCTG CAGAAGGTCA ACCCATGGGA AATCAGCCTC GTCGGCGGAT GGACGGCCTG CGTCTCCTAC ATGCATTTCA GCCTGCCGTT CTGCCAGAGC CTCGGGATCC CCAACATCGT TGCATCTCCT CTTCAGCACT GCGTCACCGC AGGCCATCTC GGCACCGATC TGGTGACCGG CAACTTCAAC GTGCACGACG AGATCCTCGC CGATTTCGAA AACGCCCGAT ACATCCTCTT CAGCCTCAAC AACGCCTCCG TGGCTGCCAT CTCGACGGCC CGTGCCGTGC GGTTCGGCCA GGCGAAGAAA AACGGGGCCA AAGTGGTCTG CCTCGATCCC AGAATGGGCG AGCTTGCCGC AAAAGCCGAC GAGTGGATCC CCGTCAAGCC GGGTACCGAC CACGCATTCT TCCTCGCGAT GCTCCATACC CTGCTCCGCG AGAAGCTCTA TGACGGGCCC TTTGTGTCCA AACACACCAA TGCCCCGTTC CTTTCGTTCG TTGACGAAAA GGGCGCCGTG CAGCTGGCCG CGGATAAGGG TGCGGACGGA AACCCCACCG CCTACTACGT CTTTGACCTC ATCAGCCAGG AGGTCCGCGC CGTCCCGGCC TATACGAACA CCAACGAGCG CACGAAATCG GGATCGAGGA TCCAGCCGGG CCTCAACGCC CCGAATAACC TCACATGGCA GGGACGCCCC GTCACGACCG TCTTCGACCG GTTCATTGCG GAATCCGAAC CCTACACCCC CGAGTGGGCA TCGAAGATCA CCGACATCCC GGCTTCCACC ATCAAACGCA TCGCCGTCGA GTTCGGCCAG GCACGTCCGG CCATGGTCGA TCCCGGCTGG ATGGGAGCAC GATACCACCA CCTCATCGGC CAGCGCCGAC TGCAGGCGAT CATCCAGACC CTCGTCGGAG GCATCGACCG TCCCGGCGGC TGGATGATGA ACGGCGAACT GCACCACAAG GCCGAGGTTT CCTGGCACAA CACGCAGCAG GGCAAGAGCG ACAAGGACCT CCTGCCGGTC CAACGCCCCG GCATGGGCTT CGCCTACGGC CTGCTTGACA TCTTCGCCAA CCCCGGAGCC TGGGAACACG GAATGCCGGC GTTCTCGTTC GCATGGGCCG AAGAGCAGGC GAAAGCAGGA AAGCAGTCCG CGTTCCTGCC CGCCATGGCC GACACCGGCC TGCTTGAAGC GGTGAGGGGA GAACTGAAGT ACCATGGGCG TCCCTACAAC ATCAAGGCCA TCATCCTCAA CGCGGCCAAC CCGATCCGCC ACTACTTCCC GGCAAAGCGG TGGGAGGAGA TCCTCTCGCA CAAGAACATC GATCTCGTGG TGGCCGTCGA CGTACTGCCC TCCGACTCGA CGCTCTATGC GGACGTCATC CTGCCGAACC ACACCTACCT TGAGCGCAAC GAGCCGCTGC TCTACCCGCT CGGGCCGAAC ACCGGCATCG GTTTCACGAC CAGGATCAGG ACCATCGATC CGATCTACGA CACACGCGAC ACGACCGACA TCCTCTGCGC CATAGCCGAG GGGATGGGCA AGCTCGATAG CTACATACAC GGCATCGCAG AATACGCGGG CCTTGATCAT GCAATGCTGA AGCGCGAAAT CGCAGCCGCA AAGAAAGCCG GCAAACCGCT GAACGAAGCC TTCCTGAAAA CCGCCTATGA AGCCATGGGC CACTTTGCAG AACATGTGAC AGGCAAGCAC ATGAGCGCCG CCGAGGTTGA AGCAACCATC ATGGACAAGG GACTGCTCAT GCTGAAAGAC GCCGATACTG TGGTTGAAGA GATGAACATG CCGCGCAAAA TCCCCGTACC GACATCCACC GGACGACTCG AGCTCTTCAG CCCGATCCTC TCTTCATTCG GAGAGAAAGC CGGAAGGACC CCGCTGTTCG ACCCCGTGCT CGGCTACGTT CCGCGGGTCG TCAGCGACAA GTCACCCGAA CCCTCCCTGA ACGCCGACGA GTTCTACTTC ACCTACGGCA AGGTCCCGGT GGTGTCGCAC GCATCGACCA ACAACAACAA CGCGGTGCTT GCCGCAGTGA CAAAGCCTAA AGAGGGCGCA TTCACGGGCC TCTGGATGAA CGCCACAAAG GCCCTTGCCC TCGGCCTCCG GGATGGTCAG GAGGTGGAGG TCACCAACCT CCGCTACGGC CCGAAGGTAA CGGCAACGCT CTTCGTGACC GAGATGATCC GTCCCGACAC GGTCTTCCTC CCTTCGTCCT ATGGAAGCCG CAACAAGATG CTTTCGGTCG CAGGCGGAAA AGGAACTGCG CTGAACGAGC TCATGCCCTA CAGCATAGAG CCGATCGCAG CATCGTTCAT GTCACAGGAA TTTACGGTAA GCGTCACGCC CGTCAACAGT TAA
|
Protein sequence | MHKTISRRQF LKVAGVLGGM SLLRPVWSLG QAGSPEQGGA VSGTVVWVPS ICNFCSSFCD IKVATKEIDG VKRAVKIEGN AESPLNRGKI CARGQAGLYQ TYDPDRLKQP LIRVEGSKRG EWNFRAATWD EAYRHIIGKL QKVNPWEISL VGGWTACVSY MHFSLPFCQS LGIPNIVASP LQHCVTAGHL GTDLVTGNFN VHDEILADFE NARYILFSLN NASVAAISTA RAVRFGQAKK NGAKVVCLDP RMGELAAKAD EWIPVKPGTD HAFFLAMLHT LLREKLYDGP FVSKHTNAPF LSFVDEKGAV QLAADKGADG NPTAYYVFDL ISQEVRAVPA YTNTNERTKS GSRIQPGLNA PNNLTWQGRP VTTVFDRFIA ESEPYTPEWA SKITDIPAST IKRIAVEFGQ ARPAMVDPGW MGARYHHLIG QRRLQAIIQT LVGGIDRPGG WMMNGELHHK AEVSWHNTQQ GKSDKDLLPV QRPGMGFAYG LLDIFANPGA WEHGMPAFSF AWAEEQAKAG KQSAFLPAMA DTGLLEAVRG ELKYHGRPYN IKAIILNAAN PIRHYFPAKR WEEILSHKNI DLVVAVDVLP SDSTLYADVI LPNHTYLERN EPLLYPLGPN TGIGFTTRIR TIDPIYDTRD TTDILCAIAE GMGKLDSYIH GIAEYAGLDH AMLKREIAAA KKAGKPLNEA FLKTAYEAMG HFAEHVTGKH MSAAEVEATI MDKGLLMLKD ADTVVEEMNM PRKIPVPTST GRLELFSPIL SSFGEKAGRT PLFDPVLGYV PRVVSDKSPE PSLNADEFYF TYGKVPVVSH ASTNNNNAVL AAVTKPKEGA FTGLWMNATK ALALGLRDGQ EVEVTNLRYG PKVTATLFVT EMIRPDTVFL PSSYGSRNKM LSVAGGKGTA LNELMPYSIE PIAASFMSQE FTVSVTPVNS
|
| |