Gene Information |
Locus tag | Tgr7_1850 |
Symbol | |
ID | 7315180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1965807 |
End bp | 1967000 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616741 |
Product | Cupin 4 family protein |
Protein accession | YP_002513918 |
Protein GI | 220935019 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
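The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1965807
END_BP = 1967000
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1194
    print(expected_protein_length(GENE_LENGTH_BP))  # 397
```

Under these assumptions the reported gene length (1194 bp) and protein length (397 aa) agree with the coordinates.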
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.469672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCATGA ACCCCGACGA ACCGCTGACG CTGCTGGGGG GGCTCACCGC CCGGGCGTTT CTGCGCGACT ACTGGCAGCA GAAACCGCTG CTGGTGCGTC AGGCGATCCC CGGGTTCGAG TCGCCCCTCT CACCCGAGGA ACTGGCGGGG CTGGCCTGCG AGGAAGGGGT GATCAGTCGC CTGGTGCGCG AGCGGGGCGA GACCGGGTCC TGGGCGTTGC GCACCGGTCC CTTCGACGAG GACGACTTCA CCACCCTGCC CGAGAGCCAC TGGACCCTGC TGGTCTCGGA CATGGAGAAG CACCTGCCGG AACTGCGGGC CTACCTGGAA CCCTTCCGCT TCATCCCCGA CTGGCGCATG GACGACCTGA TGGTGAGCTA TGCCGCGCCG GAGGGTTCCG TGGGACCCCA CGTGGACGAG TACGACGTGT TCCTGCTCCA GGCCCAGGGC CGACGCCGCT GGCAGATCGC CCGTCAGGCG GTGAGCGGCG ATGACTTCCT GCCCGGGGTC GAACTGCGCA TCCTGCGGGA TTTCCAGCCG GACCAGGAGT GGATCCTGGA GCCCGGCGAC ATGCTCTACC TGCCGCCGCG CATCCCCCAT CACGGTGTGG CCGTGGGACC GTGCATGACC TGGTCCGTGG GTTTCCGGGC CCCGGCCTGG CGGGACCTGA TGGCCGCCTG GGTGGACCAG CGCTACGAAG CACTCGCGCC CCAGGATCGC TACGCCGATC CAGGCCTGGA GCCCCAGGAC AACCCGGGTG AACTCAGCGC AGCCGCCCTC GCCCGCCTGA TCGCCGGCCT GCGCCGGGCC ATGGCGGTCG ATGACGCCGA ACTCGCCCGC TGGCTGGGCA CTGTGCTCAC CGAACCCAAG GCGGAACTGC TGGAGCACAT GCAACTGCCG GAGACGCTCA CCCGGGACGA GGCACTCGGC CTGCTGCAGG ATGGAGTCTC CCTGGAACGC CACGGCGCCG CCCGCCTGGC CTGGATGTCG GACCACGGTG GCCTGCGCCT GTTCGTCAAC GGCCAGGAAC ACCTCCTGCC GGAAGCGGCA GGCCCGCTGG TGCGCCACCT GTGCGCCGAA ACGGCCTATG ACGGCAAGGC CCTTTGGGGC CTTGCCAGTG GTATCGACAG CGCGGAAGAT CTGCTCATGA GCCTGTGTAT AGCCGGGATC CTGCTTGAGA TCCCGGCGCC TTGA
|
Protein sequence | MAMNPDEPLT LLGGLTARAF LRDYWQQKPL LVRQAIPGFE SPLSPEELAG LACEEGVISR LVRERGETGS WALRTGPFDE DDFTTLPESH WTLLVSDMEK HLPELRAYLE PFRFIPDWRM DDLMVSYAAP EGSVGPHVDE YDVFLLQAQG RRRWQIARQA VSGDDFLPGV ELRILRDFQP DQEWILEPGD MLYLPPRIPH HGVAVGPCMT WSVGFRAPAW RDLMAAWVDQ RYEALAPQDR YADPGLEPQD NPGELSAAAL ARLIAGLRRA MAVDDAELAR WLGTVLTEPK AELLEHMQLP ETLTRDEALG LLQDGVSLER HGAARLAWMS DHGGLRLFVN GQEHLLPEAA GPLVRHLCAE TAYDGKALWG LASGIDSAED LLMSLCIAGI LLEIPAP
|
| |
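The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not handle), and all function names are hypothetical. Only the first three blocks of the gene sequence are used as input to keep the example short.

```python
# Codon table in the usual T, C, A, G ordering. Assumption: table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown in the Sequence table.
    prefix = "ATGGCCATGA ACCCCGACGA ACCGCTGACG"
    print(translate(prefix))  # MAMNPDEPLT
    print(f"{gc_fraction(prefix):.2f}")
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare with the reported 69% GC content.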