Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1602 |
Symbol | |
ID | 7316228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1719039 |
End bp | 1720076 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643616494 |
Product | hypothetical protein |
Protein accession | YP_002513672 |
Protein GI | 220934773 |
COG category | [R] General function prediction only |
COG ID | [COG2358] TRAP-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0369631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACC GCCTGATGGC AGCCATCGCA GGTCTGCTCA TCGCCTTCCC GCTGGGACTC TCGCAGATCC CTACCGCCGA GGCCCAACAG GTCAAGCAGA TCCGCTGGGC CACCTCGGCC GTGGGTTCCG CCGGGCACCG GGCCAAGGTG GCGCTGATGG CCATGCTGAA CCGGGAGATG CCCGATTACT CCATCACCGT GCTGCCCACC CCCGGGGCAC TCGCCACGGT GCGCGGTTAT GCCACCAACC AGTTCGACGG CTACTACGGT GCCGATATCG CCTTCCGCGA GCTGGCCACC GATTCGGGTC GTTTCCGCGG CTTTTCGGCG CAGATGCAGC GCGAGCCGGT GCAGTCCTTC TGGGCCTACA CCATGGAGGT GGGCCTGGGG GTGCGTGCCG CGGACCGCGA GCGCTATTCC GGCTGGGGTG ACCTGCGTGG CCGTCCGGTG TTCAGCGGCC CCGCCCCCTG GGATGTGCGC GCCCAGCTGG AGCGGGCCAT GGAGACCCTG GAGGTCGGTC ATCGCTACGT GGAGCTGGAC CTGGGGCTTG CGGCCTCCTC CCTGGCGGAG CGCAATATCG ACGCCTTCAT CATCTACAGC ACCGGTGAGT CCAACCCCGC CCCCTGGGTG AACGAGGCCA TGACGGCCAC CGATGTGGCC GCACTCAATC CCTCCGAGGC TGAGATCGCG AAGCTGCGCG CCGCAGGACT GGACGTGGTG GAAGTGGACG GCAGCGTGTT CCGCAACGCC CGCGTGGACA AGGTGGTGCT GGTGCCCTTC TTCTACGGCT TCCACGTGGG TCTGGAAGTC CCCGAGGAGG ACGTCTACCG CAAGCTCATG ACCATCGAGG CCAACGCGGC AGAACTGGCC CAGTCCGATT CCGCCTTCCG CCAGATCGCC GCCGACATGG TGGGCATCCA GCGCCGCGGC GTGGCCGCCT CGGTGGATTC CGTGAAGGTG CATCCGGGCC TGGCCCGTTA TCTGCGGGAA AAAGGCGCCT GGGACGACGC CTGGAACGAT CGCATCGCCT CACGCTAA
|
Protein sequence | MKYRLMAAIA GLLIAFPLGL SQIPTAEAQQ VKQIRWATSA VGSAGHRAKV ALMAMLNREM PDYSITVLPT PGALATVRGY ATNQFDGYYG ADIAFRELAT DSGRFRGFSA QMQREPVQSF WAYTMEVGLG VRAADRERYS GWGDLRGRPV FSGPAPWDVR AQLERAMETL EVGHRYVELD LGLAASSLAE RNIDAFIIYS TGESNPAPWV NEAMTATDVA ALNPSEAEIA KLRAAGLDVV EVDGSVFRNA RVDKVVLVPF FYGFHVGLEV PEEDVYRKLM TIEANAAELA QSDSAFRQIA ADMVGIQRRG VAASVDSVKV HPGLARYLRE KGAWDDAWND RIASR
|
| |