Gene Information |
Locus tag | Tgr7_1146 |
Symbol | |
ID | 7315855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1233135 |
End bp | 1234436 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643616033 |
Product | thiol-disulfide isomerase and thioredoxins-like protein |
Protein accession | YP_002513219 |
Protein GI | 220934320 |
COG category | [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0526] Thiol-disulfide isomerase and thioredoxins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.299964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCCCCGAC GACCGAGCCT GTGCCTCGCG CTGCTCCTGC TCTTCAGCCT GCCCCATGCC GTTCTCGCCG ACTTCAGCCT GGACCTGCCG GACGGCGAGC GCCTGGACGT GCGCGTCTGG GGCGAGGACA ACACCGGCCC CCTGTTCGTC TGGCTCATCA ACCAGTACGG CGAGACCGAA GGGCCCCACA ACCTGGCCCG GCGTCTGGCC GAGCGCAATG CCACGGTCTG GCAGGTGGAC CTGCTGGACT CCCTGCTGCT TCAGCGCAGC AACGAGGTGG TGCGCAACCT GGACGGCGCA CCGGTGGCGG CCTTGTTGCA CGAGGCCGTG GACAGCGGCC GCGGACCCAT CGTGGTCACC ACCTGCGACC GCATGACCGT GCCCCTGCTG CGGGGCCTGC GGGCCTGGCA GGAAGAGGCC ACCGACATGA ACGCCGTGGC CGGCGGCATC CTGTTCTTCC CCAACCTCTA CCGGGGCACC CCGGTGGCCG GCGAGGAGCC GGAGCTGCTG GGCATCGTCT CCGCCACCAA CATGCCGCTC GCCATCCTGC AGCCGGAACT GGGCACCAAC CGCCCGCGCC TGGAGGCGCT GCTGGCCACC CTGCACCGGG CCGGCAGCCC CGCCTACGGC TGGCTGGTGG ACAGCGTGCG GGACTACTAC CTGCTGCGCT CCGTCGAGCC CGAAGCGGTG GAACTGGAGG CCATGGGCGG CCCGATCCCC TTCGACGTGA CCCGCGCCAT CCTGGACACC CCCACCCAGC TGCTGGCGGC CACCCGGCTG CTGGCCCAGA CCCCGCGCCC GGACCGTCCC GCGCCCCTGG ACGAGGAGGC CGAGGCGCCG GTGCTGCCGG CCTTTGGCCT GGTGGAACGG CCGGCCTACG ACCCGCCCGG CTATGACCTG GTGGATGCCC GGGGCGTGCG CCATCAGCAC ACCGAGAGCC TGGGGCGGGT CACCCTGGTG AACTTCTGGG CCACCTGGTG CCCGCCCTGC GTGCACGAGA TCCCGTCCAT GAACCGCCTG GCCGCCGCCT ACCCGGAAGA CGAGTTCGCC ATCGTCTCCA TCAACTTCCG GGAATCCCCG GCGCACGTTC TGAACTTCAT GGAAGACGTG AACGTGGACT TCCCCGTGCT CATGGACGAG GACGGCGCGG TCTCGGGCGA GTGGCGGGTA TTCGCCTTCC CCAGCTCCTT CCTGCTGGAC CGCCAGGGTC GGGTGCGCTA CTCGGTGAAC ACGGCCATCG AGTGGGATAC AGACGAGGTG CGGGAGGTCA TCGACCGGCT GAGGGCCGAG GATACTTATT GA
Protein sequence | MPRRPSLCLA LLLLFSLPHA VLADFSLDLP DGERLDVRVW GEDNTGPLFV WLINQYGETE GPHNLARRLA ERNATVWQVD LLDSLLLQRS NEVVRNLDGA PVAALLHEAV DSGRGPIVVT TCDRMTVPLL RGLRAWQEEA TDMNAVAGGI LFFPNLYRGT PVAGEEPELL GIVSATNMPL AILQPELGTN RPRLEALLAT LHRAGSPAYG WLVDSVRDYY LLRSVEPEAV ELEAMGGPIP FDVTRAILDT PTQLLAATRL LAQTPRPDRP APLDEEAEAP VLPAFGLVER PAYDPPGYDL VDARGVRHQH TESLGRVTLV NFWATWCPPC VHEIPSMNRL AAAYPEDEFA IVSINFRESP AHVLNFMEDV NVDFPVLMDE DGAVSGEWRV FAFPSSFLLD RQGRVRYSVN TAIEWDTDEV REVIDRLRAE DTY
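The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1233135
END_BP = 1234436
REPORTED_GENE_LENGTH = 1302      # bp
REPORTED_PROTEIN_LENGTH = 433    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues for an unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

The strand does not affect this check, because the length depends only on the two coordinates, not on the direction of transcription.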