Gene Information |
Locus tag | Tgr7_1239 |
Symbol | |
ID | 7317728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1330002 |
End bp | 1331270 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643616127 |
Product | thiol-disulfide isomerase and thioredoxins-like protein |
Protein accession | YP_002513312 |
Protein GI | 220934413 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0526] Thiol-disulfide isomerase and thioredoxins |
TIGRFAM ID | |
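The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reported values are consistent with that convention) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(1330002, 1331270)
print(length)                  # 1269
print(protein_length(length))  # 422
```

Both results match the Gene Length and Protein Length fields of the record. The strand (here `-`) does not affect the length calculation.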
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.502856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA CGGGATCTCT CCTGCTGCTC TGCCTGTGCC TGATCGCCCC CCTGGCTGCG GGCGCCATGA CCCTCGACCT GAAGGATGGC ACCGAACTGC CCGTGGCCCG CTACGAGGCC GAGGGCGAGC GCCTGCTCAT CTGGCTGCCC TCGGAGCACG GCGTGCTCGC CGGCCATCAC CAGCAGGCGG AATTCCTGCA GGCCCGAGGT GTCGAGGTCT GGCTGCCGGA TCCCTTCAGC GCCTACTTCC TGCCCAACAC CCCCAGCAGC CTGGACCAGA TCCCCGACGC CCTGATCGCC TGGCTGATCG ACGCGGCCCG GGAGGAAACC GGCAAGCAGG TGTTCGTGTT CGGCAACGAC CGGGCCGCTC CCTGGGTGCT GAGCGGCCTG CGGGAATGGC AGCTTCAGGA ACCCGATCGT GACGGCCTGG GCGGCGTGGT ACTCATGAGC CCCTACCTGC ACACGGGCAT TCCCGAGGAT GACCGCCCGG CGCAATTCCA GCCCATCACC CGGGGCACCA ACCTGCCCGT GTACCTGATT CAGCCGGTGC TCTCGCCCCA GTATCCCCTG CTCAAGGCCA TGGCCGAGCG GCTGGCGGAA GGCGGCAGTC AGGTCTACGT GCGCACCCTG CCGCGCGTGC GGGACCGGTT CTTCTTCCGT CCCAACACCC TGGAGGCGGA GGACGCATTC AAGCCCCAGT TCGCGGCGGA GCTGGACACC GCCCTGCGCC TGCTGAACGG CCTGGAATCT CGGCGCGCCG CCGTGGATCT GCCCGGCGAG GCACGCACCG CCACCCCAAC CCGCCAGGAC CGGCGCCTGG AACCCTGGCG CGGCAACCCG GAGCCACCAC CCCTGGTCCT GCCCGACCTG GACGGCAAGA CCCATGACCT GGCCGATTAC GAGGGCACGG TCGTGCTCAT CAATTTCTGG GCCAGCTGGT GTCCGCCCTG CGTGCACGAG ATGCCCTCCA TGCAGCGCCT TGAAGAGAGC TTCGAGGGGC GGCCCTTCAC CATCCTGGCG GTGAACCTGG GCGAGGACGA GGCCACCGTG CTGCCCTTCC TGGAACGCAT CAACGTGGAC TTCACCATCC TCATGGATCC GGCCGGTCAG GCCCTGCGCA ACTGGAACGT ACTCGCCTAT CCCACCAGCT ACGTGATCGA CCGGCAGGGC CGCATCAGCC ATGCCCTGTT CGGCGCCATC GAATGGATGG ACCCGGAGGT GATCGCGGTG TTCGAGGATC TGATGGCGCG GGCAAGCAGG GGGCAGTGA
|
Protein sequence | MKKTGSLLLL CLCLIAPLAA GAMTLDLKDG TELPVARYEA EGERLLIWLP SEHGVLAGHH QQAEFLQARG VEVWLPDPFS AYFLPNTPSS LDQIPDALIA WLIDAAREET GKQVFVFGND RAAPWVLSGL REWQLQEPDR DGLGGVVLMS PYLHTGIPED DRPAQFQPIT RGTNLPVYLI QPVLSPQYPL LKAMAERLAE GGSQVYVRTL PRVRDRFFFR PNTLEAEDAF KPQFAAELDT ALRLLNGLES RRAAVDLPGE ARTATPTRQD RRLEPWRGNP EPPPLVLPDL DGKTHDLADY EGTVVLINFW ASWCPPCVHE MPSMQRLEES FEGRPFTILA VNLGEDEATV LPFLERINVD FTILMDPAGQ ALRNWNVLAY PTSYVIDRQG RISHALFGAI EWMDPEVIAV FEDLMARASR GQ
|
| |
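The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and run a basic completeness check on a CDS. The helper names are hypothetical. The stop codons listed are those of translation table 11. Checking only for an `ATG` start is a simplification, since table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str,
                            stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognized stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in stop_codons
    )


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full "Gene sequence" field to analyze the whole CDS.
prefix = clean_sequence("ATGAAAAAGA CGGGATCTCT")
print(len(prefix))  # 20
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field of the record, and `looks_like_complete_cds` with the "Is pseudo gene" field.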