Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2039 |
Symbol | |
ID | 7316428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 2167678 |
End bp | 2168886 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643616931 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002514106 |
Protein GI | 220935207 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACTC AGCTTGCCGA TTTCATCAAG GACACCCACC AGGGCCGCGA GGCCGAGCAG ATCCTGCGCG CCTGCGTGCA CTGCGGCTTC TGCAATGCCA CCTGTCCCAC CTACCAGCTG CTGGGCGATG AACTCGACGG TCCGCGCGGC CGCATCTACC TGATGAAGCA GATGCTGGAA GGCCACACAC CCACCGGCAA GACCCTGAGC CACCTGGACC GCTGCCTCAG CTGTCGCAAC TGCGAGACCA CCTGCCCCTC GGGTGTGCGC TATGGCCGCC TGGTGGAGAT CGGCCGCGAG GTGATCGAGC AGCAGGTGCA ACGCTCCCTG AGCGAGCGCC TGCTGCGCGG CGCCCTGCGG CGCTTCCTGC GCTCGCCGCT GTTCGGCGTG AGCCTCGCCC TGGGCCGCCT GGCGCGTCCG GTGCTGCCCG CGTCCCTGCG CCGCCGTGTC CCCGAATCCC GGCCCGCGCC CCGGCGCCCG GGCCGGCAGC ACGCCCGGCG CATGCTGTTG CTGGAGGGCT GCGTGCAGCC CGCCCTGTCC CCGGACATCA ACGCCTCGGC CGCCCGGGTG CTGGACCGCC TGGGTATCGA ACTGGTGAGT GTCCCGGCCG CCGGCTGCTG CGGCGCCATC GACCAGCACC TGGGGGCCTT CGAGGCGGCC CGCGCACAGA TCCGACGCAA CATCGACGCC TGGTGGCCCC AGGTGCAGGC CGGCGCCGAG GCCATCGTGA TGACCGCCAG CGGCTGCGGC GCGCAGGTCA AGGACTACGG CGCCCTGCTG GCCGACGATC CCGCGTATGC GGAAAAGGCG GCGCACATCG CGGCCCTGAC CCGGGATCTG AGCCAGGTGC TGGCCGAGGC CGACCTGTCC GGTTTCCGGG TGTCCCGGCG GCGCATCGCC TTCCATCCGC CCTGCACCCT CCAGCACGGC CAGAAGCTGC GCGGCGTGGT GGAGGGCATC CTGACACGCC TCGGTTTCGA ACTCTTGCCG GTGGCGGACA GCCACCTGTG CTGCGGCTCT GCCGGCACCT ATTCGATCCT GCAGGGCGAC ATCGCGGGGC GCCTGCGCGA GGACAAGCTC CACAAGCTGC AGGCCAACGG CCCCGAACTC ATCGCCACCG CCAACATCGG CTGTCAGACC CACCTGGCCT CGGGGAGCGA CGTCCCCGTG GTGCACTGGA TCACGCTGCT TGACCCACCG GCGAACTGA
|
Protein sequence | MQTQLADFIK DTHQGREAEQ ILRACVHCGF CNATCPTYQL LGDELDGPRG RIYLMKQMLE GHTPTGKTLS HLDRCLSCRN CETTCPSGVR YGRLVEIGRE VIEQQVQRSL SERLLRGALR RFLRSPLFGV SLALGRLARP VLPASLRRRV PESRPAPRRP GRQHARRMLL LEGCVQPALS PDINASAARV LDRLGIELVS VPAAGCCGAI DQHLGAFEAA RAQIRRNIDA WWPQVQAGAE AIVMTASGCG AQVKDYGALL ADDPAYAEKA AHIAALTRDL SQVLAEADLS GFRVSRRRIA FHPPCTLQHG QKLRGVVEGI LTRLGFELLP VADSHLCCGS AGTYSILQGD IAGRLREDKL HKLQANGPEL IATANIGCQT HLASGSDVPV VHWITLLDPP AN
|
| |