Gene Tgr7_2039 details


Gene Information

Locus tag: Tgr7_2039
Symbol:
ID: 7316428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 2167678
End bp: 2168886
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID: 643616931
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_002514106
Protein GI: 220935207
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
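
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2167678
END_BP = 2168886


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1209
print(protein_length(gene_length(START_BP, END_BP)))  # 402
```

Both results agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields.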


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACTC AGCTTGCCGA TTTCATCAAG GACACCCACC AGGGCCGCGA GGCCGAGCAG 
ATCCTGCGCG CCTGCGTGCA CTGCGGCTTC TGCAATGCCA CCTGTCCCAC CTACCAGCTG
CTGGGCGATG AACTCGACGG TCCGCGCGGC CGCATCTACC TGATGAAGCA GATGCTGGAA
GGCCACACAC CCACCGGCAA GACCCTGAGC CACCTGGACC GCTGCCTCAG CTGTCGCAAC
TGCGAGACCA CCTGCCCCTC GGGTGTGCGC TATGGCCGCC TGGTGGAGAT CGGCCGCGAG
GTGATCGAGC AGCAGGTGCA ACGCTCCCTG AGCGAGCGCC TGCTGCGCGG CGCCCTGCGG
CGCTTCCTGC GCTCGCCGCT GTTCGGCGTG AGCCTCGCCC TGGGCCGCCT GGCGCGTCCG
GTGCTGCCCG CGTCCCTGCG CCGCCGTGTC CCCGAATCCC GGCCCGCGCC CCGGCGCCCG
GGCCGGCAGC ACGCCCGGCG CATGCTGTTG CTGGAGGGCT GCGTGCAGCC CGCCCTGTCC
CCGGACATCA ACGCCTCGGC CGCCCGGGTG CTGGACCGCC TGGGTATCGA ACTGGTGAGT
GTCCCGGCCG CCGGCTGCTG CGGCGCCATC GACCAGCACC TGGGGGCCTT CGAGGCGGCC
CGCGCACAGA TCCGACGCAA CATCGACGCC TGGTGGCCCC AGGTGCAGGC CGGCGCCGAG
GCCATCGTGA TGACCGCCAG CGGCTGCGGC GCGCAGGTCA AGGACTACGG CGCCCTGCTG
GCCGACGATC CCGCGTATGC GGAAAAGGCG GCGCACATCG CGGCCCTGAC CCGGGATCTG
AGCCAGGTGC TGGCCGAGGC CGACCTGTCC GGTTTCCGGG TGTCCCGGCG GCGCATCGCC
TTCCATCCGC CCTGCACCCT CCAGCACGGC CAGAAGCTGC GCGGCGTGGT GGAGGGCATC
CTGACACGCC TCGGTTTCGA ACTCTTGCCG GTGGCGGACA GCCACCTGTG CTGCGGCTCT
GCCGGCACCT ATTCGATCCT GCAGGGCGAC ATCGCGGGGC GCCTGCGCGA GGACAAGCTC
CACAAGCTGC AGGCCAACGG CCCCGAACTC ATCGCCACCG CCAACATCGG CTGTCAGACC
CACCTGGCCT CGGGGAGCGA CGTCCCCGTG GTGCACTGGA TCACGCTGCT TGACCCACCG
GCGAACTGA
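
The gene sequence is displayed in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be removed before the sequence is measured. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical. Only the first row of the sequence is used as sample input, so the value it prints is for that row alone and is not expected to match the 71% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, used as sample input.
first_row = "ATGCAGACTC AGCTTGCCGA TTTCATCAAG GACACCCACC AGGGCCGCGA GGCCGAGCAG"

seq = clean_sequence(first_row)
print(len(seq))                          # number of bases in the row
print(round(100 * gc_fraction(seq), 1))  # GC percentage of this row only
```

Pasting all rows of the gene sequence into `clean_sequence` gives a string whose length and GC percentage can be compared with the Gene Length and GC content fields.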
 
Protein sequence
MQTQLADFIK DTHQGREAEQ ILRACVHCGF CNATCPTYQL LGDELDGPRG RIYLMKQMLE 
GHTPTGKTLS HLDRCLSCRN CETTCPSGVR YGRLVEIGRE VIEQQVQRSL SERLLRGALR
RFLRSPLFGV SLALGRLARP VLPASLRRRV PESRPAPRRP GRQHARRMLL LEGCVQPALS
PDINASAARV LDRLGIELVS VPAAGCCGAI DQHLGAFEAA RAQIRRNIDA WWPQVQAGAE
AIVMTASGCG AQVKDYGALL ADDPAYAEKA AHIAALTRDL SQVLAEADLS GFRVSRRRIA
FHPPCTLQHG QKLRGVVEGI LTRLGFELLP VADSHLCCGS AGTYSILQGD IAGRLREDKL
HKLQANGPEL IATANIGCQT HLASGSDVPV VHWITLLDPP AN
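
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method used by the database: it builds the codon table from the compact amino-acid string used for the standard code, which assigns the same amino acids as translation table 11, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final row of the gene sequence above.
print(translate("ATGCAGACTC AGCTTGCCGA TTTCATCAAG"))  # MQTQLADFIK
print(translate("GCGAACTGA"))                         # AN
```

The two outputs match the first block and the last two residues of the protein sequence listed above.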