Gene Tgr7_1848 details

Gene Information

Locus tag: Tgr7_1848
Symbol:
ID: 7315178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 1962835
End bp: 1964205
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 68%
IMG OID: 643616739
Product: fumarate lyase
Protein accession: YP_002513916
Protein GI: 220935017
COG category: [C] Energy production and conversion
COG ID: [COG0114] Fumarase
TIGRFAM ID: [TIGR00979] fumarate hydratase, class II
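
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1962835
end_bp = 1964205
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the terminal stop codon,
# so the number of amino acids is one less than the codon count.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print(span_bp == gene_length_bp)         # True
print(remainder == 0)                    # True
print(expected_aa == protein_length_aa)  # True
```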


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGACC AGCAGACCCG AACCGAACGC GACAGCATGG GCACGGTGGA GGTCCCCGCG 
GATGCGCTCT ACGGGGCCCA GACCCAGCGG GCAGTGGACA ACTTCCCGGT CAGTGGTCTG
CCCATGCCTC CCGGTTTCAT TCATGCCCTG GGGCACATCA AGGGTGCCTG TGCCCGGGCC
AACGCGGCCC TGGGCGGCCT GGACGCGGAC GTGGCCAAGG CCATCGATGC CGCCGCCCGG
GAGGTGGCCG AGGGCGGTCA CGACCACCAG TTCCCGGTGG ACGTATTCCA GACCGGTTCC
GGCACCAGTT CCAACATGAA CGTCAACGAG GTGATCGCCC GGCTGGCCAG CCAGCGCCTG
GGCAAGCCCG TGCATCCCAA TGACCACGTC AACCGGGGCC AGAGCTCCAA TGACGTGGTG
CCCACCGCCA TCCACGTCTC TGCCCGCCTG GCCCTGGTCA ATCACCTGCT GCCGTCCCTG
GATCACCTGG CCCTGACCCT GGAGCGTCGC GCCAGCGAAC TGCGCGACGT GGTCAAGACC
GGCCGCACCC ACCTGATGGA TGCCATGCCC GTGACCCTGG GCCAGGAACT GGGCGGCTGG
GCCCGGCAGG TGCGCAACGG CCTGGCGCGC CTGGAGCGCA GCGGCGAGGG CCTGCTGGAG
CTGGCCCTGG GCGGCACCGC CGTGGGCACC GGCGTCAACG CCGAGCCCGG CTTTGCCAAA
CTGGTGGCGG AGGAACTGCA GCAGACCACC GGCGAGATCT TCCGCAGCAA GCCGGATTTC
TTCGAGGGCC TGAGTGCCCA GGACACGGCG GTGGAGATGA GCGGCCAGCT GCGCACCGTC
GCCGTGAGCC TGATGAAGAT CGCCAACGAC CTGCGCTGGA TGAACTCCGG CCCCCTGGCG
GGGCTCGGCG AGATCAGCCT GCCGTCCCTG CAGCCCGGCA GCAGCATCAT GCCCGGCAAG
GTGAATCCGG TGATCCCGGA GTCCGTGGCC ATGGTCTGCG CCCAGGTGAT GGGCAATGAC
GTGACCGTGA CCGTGGCCGG CCAGTCCGGC AGCTTCCAGT TGAACGTGAT GCTGCCGGTG
ATCGCGCTGA ACCTCTTGCA GAGCACCGAG CTGCTGGCCA ATGCGGCCCG CCTGCTGGCG
GACCGGGCCA TCGCCGGCTT CACGGTCAAC GAGGAACGCA TCCGCGAGGC CCTGGACCGC
AACCCCATCC TGGTCACGGC ACTCAACCCC ATCATCGGCT ACGAGAAGGG CGCGGCCATC
GCCAAGAAGG CCTATGCCCA GGGCCGGCCG GTGCTGGACG TGGCGCTGGA GGAGACGGAT
CTTTCGGAAG AGGAACTGCG CCGGCTGCTG GATCCGGGTA AGCTCGTTTA G
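
The record reports a GC content of 68% for this gene. The sketch below shows one way such a figure might be computed from a sequence printed in blocks of ten bases; it assumes GC content is simply the share of G and C bases over the total length, which may not be exactly how the database derives its value. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any tool referenced by the record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = "ATGAGTGACC AGCAGACCCG AACCGAACGC GACAGCATGG GCACGGTGGA GGTCCCCGCG"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the full gene, paste all sequence lines into one string and pass it through the same two functions.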
 
Protein sequence
MSDQQTRTER DSMGTVEVPA DALYGAQTQR AVDNFPVSGL PMPPGFIHAL GHIKGACARA 
NAALGGLDAD VAKAIDAAAR EVAEGGHDHQ FPVDVFQTGS GTSSNMNVNE VIARLASQRL
GKPVHPNDHV NRGQSSNDVV PTAIHVSARL ALVNHLLPSL DHLALTLERR ASELRDVVKT
GRTHLMDAMP VTLGQELGGW ARQVRNGLAR LERSGEGLLE LALGGTAVGT GVNAEPGFAK
LVAEELQQTT GEIFRSKPDF FEGLSAQDTA VEMSGQLRTV AVSLMKIAND LRWMNSGPLA
GLGEISLPSL QPGSSIMPGK VNPVIPESVA MVCAQVMGND VTVTVAGQSG SFQLNVMLPV
IALNLLQSTE LLANAARLLA DRAIAGFTVN EERIREALDR NPILVTALNP IIGYEKGAAI
AKKAYAQGRP VLDVALEETD LSEEELRRLL DPGKLV
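
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure. It relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard genetic code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI translation table 11
# assigns the same amino acids as the standard table; the two differ only
# in which codons may serve as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAGTGACC AGCAGACCCG AACCGAACGC"))  # MSDQQTRTER
print(translate("GATCCGGGTA AGCTCGTTTA G"))           # DPGKLV
```

The two outputs match the first ten and last six residues of the protein sequence listed above; the final TAG codon is the stop and is not translated.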