Gene Sfum_2116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2116 
Symbol 
ID4459575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2588361 
End bp2589527 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content57% 
IMG OID639702883 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_846234 
Protein GI116749547 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000848316 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000966557 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAGGAAA GAGGACCGTG GCCCCCGGAC GGACCGGCCA CGCGGCAACC GAACAAAATG 
AAGCCTTCTG TACCCGAACA TATCGCTTCG CTGGTCCCAT ACCCGCCGGG CAAACCGATC
GAAGAACTCG AACGCGAGTA CGGGATCACC GATTCCATCA AGCTGGCCAG TAACGAGAAC
CCTCTCGGTC CATCCCCGAA GGCGATGCAA GCCGTCACGG GAGCCCTGTC CCGATTGCAT
CGCTATCCGG ACGGAAGCGG ATACTATCTC AGGAAGCGGT TGAGTGGAAA ATATGCGCTC
CCATTCGACG GCATCGTCCT CGGAAACGGG TCCAACGAGA TCATCGAGCT CGCTATTCGC
GCTTTCCTGG TTCCCGACGA CGAAGTCATC ATGCCCGCCC CCTCCTTTCT GGTTTACAAG
CTCGCAGTCC AGACCATGGG AGGCAAGGCG ATCCACATTC CGCTGAAGCG CTTCGCCATT
GACCTCGAAA AAACCGCCGG AGCCGTCACG CCCCGGACGA AGATGATATT TGTCAACAAT
CCCAACAATC CGACCGGTAC TCTCATTTCG AAACGGGATT TCGACGTCTT TCTGGACCGC
ATCCCTCCCG AAATCGTCGT CGTCCTCGAC GAAGCTTACA TCGAATTCGC CCGGGACCCC
GATACCCCGA ACGGTTTTGA CTACATCGAC CGCCAGGGGC CGTTCGTCAT TGTTTTGAGA
ACCTTTTCCA AGGCATACGG ACTGGCCGGC CTGCGCATCG GTTTCGGAGC GATGAACCCT
TTCCTTGCCG ATTATCTGCA CCGCGTGCGG CAGCCCTTCA ACACAGGAAC TCTGGCGCAA
ATCGCCGCGT TGGCGGCACT GGACGATGAG GATTTCCTGC ACAGGACCCA GAGGGTGGTC
TGGGACGGCT TGCAATATCT CTACCGGGAA GTGGAACGGC TGCGGTTGAA CTATCTTCCG
ACGGAGGCCA ATTTTTTCCT CATCGAGGTC CCGGGACCCG CCAAGTGGTT CTACGAGGCG
ATGCTGCGGC AGGGCGTGAT CGTCCGCGCC ATGAGCTCCT ACGGCATGGA CAATCATATT
CGAATCAATG CGGGTCTTCC CGAGGAAAAT GAGCGCTTCA TAAGAACGTT GAGCGACACG
CTGGTTCAGT TTAGGGCATC TCTTTGA
 
Protein sequence
MKERGPWPPD GPATRQPNKM KPSVPEHIAS LVPYPPGKPI EELEREYGIT DSIKLASNEN 
PLGPSPKAMQ AVTGALSRLH RYPDGSGYYL RKRLSGKYAL PFDGIVLGNG SNEIIELAIR
AFLVPDDEVI MPAPSFLVYK LAVQTMGGKA IHIPLKRFAI DLEKTAGAVT PRTKMIFVNN
PNNPTGTLIS KRDFDVFLDR IPPEIVVVLD EAYIEFARDP DTPNGFDYID RQGPFVIVLR
TFSKAYGLAG LRIGFGAMNP FLADYLHRVR QPFNTGTLAQ IAALAALDDE DFLHRTQRVV
WDGLQYLYRE VERLRLNYLP TEANFFLIEV PGPAKWFYEA MLRQGVIVRA MSSYGMDNHI
RINAGLPEEN ERFIRTLSDT LVQFRASL