Gene Sfum_2015 details


Gene Information

Locus tag: Sfum_2015
Symbol:
ID: 4459681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 2466341
End bp: 2467462
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 59%
IMG OID: 639702781
Product: histidinol-phosphate aminotransferase
Protein accession: YP_846133
Protein GI: 116749446
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
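
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2466341, 2467462)
print(length)                      # 1122
print(protein_length_aa(length))   # 373
```

Both values agree with the Gene Length (1122 bp) and Protein Length (373 aa) reported in the record.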


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.788991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.413564
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACT TCCCTCCCCT GGACAGGCTC ACACCCGAAT ACATCAGAAA CTTCGAACCC 
TACATCCCGA GCAAGCCCGA CGAGGAACTG AAAAGACTCT ATGGTTGCGA GCGGCTTTTT
CGCTTGAACA ACAACGAAAA CCCCTTGGGG CCTCCTCCCG CCGCACGGCG GGTGATCCGG
GAGTTCCCGC CGCCGCGGGC ATCCGTCTAT CCCAGCGGGG ATGCCTACTA CCTGCGGTTG
AAGCTTGCCG AAAAGTTCGA CATGCATCCG GATCAGTTCC TGGTCGGGAA CGGAGCCAAC
GAAGTCATCT CCTTTGCGAT CAAGGCATTC TGCGAGGCGG GGGACAATAT CGTCACAGCG
GACAAAACCT TTGCGGTCTA CGAGTGGGTG GCCACCTTTT CCGGATTCGG CGCGCACCTC
GTTCCGCTCG CGGACTTCGG ATTCGACGCG GAAGGGATGC TCCGGGCGAT GGACGACCGC
ACCAAGATCC TGTTTGTATG CAATCCCAAC AATCCCACAG GGAGCATCTG GAAGAGGGGT
ATGCTGCGTG GTTTCCTGGA TCGCGTGGCA GGGAGCCGGA TTGTTGTCGT TGACGAAGCA
TACGCGGAAT TCGTGGAAGA TCCGGAATTC CAGAATGCCA TGGACCTGAT CCCGGAATAT
CCCAACCTCG TCGTGTTCAG AACCTTTTCC AAGATGTATG CCCTGGCGGG GCTGCGCATC
GGGTACCTGG CAGGGGCGAT GGAAGTGGTC GACGTCATTC GAAGGACCTG CGTCGTCTAC
TCCGTCAATG TGCTGGCGCA ACTCGCCGCC CTGGCGGCCA TCGAGGAATG CGCGGAACAC
ATCGAACGCA CGCGGGAGCT GGTGCGGAAG GGGAAGTCCT TTCTCGTACG GGAAATCGGG
GCGCTGGGAC TGGAGTACGT TTCCGGCGAG GGGAACTTCG TCATGCTCAA ACTGCCCATG
AATGACGGTC TGGCCTATCG CAAGCTCATG ACTCGGGGCG TCATGATCCG CAGCATGACC
GGGTTCCGTT TTCCCAACTG GATCCGGGTG ACGGTTTCCA CGGATGAAGC CATGGAGTGC
TTCATCGAGG CATTGACCGA AGCGCTCGGA GAACGCGGGT GA
 
Protein sequence
MSNFPPLDRL TPEYIRNFEP YIPSKPDEEL KRLYGCERLF RLNNNENPLG PPPAARRVIR 
EFPPPRASVY PSGDAYYLRL KLAEKFDMHP DQFLVGNGAN EVISFAIKAF CEAGDNIVTA
DKTFAVYEWV ATFSGFGAHL VPLADFGFDA EGMLRAMDDR TKILFVCNPN NPTGSIWKRG
MLRGFLDRVA GSRIVVVDEA YAEFVEDPEF QNAMDLIPEY PNLVVFRTFS KMYALAGLRI
GYLAGAMEVV DVIRRTCVVY SVNVLAQLAA LAAIEECAEH IERTRELVRK GKSFLVREIG
ALGLEYVSGE GNFVMLKLPM NDGLAYRKLM TRGVMIRSMT GFRFPNWIRV TVSTDEAMEC
FIEALTEALG ERG
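
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to do that and to compute a GC percentage for comparison with the reported GC content. All function names are hypothetical. The completeness check is deliberately simplified: it only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
# Stop codons used by translation table 11 (the standard TAA, TAG, TGA).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: ATG start, stop codon at the end, length divisible by 3."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAGCAACT TCCCTCCCCT\n")
print(fragment)
print(gc_percent(fragment))
```

Passing the full gene sequence through clean_sequence and gc_percent gives a value that can be compared against the GC content field in the record.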