Gene SNSL254_A1577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1577 
Symbol 
ID6484400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1541629 
End bp1543275 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content55% 
IMG OID642736963 
Productfumarate hydratase class I, anaerobic 
Protein accessionYP_002040715 
Protein GI194443891 
COG category[C] Energy production and conversion 
COG ID[COG1838] Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain
[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
[TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.551964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACA AACCCTTCTT TTATCAAGAC CCTTTTCCCC TCAAAAAGGA CGATACCGAG 
TACTACTTGT TGACCAGCGA ACATGTTTCC GTCGCCGAAT TCGAAGGACA AGAGATCCTG
AAAGTCGCGC CGGAAGCGCT AACGCTGTTG GCGCGTCAGG CCTTCCACGA TGCGTCTTTT
ATGCTACGTC CCGCGCATCA GCAGCAGGTG GCGGATATTC TGCGCGATCC GCAGGCCAGT
GAAAACGACA AATATGTCGC GCTGCAATTC CTGCGAAACT CCGATATCGC GGCCAAGGGC
GTGTTGCCCA CCTGCCAGGA TACCGGTACG GCCATTATTG TCGGCAAAAA AGGTCAACGC
GTCTGGACTG GTGGAGGTGA TGAAGCGGCG CTGGCTCGCG GCGTATATAA CACCTATATC
GAAGATAACC TGCGCTATTC ACAGAACGCG GCGCTGGATA TGTACAAAGA GGTCAATACC
GGTACTAACC TGCCGGCGCA GATCGATCTC TACAGCGTCG ATGGCGATGA ATATAAATTC
CTCTGTATCG CTAAAGGCGG CGGATCGGCG AATAAAACCT ACCTCTATCA GGAGACAAAA
GCGCTTCTGA CGCCGGGTAA GCTTAAAAAT TATCTCGTCG ATAAGATGCG TACGCTGGGC
ACCGCCGCTT GTCCGCCGTA TCACATCGCG TTTGTCATTG GCGGCACCTC GGCGGAAGCG
AACCTGAAGA CCGTTAAGCT GGCCTCGGCA AAATACTATG ATGCGCTACC AACCGAAGGC
AATGAACACG GGCAGGCCTT CCGCGATATT GAGCTGGAAA AAGAATTGTT GCTTGAGGCG
CAAAATCTTG GCTTAGGCGC GCAGTTCGGC GGCAAGTATT TCGCTCACGA TATTCGTGTC
ATCCGATTGC CGCGCCATGG GGCATCGTGT CCGGTAGGTA TGGGCGTGTC CTGCTCCGCG
GATCGAAACA TTAAAGCGAA GATCAACCGG GAAGGGATCT GGATCGAGAA GCTGGAGCGC
AATCCCGGTA AATATATTCC AGAGGCGCTG CGCCAGGCGG GAGAGGGCGA GGCGGTGCGC
GTTGATCTTA ACCGTCCCAT GAGCGAGATA CTGCAACAAC TGTCGCAGTA TCCGGTATCA
ACGCGCCTGT CGCTGAACGG CACGATTATT GTGGGCCGCG ACATCGCGCA CGCTAAACTG
AAAGAACGGA TGGACAGAGG CGAAGGTTTG CCGCAGTACA TCAAAAATCA TCCTATCTAT
TATGCGGGGC CGGCAAAAAC GCCGGAAGGG TATGCCTCCG GGTCGCTTGG GCCGACTACC
GCAGGACGAA TGGACTCTTA TGTTGATCAG CTCCAGTCGC AGGGCGGCAG TATGATCATG
CTGGCGAAAG GCAACCGCAG CCAGCAGGTG ACCGATGCCT GTAAGAAACA TGGCGGCTTC
TACCTGGGCA GTATCGGCGG CCCGGCAGCC GTTCTGGCGC AGGGTAGTAT CAAGCGCCTG
GAGTGCGTGG AATATCCTGA ACTGGGTATG GAGGCTATCT GGAAAATTGA AGTGGAGGAT
TTTCCGGCCT TCATTCTGGT GGATGATAAA GGCAACGATT TCTTCCAGCA GATTCAGTCA
TCGCAGTGCG CGCGCTGCGT TAAGTAA
 
Protein sequence
MSNKPFFYQD PFPLKKDDTE YYLLTSEHVS VAEFEGQEIL KVAPEALTLL ARQAFHDASF 
MLRPAHQQQV ADILRDPQAS ENDKYVALQF LRNSDIAAKG VLPTCQDTGT AIIVGKKGQR
VWTGGGDEAA LARGVYNTYI EDNLRYSQNA ALDMYKEVNT GTNLPAQIDL YSVDGDEYKF
LCIAKGGGSA NKTYLYQETK ALLTPGKLKN YLVDKMRTLG TAACPPYHIA FVIGGTSAEA
NLKTVKLASA KYYDALPTEG NEHGQAFRDI ELEKELLLEA QNLGLGAQFG GKYFAHDIRV
IRLPRHGASC PVGMGVSCSA DRNIKAKINR EGIWIEKLER NPGKYIPEAL RQAGEGEAVR
VDLNRPMSEI LQQLSQYPVS TRLSLNGTII VGRDIAHAKL KERMDRGEGL PQYIKNHPIY
YAGPAKTPEG YASGSLGPTT AGRMDSYVDQ LQSQGGSMIM LAKGNRSQQV TDACKKHGGF
YLGSIGGPAA VLAQGSIKRL ECVEYPELGM EAIWKIEVED FPAFILVDDK GNDFFQQIQS
SQCARCVK