Gene Sfum_1037 details

Gene Information

Locus tag: Sfum_1037
Symbol:
ID: 4460940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1281567
End bp: 1283498
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 62%
IMG OID: 639701801
Product: extracellular ligand-binding receptor
Protein accession: YP_845166
Protein GI: 116748479
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
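
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 1281567
END_BP = 1283498
GENE_LENGTH_BP = 1932
PROTEIN_LENGTH_AA = 643


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Both checks hold for this record: 1283498 - 1281567 + 1 = 1932,
# and 1932 / 3 - 1 = 643.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The same two checks apply to any unspliced, non-pseudo CDS record that reports these four fields.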


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000129321
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000121148
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAACGGA TTAGCAGATA CTCCCGGATT TTGCTGCTGG TGGTAACGGT TGCATTCCTT 
GCCGGATGTC CCGGTTCCCA GCAACCTGCC GAACAGGCTC CGAGTCGCCC GTCTCTCACC
GGGACGACGC CTCCGGACGC CGAAAAGATG GTCCAGCAGG CCGAGCAGGC GCGCAAAAGC
GGAAATATTC CCAAAGCCAT ATCCCTCTGG GAAAAGGTCA TCCAGAAATA TCCCGGCCAT
GCCGTCGCGG CACGGGGATT TTCCGTGGTC GGCAACCTTT ACCTGGCCCA GGGACAACCG
GATCGCGCCC TGCAGTATTT CGACTACCTG CTTTACACCT ATCCCAACTG GGACGGAATC
GGCCAGGCAC GGCTGGATCG ACTCCGCGCG CTTGCCGCCG CCGGCAAGAA GAAACAGGCC
ATGAAGGAAG CGGTGCCGCT GTGGGAGACT TCGACGTCCC AACCTGATTT GCAGGTGGGC
CTTGCGTCCT TCATGGCCGG GCTCTACGGC GCCGAAGGGG ATATCGAGAC GGGCTGGGAC
TGGGCCGGTT CGGGCTTCCC GGCGGCCTCA ACCCCGGAGC AGAAGAAAAC GCTCACCCGG
GCCACCGTCG ATTTGCTGAA AAATGCGGAC GAGGGGCAGG TGAAGAGGCT GTATAAAAAG
ACTCCTTCGG ATTTCATGAA GGTCTTTCTG GATTTTCGGA TGGCCCAGCT GGAAATGCAG
AAAGGGCGCC AGGACCCGGC GCGAGCCCGC CTTAAGGAGC TTCTCGCCCG GAACGGGACT
CACCCGCTGG TCCCGGAGAT ACAGGCGGCC ATCCGCGGCA CCCGCCCGGA AGGGGCTGAA
TATGCCGTCA ATGCCGATCG CATCGGAGTG CTCATTCCAC TCAACGGTTC ATTTGCCAGG
TACGGCGACA TGGTGCTCAA AGGCCTGACG CTCGCGAATT CGGACTGGAG CGAGCGATAC
CCGGGCCAGC AGGTGTCGCT CGTGGTCAAG GACGCCCAGG CCGATGCAGC CGTCACCACC
CGGTCTTTCG AGGAAATGGT GAAAAAGGAC GGCGTCCTTG CCGTGATCGG TCCCCTTGGG
GCGCAGGCGG CCAAAGAGGT GGCGCCGCTC GCGGACCGCT ACGGGGTGCC CGTCCTGACC
ATGACGCAAA AGGATGACGA GGGCGCGGCG AGTTCCTTCG TCATTCACAT CTTCCTGGAC
AGTCGCGAGA TCGTGCGATC CGTCGTAAAG CACTGCCGCG ACAAACTCGG TCACACCCGT
TTCGCGGCGC TCTATCCGGA CGACCGCTAC GGGCAGAAGC TGGCGAAGAT TTTCTCCGAA
GTCGTTCCGG AACTGGGCGG CCAGGTTATG GCCAGTGTTT CCTACAAGGA AAAAACGACG
GATTTCAAGG AGTCGCTGCA AAAGCTCATC ACCATCGCCA AGAAGAACCA GCCCCCGACC
GGCGTCGAAA CCACGCCGTT CGATGCGCTG TTCATTCCCG ACCAGGTGCA GTCGGTATCG
CTCATCGCCC CCCAACTCCC ATACAACAAC GTGGTGGGCG CCACTCTGCT CGGGACCAAC
CTGTGGAGCG AAGCACCGCT CGTGCAGGCC GGCGGGGTCT ACATCGAGCA TGCGCTGTTC
GCCACGGCCT ACTACCCCGA AAACCCGAGC TCCCGGGCGA AGGACTTCCG CGAGCGATTT
CAGGAGAAAT TCGGGGCGCC GCCATCCTAC CTGGAAGCCC AGGCCTACGA CGCGCTCATG
CTGGTGCTGC AGGCCCGCAG CGCTTTGCGA TCGACCGGGA TCGACCGCGC ATCCCTCCTG
CAGACCATCA TGACGGCAAA GGGCTTCGAG GGGATCGCGG GGAAGTACTC CTTCTCCCCG
ATGGGAGGCT TGCAGCGGAA CTATTTGCTC CTCCAGGTGC AGGACGGAAA GCTTGTGCAG
ATTGCTCCAT GA
 
Protein sequence
MKRISRYSRI LLLVVTVAFL AGCPGSQQPA EQAPSRPSLT GTTPPDAEKM VQQAEQARKS 
GNIPKAISLW EKVIQKYPGH AVAARGFSVV GNLYLAQGQP DRALQYFDYL LYTYPNWDGI
GQARLDRLRA LAAAGKKKQA MKEAVPLWET STSQPDLQVG LASFMAGLYG AEGDIETGWD
WAGSGFPAAS TPEQKKTLTR ATVDLLKNAD EGQVKRLYKK TPSDFMKVFL DFRMAQLEMQ
KGRQDPARAR LKELLARNGT HPLVPEIQAA IRGTRPEGAE YAVNADRIGV LIPLNGSFAR
YGDMVLKGLT LANSDWSERY PGQQVSLVVK DAQADAAVTT RSFEEMVKKD GVLAVIGPLG
AQAAKEVAPL ADRYGVPVLT MTQKDDEGAA SSFVIHIFLD SREIVRSVVK HCRDKLGHTR
FAALYPDDRY GQKLAKIFSE VVPELGGQVM ASVSYKEKTT DFKESLQKLI TIAKKNQPPT
GVETTPFDAL FIPDQVQSVS LIAPQLPYNN VVGATLLGTN LWSEAPLVQA GGVYIEHALF
ATAYYPENPS SRAKDFRERF QEKFGAPPSY LEAQAYDALM LVLQARSALR STGIDRASLL
QTIMTAKGFE GIAGKYSFSP MGGLQRNYLL LQVQDGKLVQ IAP
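
The sequences above are printed in blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the code below strips the block spacing, computes the G+C fraction, and translates codons up to the first stop. It uses the standard codon assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG. The function names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocked: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(blocked)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final four codons of the gene sequence.
print(translate("ATGAAACGGA TTAGCAGATA CTCCCGGATT"))  # MKRISRYSRI
print(translate("ATTGCTCCAT GA"))                     # IAP
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, and passing it to `translate` gives a string that can be compared with the protein sequence once its block spacing is removed with `clean`.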