Gene Sfum_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3801 
Symbol 
ID4457866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4646924 
End bp4648447 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content63% 
IMG OID639704574 
Producthypothetical protein 
Protein accessionYP_847905 
Protein GI116751218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTAA CCGACAAGGA GCGCAAACTC GCGGCGACCC TGAGCGATCC CGTGTTGTGG 
GGGCAAGCCT ACCTCTACAA CCGGGATGGC TCAGGCCGCG ACTACTGGCC GCACCAGGTG
GAGGACCTGC GCTGCCCGGC CAAGAACATC ATCCACCTCG ACGGCCGGGA CGTGGGAAAG
TCCATCGTGC TCTCGACCGA CGCGCTCCAT TACGCCTTCA CCACCCGGGG CGGCCAGGGC
CTCATCGCGG CTCCGCACCA GGGGCACCTC GACACCATCA TCGAGGAGAT CGAGTTCCAG
CTCGACAGCA ACCCGGATCT GATGAACAGC ATCGCCCTGA CCAAGTACGG CAAGCCCAAG
ATCCACCGCA AACCCTACTT CCGGCTGGAG TTCACCAACG GTTCGGTGCT CTATTTCCGC
CCGGCCGGGG CTTATGGCGA CGCCTTCCGG TCCCTGCACG TGGGCCGCGT CTGGGTCGAT
GAAGGAGCCT GGCTGACCGA ACGGGCCTGG AAGGCGCTGC GCCAGTGCCT CAAGGCCGGG
GGGACGCTAC GCATCTACTC CACGCCCAAC GGCCTGCGCG ACACCACCTA TTACCGGCTC
ACCTCGTCGG ACCAGTTCCA TGTGTTCCGC TGGCCGTCCT GGCTCAACCC CCTGTGGACC
GAGGATCGCG AGGCCGAACT GCTGGAGTTC TACGGCGGCC GCGACAGCTC CGGCTGGCAG
CACGAGGTGG CCGGTGAACA CGGCAAGCCC TCCTATGGGG CCTTCAATGT CGAACAGTTC
AACCTCTGTC GGCAGGATCT GCTGGAGTAC CAGAAGATCG TCATCACCGA TTCCGAGCTG
CGCGATTGCG ACACCGAGGA AGCGGCCCAC GACCGGCTGG AGATGCTGCT CAACCTCACT
CCCCGCAGCG GGCAGTTCTG GGTCGGCGGC GACTTGGGCT ACACCAACGA CCCCACCGAG
ATCGTCGTAT TCCAGGAGAC GGAAATCGGC GAGCGGACGC TGCTGAAGAT GATCCTGCGC
GTCCATCTCG AACACGTATC CTATCCGCAC ATCGCCCAGA TCATCGCGCT GCTGGAGCGT
TACTACACCC CGGCGGGCAT CGGCGTGGAT AATGGCGGCA ACGGTCTGGC CGTGGTTCAG
GAGCTGCTCA CCCTGGACAA GTACAAAGGG CTGGAGCTGG AAGGCAGGCT CAAGGGATAC
GACTTCGGCG GCATGACCCG GCTGGCGGTG CGGGACGGCA AGGAAATCAA GAAACGGACC
AAGGAGCTGA TGACCAGCCT CATCAACGGG GCGCTGCAAC GCAAGCAGCT CATTTTCCCC
TCGGACGACC TAGAGGTGGA AGACCAGTTC ACCACCCACA CCTACACCCT GCGGGACGGC
AAGATCATCT ATTCCAAGGG CAACGACCAC ATCATCGACG CGGTGCGCTG CGCCATGCTG
ATCCGGGAGG AAGGCAACCT CGACCCGGTC GGTGAAGAGG TGGTCTCCCT CAAGCCGGTG
CTCACCAATC CGGTCTTCAT CTGA
 
Protein sequence
MAVTDKERKL AATLSDPVLW GQAYLYNRDG SGRDYWPHQV EDLRCPAKNI IHLDGRDVGK 
SIVLSTDALH YAFTTRGGQG LIAAPHQGHL DTIIEEIEFQ LDSNPDLMNS IALTKYGKPK
IHRKPYFRLE FTNGSVLYFR PAGAYGDAFR SLHVGRVWVD EGAWLTERAW KALRQCLKAG
GTLRIYSTPN GLRDTTYYRL TSSDQFHVFR WPSWLNPLWT EDREAELLEF YGGRDSSGWQ
HEVAGEHGKP SYGAFNVEQF NLCRQDLLEY QKIVITDSEL RDCDTEEAAH DRLEMLLNLT
PRSGQFWVGG DLGYTNDPTE IVVFQETEIG ERTLLKMILR VHLEHVSYPH IAQIIALLER
YYTPAGIGVD NGGNGLAVVQ ELLTLDKYKG LELEGRLKGY DFGGMTRLAV RDGKEIKKRT
KELMTSLING ALQRKQLIFP SDDLEVEDQF TTHTYTLRDG KIIYSKGNDH IIDAVRCAML
IREEGNLDPV GEEVVSLKPV LTNPVFI