Gene Sfum_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1889 
Symbol 
ID4459795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2303861 
End bp2306881 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content61% 
IMG OID639702656 
Producthypothetical protein 
Protein accessionYP_846009 
Protein GI116749322 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.405118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.121372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAA ACAGGATCCA GATTGTGATC GTTGCCGATT CCAGTAAGGC CGTGAAGAAC 
ATACGCATGG TCTCCGACGA ATTCGATCGG TTCAAATCCT CCGCCTGCAA CTCCATTGCG
GGCACAAATA GCGCGATTTC GTCGATGGGC GCCGGCCTCG CCTCGGCCAT CAGCCTTTTG
CAGGGCTTCG CTGCCGCTAT GGCGGCGGCG AAAGCCGCTG AGATCGGTCT TGGCTTCAAC
AAAGAGGTCG AGGATGCGCG CCTCGGCATA GCGACCCTCC TGGCGGCACA AGGCCAATTT
GTCGATCAAC AGGGCCAACA ATTGCAGGGG CAGCAAAGGA TAAACGCGGC CATGGCCTAT
TCGGCCGATC TCACCAGGCA GCTCCAGGTC GACAACCTCA AGACGACGGC CACGTTTCAG
CAATTGCTCA AGGCGTTTCA GCAGACGCTC TCGCCGGGTT TGTCCGAGGG ACTCTCCGTC
GACCAGGTGC GACAATACAC CCTGGCGATG GTGCAGGCCG CCTCCGCCAT GCAAATCCCT
CTCGACATGA TGGCGGAAGA GACCCGGTCC CTGCTCAAGG GGACGATCAC TCCGCGCAAT
ACTCTTATCG CGACCGCCCT CGGCATCACA CCGGAGGATA TTCGAAAACT CCAGGGGGAC
GCGGAAGGCC TGTTTACCTT CATCATGGGC AAGCTGCAGG CGTTCCAGGA TTTCGGCCAG
GTCACCCAGA CGACTTATTC GGGGCTCCTC AGCAATGCCC AGGATGCCCT GGCAAACGTT
CTGGGCAAAG CCACCGAACC ATTTTTCGAG AGCCTCAAAG CGAGCCTCAA GCGATTCACC
GACTATGCGC TCCAGGTCGA TGCGGTCACC GGGAATTTGA AACTCAACCC GGAACTGGTG
AACGCCTTCG AGCTCCTGAA CGACACTCTG CGCATGGCCA TTGCCCTTGC GGAAAGCCTT
GCAAAAACGC TCGGGGCCGT GGGCATCGCA TACAAGGCGA TGAAGGAAGC TCAAAAGGCG
GCCGCCCTTC AGAATACTCC AGCCGGATTT GCCGGTCCGG AGCTGGCTGA TTTTACCGAC
GTGCAGCGCT TGGAGCGCGT GAAGCAGCTC ATGGCGAAAA TCGTCGAAGA CCAGCAAACC
ATAGCCAACT GGAAAGCCGG AGGCATTGGC GGGTCGATAT TGGGCGCCCC TCAAATCGGC
CTGGCGAATC TTGATATCTC GAATTCGGTC ACCTCCCTTG AGAACTTCCG CGCTCAAATG
GTTGCGGCCG CCCAGAACAC CGAGGAGCTC GACGCCTTCC TCAAAAGCCT CAAGCAGTCT
TCCGACGCGG TTGCCGCGGC ACAGGAAACG GCAGCCGGGA ACATCGGCAG CGTGAGATAT
GAGGCGGCCA GAGCCGCAAT GGACCTGGAC TCGTTCAAGA AGAAGCTCGA AGAAGTCGCC
CGCATAAATC TCACCGGCCT GAGCGACGAT CTCAGCAAGT CCATTGAGGG ATTGAAGGCT
AAGATCGCGG TCGCTCTTCA AGGCGGGGAC CCCAGGCAGC AGGCGATCGC CGCCGCGTAT
ACGGAGAGAA TTCGAAAACA GGCGACAGCC TGGGAACTGG CTCTCAAGGA AAATGCCGCA
CCGGAGGTTT TTTCCGCCAT ATCGGCACAA AAACAACAGG CCGAAGAAGA GGCGTATTGG
GGCAGATATC TGGCCGGCGC CGAAGCCGTC CGCAAAACCA CGGCCCGTGG CGCCGCCTCC
GGCGCATCCC CCATCGACAT CGACAAGATC GACAAGGACG TGCTTCAGTT CAGGGAGCGC
ATGCAAAAGC TCTACTCCGA GCTCGAAGAC CTCGACTCCG AGTACCGCAT CGCACAGCTC
GAACAATCCG GCCGCAACTA CGACGCCGAA GCCGAGCGGA TCGACCGCCA GGCCGCAAAG
CGCAAGGAAG CCTTCGCCAA GGAGGTGGCC GATGCAGAGC AGGCCTATCT CGAAATGGAG
CAAAAACTCT CCGGCAGCCG GGGCGGAACT GCCGAGGCGT GGGCGCAGCT TGCGGATCTC
AAGACGAAAT ACGACGCCTT GAAGAAAGCC GCCCAGGAAT ACGGCGACAA GGTCGACCGC
AATCTGCAAC TCACCAAGGA CATGAAGAAG GCCGAAGACG ACGCGAAACG CGCGGAGGAT
CTCGCCCAAC TGAACCTCGA ATACGGCAAG CTGACCGGAA CCCTCCAGGA GCAGCTCGCC
CTCCAGATTC TCCTGCTCCG GGCCGAAAAG GACCGGAAGA TCCTCGCGGC CGACCCGGAG
CTCCGGGCGG CCTACGAGCG CCTCTATGCC GAGCAGGAAC GGCTGCTCCG CCTCCAGCGC
GACGGCTCCT TTATGGACGG CCTGTCCGAA GGACTCAAGA AATGGCAGCG CGAAATGCCG
ACGGCGTTCG GCCAGGGCCT TGAGGCAATC GAAATGCTGA AGCGCGGCAT CGATTCCGCG
GCCGACGCCC TGGCCGAATT CACCATGACC GGCAAGATGG ACTTTTCGAG CTTTGCCGAC
TCGATCATCA GGGACATCCT CCGCATGCAG TACAAGGCAC TGCTCACGCA AATGTTCGGC
GGTGAAGGCG GTCTCTTCGA TTGGCTAAAG GGACTCTTTG GCGGCGGCGG TCCGACCACC
GGCGCCTACG GGGTCGAGAC CTACGCCGGG ATCGGGCACC GGGGAGGACC GGCCGAAAGC
CTGCCGGACC ACCGATACGT GGCCTCGTAT CTGTTCGCCC GCGCCCCGCG CCTCCACGAC
GGGCTCGCCC CGGATGAGTT CCCGGCCGTC CTGCAGCGGG GCGAGCGGGT GCTCTCGCGC
CGGGAAACCC GCGAATACGG CGCCGCAAAC CGCGCCCCGG AGGTCGTCGT CAACGTCCAA
AACAAGACGA ACACGCCGGT CACGGCCGAT AAGACCCGCG CGGCCTTCGA CGGCAAACGC
TACGTGGTCG ACGTGATCCT CGACGACTAC AGCCGCGGGG GAGACATCTG GAAGATGATA
AGGGGGAACC GCAATGGCTG A
 
Protein sequence
MAENRIQIVI VADSSKAVKN IRMVSDEFDR FKSSACNSIA GTNSAISSMG AGLASAISLL 
QGFAAAMAAA KAAEIGLGFN KEVEDARLGI ATLLAAQGQF VDQQGQQLQG QQRINAAMAY
SADLTRQLQV DNLKTTATFQ QLLKAFQQTL SPGLSEGLSV DQVRQYTLAM VQAASAMQIP
LDMMAEETRS LLKGTITPRN TLIATALGIT PEDIRKLQGD AEGLFTFIMG KLQAFQDFGQ
VTQTTYSGLL SNAQDALANV LGKATEPFFE SLKASLKRFT DYALQVDAVT GNLKLNPELV
NAFELLNDTL RMAIALAESL AKTLGAVGIA YKAMKEAQKA AALQNTPAGF AGPELADFTD
VQRLERVKQL MAKIVEDQQT IANWKAGGIG GSILGAPQIG LANLDISNSV TSLENFRAQM
VAAAQNTEEL DAFLKSLKQS SDAVAAAQET AAGNIGSVRY EAARAAMDLD SFKKKLEEVA
RINLTGLSDD LSKSIEGLKA KIAVALQGGD PRQQAIAAAY TERIRKQATA WELALKENAA
PEVFSAISAQ KQQAEEEAYW GRYLAGAEAV RKTTARGAAS GASPIDIDKI DKDVLQFRER
MQKLYSELED LDSEYRIAQL EQSGRNYDAE AERIDRQAAK RKEAFAKEVA DAEQAYLEME
QKLSGSRGGT AEAWAQLADL KTKYDALKKA AQEYGDKVDR NLQLTKDMKK AEDDAKRAED
LAQLNLEYGK LTGTLQEQLA LQILLLRAEK DRKILAADPE LRAAYERLYA EQERLLRLQR
DGSFMDGLSE GLKKWQREMP TAFGQGLEAI EMLKRGIDSA ADALAEFTMT GKMDFSSFAD
SIIRDILRMQ YKALLTQMFG GEGGLFDWLK GLFGGGGPTT GAYGVETYAG IGHRGGPAES
LPDHRYVASY LFARAPRLHD GLAPDEFPAV LQRGERVLSR RETREYGAAN RAPEVVVNVQ
NKTNTPVTAD KTRAAFDGKR YVVDVILDDY SRGGDIWKMI RGNRNG