Gene Sfum_2693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2693 
Symbol 
ID4458992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp3333410 
End bp3334468 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content59% 
IMG OID639703464 
ProductApbE family lipoprotein 
Protein accessionYP_846806 
Protein GI116750119 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00390804 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGGCGG AAGATCGTTT CCCTTTCAAA CAACGGTTCT TGAACAGGCG CTCATTTCTG 
AAGATGTCCG GTCTGCTCGG CCTCGGAGTC GCCTCGGCCG CCATCATATC CCCCTGGGCC
GAAACCGTCC GGTTCAACGG CAAGATGCAC AAGGTTTCCA GGACAAGGCT CGGGATCGGG
ACTTTTGTCT CGATGACCTT GATCCACGAA TCGAAGGATC GGGCCGAGGA AGCCATCGCG
GCCGCAAACC GGGAAATCGA CCGGCTGGTC GCCCTGATGA ACCGGTTCGA CCCCGCCACC
CCGCTGGCCG GGCTGAACCG GGAGGGGTTT CTCAAAGACG CGCCCGAAGA GCTGATCGAG
GTGGTCCAAA GCGCTCTTCA TTATCACGCG CTCTGCAACG GCTGTTTCGA CTGCACGGTC
GCACCCGTGA TCGATCTATT CCAAAAGAAG ATGGGAGGGG AGAATCCCGT CTTCCCCGAA
GAGAGTGAGA TACGGGCTCT GCTGACGCTC GTGGGTTCCG ACAAGATCGA CTTGAAAGGC
CGCTCGATCT CTTTCCGCGA GAGTGGAATG GCCGTCACGC TGGATGGAAT CGCCAAAGGC
TATATCGTGG ACAAAGCCGC TGAAGCGATT GAGCGGCACG GGATTTCCAA CTTTCTCATC
AATGCCGGCG GCGAAATCAG AACCCGGGGG GAAGCCGGCA GGAAGCCGTG GACCGTCGCG
GTGCAAGACC CGGGAAAGAG AAGTCAGTAC CCCCAGGTCA TCAAGGTGCG GGATGCAACG
ATTGCCACTT CGGGCAATTA CGAGGTCTTT TTCGACCGGG AAAAGATGTT CCATCACATC
GTCGACCCCA AAAACGGGCA TTCACCGGCC TTTGCGACGA GCGTATCCGT CATGGCCAGG
ACCACCATGG AATCCGACGC TCTGGCAACG GCGGTTTTCG TGATGCCCCC CGCCGACGGC
GTCGGTCTCG TCAATAGCCT TCCCTGGTGC GAATCCCTCG TGATCTCAAA CAACGGTTCC
ATGCTCAAAT CGAGGGGTTG GCCCGGGACG GCGGCCTGA
 
Protein sequence
MQAEDRFPFK QRFLNRRSFL KMSGLLGLGV ASAAIISPWA ETVRFNGKMH KVSRTRLGIG 
TFVSMTLIHE SKDRAEEAIA AANREIDRLV ALMNRFDPAT PLAGLNREGF LKDAPEELIE
VVQSALHYHA LCNGCFDCTV APVIDLFQKK MGGENPVFPE ESEIRALLTL VGSDKIDLKG
RSISFRESGM AVTLDGIAKG YIVDKAAEAI ERHGISNFLI NAGGEIRTRG EAGRKPWTVA
VQDPGKRSQY PQVIKVRDAT IATSGNYEVF FDREKMFHHI VDPKNGHSPA FATSVSVMAR
TTMESDALAT AVFVMPPADG VGLVNSLPWC ESLVISNNGS MLKSRGWPGT AA