Gene Sfum_3723 details

Gene Information

Locus tag: Sfum_3723
Symbol:
ID: 4457975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4538749
End bp: 4539699
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 59%
IMG OID: 639704496
Product: TRAP transporter solute receptor TAXI family protein
Protein accession: YP_847828
Protein GI: 116751141
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
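
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4538749, 4539699)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```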


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAA GACTACTCGT CGTATCCGTT CTGCTCCTGG TCTTTGTGGG CACCCTCTGC 
TCGGCGCAAG CCGAAGAAGT CAGGCTGATT CTCGGCACGG GAGGCACCGC CGGGACCTAC
TACCCGCTCG GGGGCTCCAT GGCCAAGATC TGGAATTCGA AGATTCCCGG CATGAACGTC
ACGGCACAGA CCACGGGGGC TTCCGCCGAA AACGTTCGCC TGGTGAACAA GAAGGAAGCG
GAGCTCGCGC TGGTCCAGAG CGACACGCTG GATTTCGCGT TCAAGGCCGA GCCTCCGTTC
AAGGAGAAAC TCACGGCCAT GGCGGCCATT GCCGTGCTCT ATCCGGAAGT CATCCAGGTC
GTGGTGCGCG CCGATAAGCC GGTCAAGACC TTCGCCGATC TCAAGGGACT GAAGATGGGC
GTGGGAGCCC CGGGCAGCGG AACGGAGGCG AATTTCCGGC AGCTTTGCGA CGTGCACGGA
CTGGTAAAAG GCGACATCAA CGCCCAGTAT CTTTCCTTTT CCGAGAGCGC CGAACAGTTC
AAGGACAAGC ACATCGACGC CTTCCTCGTG ACGGCCGGTC TCCCCAACCC GGGCATCATG
GACGTCAGCA CCCAGAACGA CATTCGGATC CTCAGCATTT CCGACGATAT GCTGAAGAAG
ATCACGACCA AGTATCCTTT CCTCTCTCCC GTGAAGGTCC CTGCCAATAC CTACAAGAAC
GTCCCCGAAG CGAGCACCGT GGCGGTGAAC GCCGTGCTCA TCGTGAATTC GGGGATCAAG
GAGGACGTCG TCTACAACCT GACCAAGGCT CTGTTCGACA ACCAGCCGGA ACTGGCCGCG
GCCCACGCCA AGGGCAAGGA AGTGAACCTG CAGACGGCGG TCAAGGGTGT GTCCATCCCG
TTCCACCCGG GAGCGGTGAA GTACTACAAA GAAAAAGGCG TCATGAAATA G
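
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing length or GC content. The sketch below shows one way to do that; the helper names are hypothetical and not part of any database tooling, and the input shown is only the first two blocks of the sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only to illustrate the calls.
fragment = clean_sequence("ATGATGAAAA GACTACTCGT")
print(len(fragment), gc_percent(fragment))
```

Passing the full sequence through `clean_sequence` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields of the record.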
 
Protein sequence
MMKRLLVVSV LLLVFVGTLC SAQAEEVRLI LGTGGTAGTY YPLGGSMAKI WNSKIPGMNV 
TAQTTGASAE NVRLVNKKEA ELALVQSDTL DFAFKAEPPF KEKLTAMAAI AVLYPEVIQV
VVRADKPVKT FADLKGLKMG VGAPGSGTEA NFRQLCDVHG LVKGDINAQY LSFSESAEQF
KDKHIDAFLV TAGLPNPGIM DVSTQNDIRI LSISDDMLKK ITTKYPFLSP VKVPANTYKN
VPEASTVAVN AVLIVNSGIK EDVVYNLTKA LFDNQPELAA AHAKGKEVNL QTAVKGVSIP
FHPGAVKYYK EKGVMK