Gene Sfum_3967 details

Gene Information

Locus tag: Sfum_3967
Symbol:
ID: 4457716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4820102
End bp: 4821562
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 63%
IMG OID: 639704738
Product: fibronectin, type III domain-containing protein
Protein accession: YP_848069
Protein GI: 116751382
COG category:
COG ID:
TIGRFAM ID:
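
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(4820102, 4821562)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.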


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.476574
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.587814
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAA GCAGAGCGTC GTTCGTCCGC ACAACGAAAT TCCGGCGCAT CCGCCGTTTG 
ACGACTTTCC TGGTTGCCCT CGCGGCCGTC CTCTCGATCG CCCCCGATGC CTTTCCCGCT
TCAGTCACCC TCTCCTGGAA TCCCGTGTCG TTCAAGAACC TTCGCGGCTA CAGAATCTAT
TGCGGCACGT CGCACCGCAA CTATTCCCGC CGCGTGACGC TGGGCAAGGC GACTTCCGGG
ACGGTAACGA ACCTTGTGAA GGGCAGAACC TATTATTTTG CCGTAACCTC CTATGCCGCC
GGGGTGGAGA GCGGATTCTC AAAAGAAGTG GTCTTCCCCC CGAATGAGGC ATCGATCCAC
GATACGCGCC TCCAGACGGG TGATTCCGGG GATGCGTCAG ACACCCCGGC ATCCGAGGGC
GCATTCGAGC TGCAAGTCTT GTCCAGAACC GATCCCCCGG CTCCCGGCGT CAACGATGCG
GGAGGCGATT CCGTTGCGGC GCTCGACTCC GGGGGCATGC AACCCCACGA ATCGGTTTAC
CTGGAGGCGG AAGCCGGCGC TCTGGAGGTC CCCCTGGAAA CGGGCGTGAC CACGTCCGCC
ACCGGTTTTA TCCGGGTGCC CGTCGGGCTT GAGCCCGTTT CGGACCCCCT GCGGGAAGGG
GGCACCGCCA CCTATGCATT CACGGTGACC GCCGCCGGAG ACTATACCTT CTGGGGACGC
GAATACTGTC CGTACAGCAC CCGCAATTCC TTTTTCGTCT CGGTCGATTC CGGCCCCTTC
CTCACCTGGA ACACCGCGAT CGTCAACGGG TGGCTGTGGG ACCAGGTGCG CGACGGGTCC
TCCGGTTCCC CGTTGAAACT GCGGCTGGGA GCCGGCGACC ATACGCTCAG GATCAAGCAA
AAGGAGGACG GCACGCGCTT GGACAAGATC CTGATCACCA GCGCGCCGCA GCCGTTCCCC
TCAACGCTCT ATTGCGCCGT ATCGCGCGGA GTGCCGGGCC GGTGGAGCAT CACCGACCCC
GACCCGCCCG GGGCGAAGGT CGTGAGCGTG TTTGACGACG AGCGGGGCGG TCCCGTCACG
GAATTGTCCG GCTCCGAAAC GGCCAACGCG TTTCACCTGG GCGGCCGGGA CCTCGACGGT
TGGCACAACA CCCGTCAGTT CGTCCTGGAG TGGAGCATGA AGTTCTCCGA GGACTACGTG
GTTTCGGTGC ACGTGCAGAC CACGGACGGC TACCGGGAAA TCCGGTACAA GCCTCTTATC
GGAAACGGTC CGGGAGACGA CCGGACAATC GATTGCGGGC TTGGGCCCGG CACCCTGGAC
GGCGACTGGC ACACCTTCGC CCGCCACCTG CAAAAGGACT TGTCCAGGGC ACAGCCGGGG
GTAAAGATCC TCGAAGTGAA TGGATTCTCG GTTCGCGGCA GCGGCAGAAT AGATAACGTG
AGGCTCGGGG CCGGCCGTTG A
 
Protein sequence
MTESRASFVR TTKFRRIRRL TTFLVALAAV LSIAPDAFPA SVTLSWNPVS FKNLRGYRIY 
CGTSHRNYSR RVTLGKATSG TVTNLVKGRT YYFAVTSYAA GVESGFSKEV VFPPNEASIH
DTRLQTGDSG DASDTPASEG AFELQVLSRT DPPAPGVNDA GGDSVAALDS GGMQPHESVY
LEAEAGALEV PLETGVTTSA TGFIRVPVGL EPVSDPLREG GTATYAFTVT AAGDYTFWGR
EYCPYSTRNS FFVSVDSGPF LTWNTAIVNG WLWDQVRDGS SGSPLKLRLG AGDHTLRIKQ
KEDGTRLDKI LITSAPQPFP STLYCAVSRG VPGRWSITDP DPPGAKVVSV FDDERGGPVT
ELSGSETANA FHLGGRDLDG WHNTRQFVLE WSMKFSEDYV VSVHVQTTDG YREIRYKPLI
GNGPGDDRTI DCGLGPGTLD GDWHTFARHL QKDLSRAQPG VKILEVNGFS VRGSGRIDNV
RLGAGR
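
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation. The helper names `clean_sequence`, `gc_fraction`, and `is_whole_codons` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked):
    """Join a blocked sequence by dropping all whitespace, then uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def is_whole_codons(seq):
    """True if the sequence length is a multiple of three."""
    return len(seq) % 3 == 0


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGACCGAAA GCAGAGCGTC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applying the same functions to the full gene sequence would give a value to compare against the GC content field, which the record reports as a rounded percentage.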