Gene Sfum_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0041 
Symbol 
ID4460981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp55576 
End bp56631 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content65% 
IMG OID639700793 
Productchorismate synthase 
Protein accessionYP_844179 
Protein GI116747492 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0022126 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.467855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGGGA GCAGCTTCGG CAGGCTTTTT CGTATCACCA CCTGGGGAGA ATCGCACGGC 
CCCGCACTGG GGGTCGTAAT AGACGGCTGT CCGCCGGGAA TCCCGTTGGC TCCCGAAGAC
ATCCAGCGCG ATCTCGAACG GCGCCGTCCC GGCAAGCGCC TCACCTCACC CCGCGGCGAA
CCGGACCGGG TGGAGATTCT CTCGGGCGTC TTTCAGGGGG TCACCACAGG CACGCCCATC
AGCCTGGTGA TTTTCAACCG GGATGTCCGC AGCGGCGATT ACACGGAATT GGCCGAAGTT
TACCGACCCG GGCACGGCGA CCGCACCTAC GAACAAAAAT ACGGCGTCAG GGACTGGCGC
GGAGGAGGCC GGAGCTCGGG GCGCGAGACC GCCGCCCGTG TGGCCGCCGG CGCCGTCGCC
CGCAAGTTCC TGGCCGGCCG TGGCGTTGAA GTGAAAGCCT ACACGGTTGC CTTCGCCGGC
TTGCATGTGG ACTCCTTCAA CCGGGACGAA ATCGATCGCA ATCCCTTTTT CTGCCCGGAT
GCGACAGCCG CAGCCGCCAT GGAGCGTCGC GTCGAGGAAC TGCGGGATGC GGGGGACTCC
TGCGGAGGCG TCGTCGAAGT GTCGGCAAGA GGCTGTCCGG CGGGCCTCGG AGAGCCTGTC
TTCGACAAAT TGGACGCGCG CCTGGCCGGG GCGCTCATGT CCGTGGGAGC AGTGAAAGGA
GTGGAGATCG GCGCCGGTTT TGCCGCCGCC GCCATGCTCG GCAGCGAGAA CAACGACCCC
CTTACCCCCG ATGGCTATGC AAGCAACAAT GCCGGCGGCG TTCTGGCGGG AATTTCCACC
GGGATGGACA TCGTCGCGAG GGCGGCCGTC AAACCCATAC CCTCCATCTC AAAACCGCAA
CAGACCGTCA ACACCAGGGG TGAACCCGTC ACCCTCTCCA TCAAAGGACG ACACGACGTA
TCGGCCATCC CGCGCATCGT CCCGGTGTGC GAAGCCATGG TTCTCCTGGT GCTGGCCGAC
TTCATGCTTC ACCCGGCGCC CGTGGAAAAG CGGTGA
 
Protein sequence
MAGSSFGRLF RITTWGESHG PALGVVIDGC PPGIPLAPED IQRDLERRRP GKRLTSPRGE 
PDRVEILSGV FQGVTTGTPI SLVIFNRDVR SGDYTELAEV YRPGHGDRTY EQKYGVRDWR
GGGRSSGRET AARVAAGAVA RKFLAGRGVE VKAYTVAFAG LHVDSFNRDE IDRNPFFCPD
ATAAAAMERR VEELRDAGDS CGGVVEVSAR GCPAGLGEPV FDKLDARLAG ALMSVGAVKG
VEIGAGFAAA AMLGSENNDP LTPDGYASNN AGGVLAGIST GMDIVARAAV KPIPSISKPQ
QTVNTRGEPV TLSIKGRHDV SAIPRIVPVC EAMVLLVLAD FMLHPAPVEK R