Gene Sfum_0400 details

Gene Information

Locus tag: Sfum_0400
Symbol:
ID: 4461423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 482019
End bp: 483035
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 61%
IMG OID: 639701155
Product: thymidylate synthase complementing protein ThyX
Protein accession: YP_844535
Protein GI: 116747848
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1351] Predicted alternative thymidylate synthase
TIGRFAM ID:
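
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either point. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 482019
end_bp = 483035
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
coding_codons = span_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions, both comparisons agree with the listed values.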


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTC AATTCGTCAA AACCCGGGTG CAGCCCCAGG GAATCGCTCC GGCCGAAGAA 
GGCCGCGCCC TGCAGTTGGT CGAGCTCTGC GGCAGGACCG CGTACAAATC GGAAGACAAA
ATCACTCCCG ATTCTGCGCG CAATTTCGTC CTGATGTTGA AGAGTCACGG TCACCTGTCC
GTCCTGGAGC ACAGCAACAT CGTGCTGGAG ATCGAGGCGA CGCCGTCAAG CGGCGCTACA
CAGGCCCTTT CTTCGATTTC CGAGCTCTAC GGGGCGCTGC TCGATGGCCT TGGCTTCCGA
ACCGCCTACC ACCGCATCCA CCTCTCGGCC CGGACTTCCC CCGGCGCTCT TTTCATCGCC
GGCAACCTGC GTTCCTGGAT CGAGACGCTC ACCCATCTGG GCAACGGCGG AAGCCCCCTC
CACGCCCTCC TGTCCGCGGC TTTGCGCGAT TTCTTTCCCA TCATCTTCGG GGGCGAGGAA
ACGGTTTCGA GCAACCTTTC CTTCAAGGTC ACCCTGGTCC GCGAAGACGA ACAACTGGCG
CTGCTCCGGC GCGATGCCGC ATGCGATCTG CCCGTCTTCG TGTTCAAGTT CGTCTGCGAC
CGCGGGATCA CTCACGAGGT GGTACGGCAC CGGGTGCTCT CGTTCACGCA GGAGAGCACC
CGCTACGTGA ATTACAAGAA CAAGGGAATG GTGCTGATCC TTCCCGAAGA GCTCTATCCC
TTTTACGACG ATGCGACGCA GCAGCTCACG GGCCGGTCGC CCCTCGTGGA CATGTGGATC
GACAGGGCCG AGAAGCTCTT CGCCTGGTAC CGGGAAGACC TCGACCGGGA AAAACCGGAA
ATTGCCCGGG ACATCCTCCC CAATCTGCTC AAGAGCGAAA TATTCGTGAG CGGCAGATGG
AGCGGATGGA AGCACTTCGT TCAGCTGCGC GATTCCAAGC ACGCGCACCC GCGCATCCGG
GCCATCGCCA AGGAAGTGAG GAACCACTTC GATTCCCTGG GAATGACCGT CGAGTAA
 
Protein sequence
MSVQFVKTRV QPQGIAPAEE GRALQLVELC GRTAYKSEDK ITPDSARNFV LMLKSHGHLS 
VLEHSNIVLE IEATPSSGAT QALSSISELY GALLDGLGFR TAYHRIHLSA RTSPGALFIA
GNLRSWIETL THLGNGGSPL HALLSAALRD FFPIIFGGEE TVSSNLSFKV TLVREDEQLA
LLRRDAACDL PVFVFKFVCD RGITHEVVRH RVLSFTQEST RYVNYKNKGM VLILPEELYP
FYDDATQQLT GRSPLVDMWI DRAEKLFAWY REDLDREKPE IARDILPNLL KSEIFVSGRW
SGWKHFVQLR DSKHAHPRIR AIAKEVRNHF DSLGMTVE
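
The relationship between the gene and protein sequences can be illustrated by translating the first 60 bases of the gene. This is a minimal sketch, not the tool that produced the record. The `translate` helper and the compact codon table are written here for illustration. The table uses the standard amino acid assignments, which translation table 11 shares for sense codons. Alternative start codons are not handled.

```python
# Build a codon table from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the
# standard code; it differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Hypothetical helper for illustration only.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as shown in the record.
first_60_bases = "ATGAGCGTTC AATTCGTCAA AACCCGGGTG CAGCCCCAGG GAATCGCTCC GGCCGAAGAA"
print(translate(first_60_bases))
```

The result can be compared against the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the complete protein.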