Gene GSU0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0441 
Symbol 
ID2686372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp472705 
End bp473790 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content65% 
IMG OID637125107 
Productradical SAM domain-containing protein 
Protein accessionNP_951500 
Protein GI39995549 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT CATTGGCGAC AATTAGCGGC CGGGTCCATG CCGGCGAACG GATCAGCGAC 
GAGGAGGCCC TGTTCCTCTT CGAGAGCCGC GACCCCCTGG CCGTGGGAGA ACTGGCCGCA
GCCGTCAACC GCCGCCGCAA CGGCGACCGG GTCTTCTTCA ACGTGAACCG GCACATCAAC
CACACGAACA TCTGCGTCAA CCGCTGCTCC TTCTGTGCCT TCTACCGCGC CGCCGACGAG
CCGGGGGCAT ACCTCTACGA CCTGGAGGAG ATCCGCAACC GCGCGGCCGA GGCCCACGCC
CAGGGTGCCA CCGAGATTCA CATCGTGGGC GGCCTCCACC CCGATCTTCC CTTCGACTTC
TATCTCGCCA TGCTCCGGAC CGTGAAGGAG GTCTCCCCGG ACCTCCACGT GAAGGCCTTT
ACCGCGGTTG AGATCGAGTA CCTGTCGCGG CTCGCCGGCC TTTCGACGGC CGAAACCCTG
ACAGTGCTGA AGGAGGCGGG ACTCGGCTCG CTCCCCGGCG GCGGGGCGGA AATCTTTGCT
CCGGCCGTGC GCAACCGGCT CTGTCCCGAG AAGATCTCCG GCGACAAGTG GCTGGCCATC
ATGGAGGAGG TCCACCGGGC CGGGCTCAAA TCCAATGCCA CCATGCTCTA CGGCCACATC
GAGAGCTACG CGGACCGCGT GGACCACATG CGCCGCCTGC GTGAGCTTCA GGACCGCACC
GGCGGCTTCC AGGTCTTCAT CCCCCTGGCC TTCCAGAAGG ATAACAACCC CCTGGGGCAC
CTGAAACGCC CCGGGCCCGG TGGGGTCGAC GCCCTGCTCA CCCTGGCCGT GGCCCGCATC
TACCTGGACA ATTTCGCCAA TATCAAAGCC TACTGGGTCA TGCTCGGGGT AAAGATCGCC
CAGACTTCCC TGGCCTTCGG GGTAAACGAT CTGGACGGCA CGGTGGTGGA GGAGAAGATC
GGCCATGATG CCGGCGCCGC TTCCCCCCAG ACCATGGGGC GCGACGAAAT CGTCTCCCTG
ATCCGCACGG CCGGCCGGGT GCCGGTAGAG CGGGATACGC TGTACAACGA ACTGCGGGTG
TACTGA
 
Protein sequence
MSISLATISG RVHAGERISD EEALFLFESR DPLAVGELAA AVNRRRNGDR VFFNVNRHIN 
HTNICVNRCS FCAFYRAADE PGAYLYDLEE IRNRAAEAHA QGATEIHIVG GLHPDLPFDF
YLAMLRTVKE VSPDLHVKAF TAVEIEYLSR LAGLSTAETL TVLKEAGLGS LPGGGAEIFA
PAVRNRLCPE KISGDKWLAI MEEVHRAGLK SNATMLYGHI ESYADRVDHM RRLRELQDRT
GGFQVFIPLA FQKDNNPLGH LKRPGPGGVD ALLTLAVARI YLDNFANIKA YWVMLGVKIA
QTSLAFGVND LDGTVVEEKI GHDAGAASPQ TMGRDEIVSL IRTAGRVPVE RDTLYNELRV
Y