Gene Shewana3_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1941 
SymbolthiH 
ID4479738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2307669 
End bp2308784 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content50% 
IMG OID639726523 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_869578 
Protein GI117920386 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG TCGACCAATT TGCCCGTATT GAACGGGATA AGTTATTGCT GGCGTTATAT 
TCCTGCACGG CAGTGGAGGT TGAGCGCGCC CTGATGCAAC CCGAGGGCAA TCTACAGAGT
TTACTTGCCT TGTTGTCTCC AGCGGCCGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG
GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATCGGAC TCTATCTGCC CTTATACCTG
TCGAACCTGT GTGCCAATGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGCTAAAG
CGCAAAGTGC TCAATGAGCA GGAAATTGCG GCTGAAATGG CGATTATCAA ATTCCGTGGT
TTTGACTCCA TCTTGCTGGT GTCGGGAGAG CATGACACCA AAGTGGGGAT GGATTACTTT
AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATA TGGCCATGGA GGTTCAGCCG
CTTGAAGAGG TTGATTATCG TCAGCTTGTC GAGCTAGGGC TGGATGCTGT GATGGTGTAT
CAAGAAACCT ATCAAGCGGC GACCTATGCC AAGCATCATA CCCGAGGCAA TAAACAGGAC
TTTGCATATC GGCTCGCAAC GCCCGACCGC GTTGCTAGTG CGGGTGTCGA TAAGATAGGC
CTAGGTGTGC TACTAGGTTT AGATGACTGG CGGCTCGATG CTTTACTGAT GGGCCATCAT
CTGGACTATT TAGAACGGAA TTACTGGCGT ACTCGTTTTA GTATTTCGTT ACCACGTTTG
CGGCCTTGTA CCGGCGGTAT AACACCAAAA GTGCATTTAA CCGATCTTGG ACTGGTGCAG
ATGATCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG
GCGCCATCGC TTCGGGACAA TTTACTGCCG CTTGGGATAA CACAGATGAG TGCGGGGAGC
TCGACACAGC CAGGCGGTTA TCAGGCGCCA GAGAGCCAAT TAGATCAGTT TGAGATAAGC
GATGAACGTA CTGTTGAGCA GGTCATGACT CAGATGCGAC TCAGGGGATT TAATCCGGTT
TTCAAGGATT GGGAATCGGC TTGGATTGCG CGTTAA
 
Protein sequence
MSFVDQFARI ERDKLLLALY SCTAVEVERA LMQPEGNLQS LLALLSPAAE PYIEEMAQRS 
AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKFRG
FDSILLVSGE HDTKVGMDYF KRVLPIVKQQ FSYMAMEVQP LEEVDYRQLV ELGLDAVMVY
QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH
LDYLERNYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ MICAFRLFNQ QLDISLSTRE
APSLRDNLLP LGITQMSAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMT QMRLRGFNPV
FKDWESAWIA R