Gene Shewana3_0708 details


Gene Information

Locus tag: Shewana3_0708
Symbol: thiH
ID: 4476918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 822821
End bp: 824260
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 49%
IMG OID: 639725243
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_868352
Protein GI: 117919160
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
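
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names span_length and coding_codons are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 822821
END_BP = 824260
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# 824260 - 822821 + 1 = 1440 bp, and 1440 / 3 - 1 = 479 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Both assertions hold for the values listed, which is consistent with the inclusive-coordinate assumption.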


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.519768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000306942
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACAC ACGAGCATCA TTCCATTACC GTCTCTGACT ATAATCCCAA CGTCAGCTTT 
ATTGACGATC TGGCGATTTG GCAGGCCATT GAAGAGGCCA GCAATCCGAG TCGTGAACAA
ATCCAAGCCA TTCTCGAAAA GGCGCGCCAA TGCGAAGGCT TAAGCATTCG CGAAACCGCT
CTCCTGCTAC AAAATCAAGA TAAAGCGCTG GATGAAGCAC TCTTTGCCGT CGCCCGTGAG
ATTAAAAACA CCATCTACGG CAATCGTATA GTGATGTTTG CGCCACTCTA TGTCTCCAAC
CATTGTGCCA ACAGTTGTAG TTACTGCGGC TTTAATGCCG ATAACCATGA GCTGAAACGC
AAGACCTTAA AACAGGATGA GATCCGCCAA GAGGTCACCA TCCTCGAAGA AATGGGCCAC
AAACGGATCT TGGCCGTTTA TGGCGAGCAT CCACGCAACA ATGTGCAAGC CATTATTGAA
AGTATTCAAA CCATGTACAG CGTGAAGCAG GGCAAGGGCG GCGAAATTCG CCGTATCAAC
GTCAACTGTG CGCCAATGAG TGTGGAGGAC TTTAAACAGC TCAAAACGGC GGCGATAGGC
ACTTACCAAT GCTTCCAAGA AACCTATCAT CAAGACACTT ACAGTGAAGT GCACCTAAAA
GGTAAAAAAA CTGACTTTTT ATACCGCCTC TACGCCATGC ACAGAGCCAT GGAAGCGGGA
ATCGACGATG TCGGTATCGG CGCCCTCTTT GGCCTGTATG ACCATAGATT TGAGCTGCTC
GCTATGCTCA CTCATGTTCA ACAACTCGAA AAAGACTGTG GCGTTGGCCC GCATACTATC
TCCTTTCCGC GGATAGAACC CGCCCATGGC TCTGCCCTTA GTGAAAAGCC GCCCTATGAG
GTTGATGATG AGTGCTTTAA GCGTATCGTT GCCATCACTC GCCTCGCCGT ACCTTATACA
GGCTTGATTA TGAGCACGCG GGAGAGCGCC GCAATGCGCA AAGAATTGTT AGAGCTTGGC
GTTTCACAGA TCAGTGCAGG CTCACGCACT GCACCCGGTG GTTATCAAGA CAGCAAACAA
AATCAACACG ATGCCGAACA ATTTAGCCTT GGCGATCATC GCGCCATGGA TGAAATCATC
TATGAATTAG TCACAGATTC GGATGCCATC CCCTCCTTCT GTACGGGCTG TTACCGTAAA
GGGCGCACAG GCGACCACTT TATGGGATTA GCCAAGCAGC AGTTTATTGG CAAGTTCTGC
CAGCCCAATG CCTTAATCAC CTTTAGGGAA TATCTGAACG ACTACGCCAG CGATAAAACC
CGTGAAGCAG GTAACGCCCT GATAGAGCGA GAACTCGCCA AAATGAGTCC ATCACGGGAA
CGTAATGTGC GCGTTTGCCT GAAAAAAACC GATGCGGGTG AACGGGATAT CTATTTGTAA
 
Protein sequence
MSTHEHHSIT VSDYNPNVSF IDDLAIWQAI EEASNPSREQ IQAILEKARQ CEGLSIRETA 
LLLQNQDKAL DEALFAVARE IKNTIYGNRI VMFAPLYVSN HCANSCSYCG FNADNHELKR
KTLKQDEIRQ EVTILEEMGH KRILAVYGEH PRNNVQAIIE SIQTMYSVKQ GKGGEIRRIN
VNCAPMSVED FKQLKTAAIG TYQCFQETYH QDTYSEVHLK GKKTDFLYRL YAMHRAMEAG
IDDVGIGALF GLYDHRFELL AMLTHVQQLE KDCGVGPHTI SFPRIEPAHG SALSEKPPYE
VDDECFKRIV AITRLAVPYT GLIMSTRESA AMRKELLELG VSQISAGSRT APGGYQDSKQ
NQHDAEQFSL GDHRAMDEII YELVTDSDAI PSFCTGCYRK GRTGDHFMGL AKQQFIGKFC
QPNALITFRE YLNDYASDKT REAGNALIER ELAKMSPSRE RNVRVCLKKT DAGERDIYL
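
The relationship between the two sequence blocks can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: it builds a codon table from the standard amino-acid assignments (which translation table 11 shares; table 11 differs in its permitted start codons), strips the spaces used to group the sequence in blocks of ten, and translates until the first stop codon. The names clean and translate are hypothetical helpers, and only the first line of the gene sequence is used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or end of input."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed in this record (60 nt).
first_line = "ATGAGCACAC ACGAGCATCA TTCCATTACC GTCTCTGACT ATAATCCCAA CGTCAGCTTT"
print(translate(first_line))  # MSTHEHHSITVSDYNPNVSF
```

The output matches the first twenty residues of the protein sequence above. Passing the full gene sequence block to translate would be the natural next check against the complete 479-residue protein.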