Gene Ping_0365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPing_0365 
Symbol 
ID4624149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePsychromonas ingrahamii 37 
KingdomBacteria 
Replicon accessionNC_008709 
Strand
Start bp467974 
End bp469092 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content40% 
IMG OID639795559 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_941826 
Protein GI119944146 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTA GTGAACAGAT AAAAAATTAT CAGTGGGATG ATATCCGCCT GTCTATTTAT 
GGTAAAAGTG AAAATGATGT AAAGCGCGCT TTGTCTAAAG AACGTTTAGA TTTAGAAGAC
TTTAAAGCGT TAATCTCTCC CGCAGCTGAG CCTTTCCTCG AGCAAATGGC ACAAAAATCA
CAACAACTGA CCCAGCAGCG TTTTGGTAAA ACACAGCAAT TTTTTATTCC CCTCTATTTG
TCCAATATGT GCAGCAATAT CTGCACCTAT TGTGGTTTTT CTATGCATAA TGCTATTCGT
CGCAAGACCT TGGATATGAA AGAGATTGAA GATGAGTGTT TAGCGATTAA AAAAATGGGT
TTTGCCCATA TTTTACTGGT GACCGGGGAG TCTGAACGCA AGGTTGGGGT TGAGTATTTT
AAACAGGCGC TGCCTATTAT AAAAAAACAT TTTTCGCATA TCTCCATTGA GGTGCAGCCT
TTAGATCAGC ACGAATATGA AGCGCTTATT GAGTATGGTG TTGATGCCGT ATTGGTGTAT
CAGGAAACCT ATAATCCGGT TACCTATGCA GAGCATCATT TAAAGGGTAA AAAATCTGAT
TTTAAGTACC GTTTAGATAC CCATGATAGA CTCGGCAAAG CGGGTATGCT TAAAATGGGC
TTAGGCTGTT TAATTGGTCT TGAGGAATGG CGCACCGATT GTTTTTATGT GGCCGCACAT
CTGAATTATT TAGAGAAAAT ATACTGGCAA AGCCGCTATG CAATCTCATT TCCACGTCTA
CGTCCCTGTG CAGGCGGCAT GGAAATAAAG TCGGTGATGG ATGATAAAGA ACTGGTACAA
TTGATTTGTG CCTATCGATT ATTTAGTCCC GAGGTTGAAT TATCACTTTC AACACGCGAA
TCTGAACATT TTAGAGATCA TGTTTTACCT TTAGGTATTA CATCATTAAG TGCGGGATCC
AGTACACAGC CCGGTGGTTA TGCAGCAACA TCGACAAAAG CCCTTGAGCA GTTTGAAATA
TCCGATGACA GATCCCCGGC TAAAATGGCG AAGTTGGTTA AATCAAAAGG TTTTGAGGTG
GTTTGGAAAG ATTGGGATCA TAGTCTGACG GGTAAATAA
 
Protein sequence
MSFSEQIKNY QWDDIRLSIY GKSENDVKRA LSKERLDLED FKALISPAAE PFLEQMAQKS 
QQLTQQRFGK TQQFFIPLYL SNMCSNICTY CGFSMHNAIR RKTLDMKEIE DECLAIKKMG
FAHILLVTGE SERKVGVEYF KQALPIIKKH FSHISIEVQP LDQHEYEALI EYGVDAVLVY
QETYNPVTYA EHHLKGKKSD FKYRLDTHDR LGKAGMLKMG LGCLIGLEEW RTDCFYVAAH
LNYLEKIYWQ SRYAISFPRL RPCAGGMEIK SVMDDKELVQ LICAYRLFSP EVELSLSTRE
SEHFRDHVLP LGITSLSAGS STQPGGYAAT STKALEQFEI SDDRSPAKMA KLVKSKGFEV
VWKDWDHSLT GK