Gene MADE_00141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMADE_00141 
Symbol 
ID6777139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlteromonas macleodii 'Deep ecotype' 
KingdomBacteria 
Replicon accessionNC_011138 
Strand
Start bp150821 
End bp151957 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content48% 
IMG OID642753569 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002124453 
Protein GI196154964 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTAG CACAAGCGCT AATGCAGCAA GACTTAGCTC ATCTAGCTAT GTTGATTAAC 
AGTAAAACCG CGAGTGATGT AGAGCGAGCT TTAAATGCCA GCACACTTTC AATAGATGAT
TTTATGGCGC TTATCTCCCC TGCTGGTGCA CCCTATTTGG AGGCGATGGC AACTCGTGCT
CAGGCACTCA CCCGCCATCG TTTCGGCAAT ACCATGCAGC TATTCGTGCC TTTGTATCTG
TCGAACCTGT GTGCCAACGA ATGTACCTAT TGCGGCTTTA CCATGAGCAA CAAAATAAAG
CGCAAAACCC TATCTATAAG TGAAGTGCTT ACCGAGATTG AAGCTATTAA AGATATGGGC
TTTTCTCAGG TGTTGCTGGT AACTGGCGAA CACGAAACCA AAGTAGGTAT GGACTATTTT
GAGCAAACCC TTAGCGCTAT ACGAGACAAA GTTAGCTACC TCATGATGGA AGTGCAGCCG
CTAAAGCGTG AGCAATATGA AACGCTAAAG CAGAAGGGGT TAGATGCCGT TTTAGTGTAC
CAAGAAACCT ATTCGCCCCA GCAGTACGCA ACCTATCATA CGCGAGGGAA AAAACAGGAT
TTTTTATGGC GTTTGGAAAC CAGCGACCGG TTAGGCCAAG CCGGTGTTGA TAAAATAGGC
ATAGGTGCGC TACTTGGTTT AGGTGACTGG CGAGTTGATA GTGCTATGAC CGCCCTTCAC
GGTAAGTTGA TTCAGCAGCA TTATTGGCAA AGCCGCGTAT CTATCGCCTT TCCTAGACTT
CGCAGCTGTG AAGGTAACAA CACTGGCGGG AATGCACTTA CCAACAGCAA GCTGCCAAAT
GAGCGGGACT TGCTACAACT AATTTGTGCC CACCGCCTGT TTAACCCACA AGCCGAACTA
TCAATGTCTA CCCGTGAATC GGCCGCGTTT AGAGATGGCG TAATGCCGCT GGGTATAACA
TCTATGAGTG CCGCGTCACA AACGCAGCCA GGCGGCTACA GCGAACCCTC ACAGGCATTA
AATCAGTTTG ATATTGATGA CAATCGTTCG GTGCCAGAAG TGGTAAACGC GATTTCGGCG
AGAGGATTAG AGCCCGTTTG GAAAGATTGG ATGCCCTTTA GGCTCTCGCC GACATAG
 
Protein sequence
MKVAQALMQQ DLAHLAMLIN SKTASDVERA LNASTLSIDD FMALISPAGA PYLEAMATRA 
QALTRHRFGN TMQLFVPLYL SNLCANECTY CGFTMSNKIK RKTLSISEVL TEIEAIKDMG
FSQVLLVTGE HETKVGMDYF EQTLSAIRDK VSYLMMEVQP LKREQYETLK QKGLDAVLVY
QETYSPQQYA TYHTRGKKQD FLWRLETSDR LGQAGVDKIG IGALLGLGDW RVDSAMTALH
GKLIQQHYWQ SRVSIAFPRL RSCEGNNTGG NALTNSKLPN ERDLLQLICA HRLFNPQAEL
SMSTRESAAF RDGVMPLGIT SMSAASQTQP GGYSEPSQAL NQFDIDDNRS VPEVVNAISA
RGLEPVWKDW MPFRLSPT