Gene MARTH_orf098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMARTH_orf098 
SymbolthiI 
ID6418256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycoplasma arthritidis 158L3-1 
KingdomBacteria 
Replicon accessionNC_011025 
Strand
Start bp87313 
End bp88458 
Gene Length1146 bp 
Protein Length381 aa 
Translation table
GC content31% 
IMG OID642715289 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001999748 
Protein GI193216506 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00000238711 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCAAA AAATTCTAAT AAGATATGGC GAACTTACGC TCAAAGGCCA AAATAAACGG 
GATTTTATTA ATGATTTAAA ACGTAATTTA ATGTTCCATA TACCAAAAGA GCAAATTAAA
ATGGAATACG ATCGTGCTTT TTTGGATTTT AGTTTGACTA ATTTAGATGC TCTTAAGTAT
GTTTTTGGTA TTTCTTCTTA TTCATGCGTT TATGAAGTAG AAAGCTCTTT GGCAGCAATT
ACTTCAAAAG TACTAGATAT TGCTAAACAA AAATATCCTT TTAAAACTTT TGCTATTGCA
GCAAGAAGAC ATAATAAAAA TTTTGAAATG AATTCCAATG ATTTAAATAG ACATTTAGGC
TGTGCTATTC TAAGTAATTT CGAAGTAAAA GTAAATTTAG AAGAGCCTGA TTTAAAAATA
TATGTTGAGG TTAGGGATGC TTCAACTTAT ATTTTTATTG ATTATATTGC CGGCCTTGGT
GGCATGCCTT TAAATTCTGC TGGTCAAGTT TTGCATCTAA TGAGTGGTGG CATTGATTCA
CCAGTAGCGG CTTATTTACT ACAAAAACGA GGTCTAAGAA TTAATTTTTT AAATTTCATC
ACGCCACCTC ATACTGATGA AAAAACCACA CAAAAAGTTG ATGAATTAAT TAAAGTTATC
GCTAAATACC AAGGGAGCGC CAAACTATAT CAAGTTAATT TTACAGATAT CATGAATTAT
ATTGGCCTCG TAAGTAATCA AAAATATAAA ATTATCTTAA TGCGACGTTC TTTTTATCGG
ATCGCTCAAA TGCTTGCAAA AAAATTGCAC ATTAAAGCTT TATCTAACGG TGAAAATTTG
GCACAAGTGG CATCACAAAC ATTAGAAGCA ATTCACACAG TTAGTGCTCC GATTACACTT
CCGATTTTTA GACCACTTCT TAGTTTTGAT AAAAACGAAA CGATTAAGAT TGCCGAAAAA
ATAGGAACTA TGCCAATTTC AATTTTAAAA GCTTGTGAAA CTTGCGAACT TTTTGCTCCT
AAAAATCCAA TTATTAAACC AACGCCCGAA GAAGCAAGCG AGCTAGAAAA AGAATTAGAT
AAACTACCAG AGCTAGAAAA ATTAGCTGTT GAAAATGTAA CTATTAAAAC AATTAGCACC
TTATAA
 
Protein sequence
MYQKILIRYG ELTLKGQNKR DFINDLKRNL MFHIPKEQIK MEYDRAFLDF SLTNLDALKY 
VFGISSYSCV YEVESSLAAI TSKVLDIAKQ KYPFKTFAIA ARRHNKNFEM NSNDLNRHLG
CAILSNFEVK VNLEEPDLKI YVEVRDASTY IFIDYIAGLG GMPLNSAGQV LHLMSGGIDS
PVAAYLLQKR GLRINFLNFI TPPHTDEKTT QKVDELIKVI AKYQGSAKLY QVNFTDIMNY
IGLVSNQKYK IILMRRSFYR IAQMLAKKLH IKALSNGENL AQVASQTLEA IHTVSAPITL
PIFRPLLSFD KNETIKIAEK IGTMPISILK ACETCELFAP KNPIIKPTPE EASELEKELD
KLPELEKLAV ENVTIKTIST L