Gene Aasi_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0402 
Symbol 
ID6377341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp473271 
End bp474290 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content37% 
IMG OID642681568 
Producthypothetical protein 
Protein accessionYP_001957549 
Protein GI189501832 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAT CCCGAACAGA ACTTAAAGAT TTAGGTAAAT TTAAACTTAT TGAAACTATT 
ACAGAAAACT TTAAACTTCA TCATCCAACT TCCATTCATG GAATTGGAGA TGATGCTGCT
GTAATTGATA CAGGTAATTT TTATATCCTG GTTACTACAA ATATGTTGGT GGAAGGGGTA
GCCTTTGATC TTACTTATTG CCCCCTTAAG CATGTAGGCT ATAAAGCTGT TGTTGGTACC
ATGGCTGATA TTATAGCTAT GAATGGTACT CTTGAACAGA TAACGGTCAG TATTGCTATC
AGTAATCGAT TTACCCTAGA AGCTGTGCAG GAGCTTTATC AAGGTATTTA TAGAGCTTGC
GAGCATTACA AAGTAGATCT AGTAGGCGGT GATACAACTT CTTCTACTTC TGGATTAATT
CTTTCTATTA CTGCTATAGG AAAGGTAGTC AAAGAAAAGT TATGTTTAAG AAAAGGTGCT
AAGCCTTATG ATTTGGTTTG CGTAACAGGT GACCTGGGGG CAGCTTATTT GGGACTGCAA
ATATTAAACA GAGAAAAAAA GATATTTGAG GTAGATCCTC ATATGCAGCC AAAGCTAGAA
CCTTATCAAC ATCTTATAGA AAGGCAGCTA AAACCAGAGG CGCGTACCGA AATAATACAG
CTATTTGACC AAGAAAATAT ACTTCCCTCT AGTATGATTG ATATTTCAGG TGGATTAGCT
TCTGGGCTAT TGCATATCAA CAAAGTTTCT GGCGTAGGGA TTACTATTTA TGAGAATAAA
TTACCTATTA ACCAGAAAAC TTATGAGACT GCTGAGGCGC TTCATCTTTC ATCCACGTTA
TGTGCATTGC ATGGCGGCGA AGATTATGAA TTGTTGTTTA CTATCCCACA AAGCGAGCTT
CCTAAAATAG AAAAACACCC TAACATACAT GTAATAGGTT ATGTTACTGA CGTAAGTCTA
GGTGTAAAAT TAATAACCAA CAGTGATGAA TCAATAGAGA TAAACGCCCA AGGTTGGTGA
 
Protein sequence
MSTSRTELKD LGKFKLIETI TENFKLHHPT SIHGIGDDAA VIDTGNFYIL VTTNMLVEGV 
AFDLTYCPLK HVGYKAVVGT MADIIAMNGT LEQITVSIAI SNRFTLEAVQ ELYQGIYRAC
EHYKVDLVGG DTTSSTSGLI LSITAIGKVV KEKLCLRKGA KPYDLVCVTG DLGAAYLGLQ
ILNREKKIFE VDPHMQPKLE PYQHLIERQL KPEARTEIIQ LFDQENILPS SMIDISGGLA
SGLLHINKVS GVGITIYENK LPINQKTYET AEALHLSSTL CALHGGEDYE LLFTIPQSEL
PKIEKHPNIH VIGYVTDVSL GVKLITNSDE SIEINAQGW