Gene Hore_22110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_22110 
Symbol 
ID7313759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2407616 
End bp2408668 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content45% 
IMG OID643612663 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_002509951 
Protein GI220933043 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG GCATAACCTA TGCTGATTCG GGGGTAAATA TTGATAAAGG TAACGAGTTT 
GTGGATCGCA TTAAAGATAA AGTAAAAGAA ACCCACCTTC CCGGGGTTAT CGGTGGGGTG
GGGGGATTTG GGGGGCTTTT TGCCCCTGAT TTTAAACAAT ACCAAAAACC GGTTCTTGTC
TCTGGAACCG ATGGGGTTGG TACTAAACTG AAAATTGCCC AGCTTGTTAA TAAACATGAT
ATCATCGGGA TTGACCTGGT AGCCATGTGT GTTAATGATA TCCTGGCCCA GGGGGCAAAA
CCCCTTTTTT TCCTTGATTA CATGGCTACC GGGCAACTTG AGCTCGAGAC CGGAGAGGAG
ATTATTACCG GAATTACCAG GGGATGTAAA CAGGCCGGTG TTGCCCTGCT CGGTGGGGAG
ACAGCGGAGA TGCCGGGCTT TTATCAGCAG GGGAGCTATG ACCTGGCCGG TTTTGCAGTT
GGGATCGTTG ACCGTAAGAA TATAATTACC GGTTCGAAAA TAAAAGCAGG GGATCAGCTA
CTGGGCCTTC CTTCATCTGG GATACATAGT AATGGTTATT CCCTGGTCAG GAAAGTACTT
CTTGAAAAAG AGGGGTTGAA AGTAGATGAT AGAATAAAAG AGCTAGACTG TACCCTTGGT
GAGGAGCTTT TAAAACCAAC CCGTATTTAT GTACCTGTGG TTCTTCCGTT GCTAGAAAAA
TATGAAGTAA AGGGAATAGC TCATATTACC GGTGGGGGAA TGCCGGAAAA TATAGCCCGG
ATTATTCCTG ATGGCCTTCA GGCCAGGGTA AACAGAGAAA GCTGGTCGTG TCCCCCAGTA
TTTACCTATA TTCAGGCCAA GGGGGATATT GCTACAGTTG AAATGGAGAG AACCTTCAAT
ATGGGGATCG GTATGGTGCT GGTTGTTTCT CCGGATATAT TAGAAAATGT TATGTCAGAT
ATAAAGGCCC GGGGAGAAAA AGTATACCAT ATTGGAGAAA TAAATAGTAT CGGTAAAAAA
GAGGGTAAGG TGGTTATCTA CAATGGGCAA TAA
 
Protein sequence
MKKGITYADS GVNIDKGNEF VDRIKDKVKE THLPGVIGGV GGFGGLFAPD FKQYQKPVLV 
SGTDGVGTKL KIAQLVNKHD IIGIDLVAMC VNDILAQGAK PLFFLDYMAT GQLELETGEE
IITGITRGCK QAGVALLGGE TAEMPGFYQQ GSYDLAGFAV GIVDRKNIIT GSKIKAGDQL
LGLPSSGIHS NGYSLVRKVL LEKEGLKVDD RIKELDCTLG EELLKPTRIY VPVVLPLLEK
YEVKGIAHIT GGGMPENIAR IIPDGLQARV NRESWSCPPV FTYIQAKGDI ATVEMERTFN
MGIGMVLVVS PDILENVMSD IKARGEKVYH IGEINSIGKK EGKVVIYNGQ