Gene Htur_1119 details


Gene Information

Locus tag: Htur_1119
Symbol:
ID: 8741707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1165995
End bp: 1167158
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 69%
IMG OID: 646511698
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_003402684
Protein GI: 284164405
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
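
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1165995
end_bp = 1167158
gene_length_bp = 1164
protein_length_aa = 387

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is
# not counted in the reported protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print `True`: the span is 1164 bp, which is 388 codons, or 387 amino acids plus a stop codon.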


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACGC TACGGACGCC GGGACCGACG CTCGGGGTCG TCGGCGGCGG ACAGCTCGGA 
CGGATGCTCG CAGAGGCGGC GTCGCCGCTG GGAGTCGAGG TCGTCGTGCT CGATCCGACG
CCGGACTGTC CGGCCGCGCC GGTCGCCCGC GACCAGATCG TCGCCGACTT CGACGACGAG
GCGGGGATCC GCGAACTCGC CGCGCGCGCG GACGTGCTCA CCTTCGAGAT CGAACTCGCC
GATCAGGACG TCTTAGAGCG CATCAGCGAG GACACGGGGA CGCCGGTCCA TCCGAAGCCG
TCGACGCTGC GGACGATCCA CGACAAACTC GTCCAGAAGC GCGAACTCGA GGACGCGGGC
GTTCCGGTGC CGCCGTTCCG CGAAGTCGAG GACGCCGACG ACATCCGCGC GGCCATCGAC
GACTACGGCG CGCCGGTAAT GTTGAAGGCC CGAACGGGCG GCTACGACGG CCGCGGCAAC
GTCCCCGTCG AGTCGAAAGC CGAAGCCGAC GAGGCCCTCG AGTCGGTCGC CGGCCCCGCG
ATGGTCGAGT CGTTCGTCGA CTTCGAGCGC GAGGTCTCGG TGATCGCCGT CAAAGGCGAT
GACGAGGTCG CGACCTTCCC GCTGGGCGAG AACGTCCACG TCGACGAGAT CCTCCGGGAA
ACCATCGTTC CCGCGCGCTC GAGCGACGCG GCCGCGGAAC GCGCCTACGA CGTCGCGCGG
GACGTCCTCG AGGTGATGGA CGGCCGCGGC GTCTACGGCA TCGAACTGTT CGAAACGCCC
GACGAGGAGA TCCTGCTCAA CGAGATCGCG CCGCGCCCGC ACAACTCCGG CCACTGGACG
ATCGAGGGCG CGGCGAATTC GCAGTTCGAA CAGCACGCCC GCGCCGTGCT GGGCTGGCCG
CTGGGCTCGA CGGACCTGCG CTCGCCGACC GTCCTGACGA ACCTGCTCGG CGACGTCGAC
GAGGAGCAGC GCGCGGAACT GGGCGATATC GACCGCCTTC TCGAGACACC CGGCGCGAAC
CTCCACTGGT ACGGCAAGCG TCAGGTCCGG CCGCTGCGCA AGATGGGTCA CGTGACGGTC
TCGGCCGAAG ACGAGGACGC CGACGTCGAG GACCTGCTCG AGACGGCGCG CAAACTCGAG
GACGCGGTAA CGTTCCGAAA CTGA
 
Protein sequence
MTTLRTPGPT LGVVGGGQLG RMLAEAASPL GVEVVVLDPT PDCPAAPVAR DQIVADFDDE 
AGIRELAARA DVLTFEIELA DQDVLERISE DTGTPVHPKP STLRTIHDKL VQKRELEDAG
VPVPPFREVE DADDIRAAID DYGAPVMLKA RTGGYDGRGN VPVESKAEAD EALESVAGPA
MVESFVDFER EVSVIAVKGD DEVATFPLGE NVHVDEILRE TIVPARSSDA AAERAYDVAR
DVLEVMDGRG VYGIELFETP DEEILLNEIA PRPHNSGHWT IEGAANSQFE QHARAVLGWP
LGSTDLRSPT VLTNLLGDVD EEQRAELGDI DRLLETPGAN LHWYGKRQVR PLRKMGHVTV
SAEDEDADVE DLLETARKLE DAVTFRN
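
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative and not part of the original record. It builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (table 11 differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = translate("ATGACAACGC TACGGACGCC GGGACCGACG")

# Final line of the gene sequence. It starts at base 1141, and the 1140
# preceding bases are a multiple of 3, so it is already in frame.
last_block = translate("GACGCGGTAA CGTTCCGAAA CTGA")

print(first_block)
print(last_block)
```

The two results, `MTTLRTPGPT` and `DAVTFRN`, correspond to the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural next step for checking the whole translation.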