Gene Htur_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4106 
Symbol 
ID8744734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp370328 
End bp371326 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content66% 
IMG OID646514663 
Productformiminoglutamase 
Protein accessionYP_003405610 
Protein GI284167332 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACA CCGAGGAGTC GATGAACGAG TTCACGATCT CCCCCGACTG GACCGACGAC 
GAATGGCGCG GAATGGCTCA ATCGACCGAC CCGAACGACG AACTGGTCGG CCATATCGTC
GAGGGAATGA CACTCGAGGC CGTCGACGAC GCCGACGTCG ACGCGGTGCT GGTGGGCGAA
CCGTACGACG GTGCAGTTAT CAGTCGATCG GGGGCGCGCG AGGGGCCGAC CGAGATCCGC
CGCTCGCTTG TGCGAACGAA AACCCACCAC TTCGACTGTG GCCCGCTTCG GGTACTCGGT
GACCTCGGCG ACGTTCGCTC GCTGGTCGAC GCTGGCACGC CCGGCACCGA CTCGTCCGTC
GCCGTCGTTC AGTCGACGCT CCGTGAGACG ACGGCCCGCG TGCACGAGTG CGACGCGGTA
CCGATTTTCC TCGGGGGAGA CAACTCTCTG ACGTACCCGA ACGTCGCCCC GCTACTCGAA
CAGAGCTCCG TGGGCGTGAT CAATCTCGAC GCGCATCTGG ACGTCCGCGA GGTCCGGGGC
GAACCGACGA GCGGCACGCC GTACCGACAG CTCTTCGCAG CCGGTCTCGA TCAATACGTC
TGCCTCGGGG CGCGACACTT CGAGACGGCA ACCCCGTACC ACGAGTTCGT CCGTGAGCGT
GGCGGCGCGG TCATCACGGC CGAAGAAGTC GCGGATGACG CCGTTGAGAC GGCGACGCAC
GCACTCGATG CGATGGGTGA CGTCGATCGA CTCTACGTGA GCGTAGACTG CGATGTACTC
GACGCGAGTG CAGCCCCCGG CGTGAGTGCG CCGACGCCGG GCGGCATCAC CACGCGAGAG
CTGTTTCGCT GCCTGCGACT GCTTACGAGC GACGAGCGAC TCGCGGGGTT CGAGGTTGTC
GAATGTGCCC CGCCGCTCGA CCGGAATGGA CTGACGACCG ATGCGGCGGC CCGTGCCGTT
GCGCACGCCC TTGCCGGCTT TCTGGGGGGA CAACAATGA
 
Protein sequence
MTDTEESMNE FTISPDWTDD EWRGMAQSTD PNDELVGHIV EGMTLEAVDD ADVDAVLVGE 
PYDGAVISRS GAREGPTEIR RSLVRTKTHH FDCGPLRVLG DLGDVRSLVD AGTPGTDSSV
AVVQSTLRET TARVHECDAV PIFLGGDNSL TYPNVAPLLE QSSVGVINLD AHLDVREVRG
EPTSGTPYRQ LFAAGLDQYV CLGARHFETA TPYHEFVRER GGAVITAEEV ADDAVETATH
ALDAMGDVDR LYVSVDCDVL DASAAPGVSA PTPGGITTRE LFRCLRLLTS DERLAGFEVV
ECAPPLDRNG LTTDAAARAV AHALAGFLGG QQ