Gene Htur_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4104 
Symbol 
ID8744732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp367307 
End bp368998 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content68% 
IMG OID646514661 
Producthistidine ammonia-lyase 
Protein accessionYP_003405608 
Protein GI284167330 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGTCG AACTCGACGG TGAGAGCCTC ACGCCCGACG CCGTCGTCGC AGTCGCTCGC 
GACGATGAAC CGATCAGCGT ACCGGAATCG GCTCGCGAGC GGGTTCGCGA GTGCCGTGCC
CGCGTCGAGG ATATCGTGGA GAGTGGCAAC GTCGTGTACG GCCTCAACAC CGGTTTCGGC
CAGTTAGTCG ATGAACGGAT CCCGTCCGAA GACTTGGCGC AGTTACAGGT CAACCTCCTC
CGCAGCCACG CCGCCGGCAC GGGGCGCGAA CTCACGCGCG AGGAAGTCCG GGCCATGCTG
GTCGCTCGAA TCAATGCCCT CGTCAAGGGC TACTCGGGGG TTCGCGAGCG CGTCATCGAC
TATCTCGTGG CGATGTGTAA CGCGGGCGTC CACCCTGTCG TCAGAGCCCA GGGTAGCCTT
GGCGCCAGCG GCGATCTCGC TCCACTGGCG CATATGTCGT TGGTATTGAT CGGCGAGGGC
GAAGCGGTCG TCGACGTCGC TATCGATGAG CGAGACGAGC GCGACGAGGC CGACGAGGCC
GGCGGCACCG ATGGTGCCGA CACGGAAGCC GGGGCGACTC GCCGGCTGTC CGGGCAAAAC
GCCCTCGCGA CTATCGATCG CGAGCCGCTC GATCTCGCCC CGAAGGAGGG ACTCGCACTC
ATCAACGGCA CGCAGTTGAC GGCGGCGCTC GCGGCTCTGG TCGTCGTCGA CGGGGAGCAC
CTCGTCCGCG CCGCCGACGC AGCCGGCGCG CTGACGACTG AGACGACGCT CGCAACGACG
GCGACCTCGG CCCCGGCTAT CCAGAGCGTC CGGCCCCACA CGGGCCAGCA GGAAAGCGCG
GACTTGGTCC GCCGGCTCAC CGCCGACAGT GAGATCGTCG AGGCCCACCG AAACTGCGAT
CGAGTGCAGG ATGCCTACTC GCTGCGGTGT CTGCCACAGG TCCATGGCGC CGTCCGGGAC
GCGGTGTGCC ACCTCCGCGA GGGCGTCACG ACCGAACTCA ACAGTGCGAC GGACAACCCG
TTGATCTTCC CGGCGGCCGA GGTCGACGAC CGCGCTTCCG GAACCGATCG CGGCGCGGCC
CTTTCGGGCG GTAACTTCCA CGGCGCCCCG TTGGCTCTCC GCCTCGAGTA CGTCCGCCTC
GCGCTGACCG ACCTCGCGGC GATCTCCGAG CGGCGGATCG ATCGGCTCCT CAATCCGAAC
CTTCAGGAGG ACCACCTCCC ACCGTTTCTC GCGGTCGAGA GCGGCCTCGA ATCGGGCTAC
ATGATCGCTC AGTACAGCGC CGCCGCCTTG GTCAACGAGT TGCGCTCGCT CGGTGCCGCC
TCGGCGGACA ACACCCCCGT TAGCGGCAAC CAGGAGGATC ATGTCAGTAT GAGCGCCCAA
GCCGCGCTGA ACGCCCGAAC TGCAATCGAG AACGCGAGAG GTGTCGTCGC TGCCGAACTC
GTCTGTGGAA CCGAAGCCAC CGAGTACGTC GACGACGCGT TCGAGGCCAC CGACCTTTCG
CTCGGCGTCG GAACACGCGC AGTTCGCGAC CTGATCCGCG AGGTCGTTCC GCCGCTCACC
GGCGACCGAC CACTCCATCC GGATATCGAC GCCGTTGCGG ACCTGCTCGC CGCGGGCCAC
CTCGACACCG CCCTCGAGCA GGCGCTTGAG ACGTTCGACC CGGCCAGTGG GCGCTCACGG
GTCGGAGAGT AG
 
Protein sequence
MTVELDGESL TPDAVVAVAR DDEPISVPES ARERVRECRA RVEDIVESGN VVYGLNTGFG 
QLVDERIPSE DLAQLQVNLL RSHAAGTGRE LTREEVRAML VARINALVKG YSGVRERVID
YLVAMCNAGV HPVVRAQGSL GASGDLAPLA HMSLVLIGEG EAVVDVAIDE RDERDEADEA
GGTDGADTEA GATRRLSGQN ALATIDREPL DLAPKEGLAL INGTQLTAAL AALVVVDGEH
LVRAADAAGA LTTETTLATT ATSAPAIQSV RPHTGQQESA DLVRRLTADS EIVEAHRNCD
RVQDAYSLRC LPQVHGAVRD AVCHLREGVT TELNSATDNP LIFPAAEVDD RASGTDRGAA
LSGGNFHGAP LALRLEYVRL ALTDLAAISE RRIDRLLNPN LQEDHLPPFL AVESGLESGY
MIAQYSAAAL VNELRSLGAA SADNTPVSGN QEDHVSMSAQ AALNARTAIE NARGVVAAEL
VCGTEATEYV DDAFEATDLS LGVGTRAVRD LIREVVPPLT GDRPLHPDID AVADLLAAGH
LDTALEQALE TFDPASGRSR VGE