Gene Htur_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4040 
Symbol 
ID8744668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp293461 
End bp295035 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content63% 
IMG OID646514606 
Producthistidine ammonia-lyase 
Protein accessionYP_003405553 
Protein GI284167275 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGACG AACCAGTCGT CGTCGACGGG GAATCGCTCA CACCGGACGC TGTCGAACGC 
GTCGCACGGC ACGGTGCCAC CGTTCGTATC CCGGAGGAGG CCCGTGAGCG CGTTCGCGAG
TCACGCGAGC GCATCGTCGA CATCGTCGAG TCCGGGCAGG CCGTCTACGG TGTGAACACG
GGATTCGGCG AACTCGTCCA GGAACGGATA CCCGAGGACG ACATCGAAAC GCTCCAGCAG
AACCTCGTCC GGAGCCACGC TGCCGGAACG GGCCGCAAAC TCGATCAGGA CGAGGTCCGA
GCGATGCTCG TCACCCGGCT CAACGCGCTG GTGAAAGGGT ACAGCGGCGT CCGAGAGCGG
ATCGTTGACG TTCTCGCAGG GATGATAAAC GAAGGGGTCC ATCCCGTCGT GAAGGCGAAA
GGGAGTCTCG GTGCGAGCGG CGATCTCGCC CCTCTCGCCC ACCTCGCTCT TGTCGTCACA
GGAGAGGGCG AAGCCACTGT GGAGGGTGAA CGACTTCCGG GTGGGAAGGC CCTGAAGCGG
AAGAACCTCG AACCAGCGAC GCTTCATGCA AAGGAGGGGC TGGGGCTCAT CAATGGGACC
CAACTGACCG TCGGGTTGGC CTCGCTCGTC GTGTGTGACG CTGAACGTGC GATGCGGGCG
GCGGATATCG CCGGTGCGAT GACTACGGAA GCGACGATGA GTACGACCGC GAGCTCACAT
CCCAGTATCC AGCGCGTCCG GCCACATCAG GGGCAAACCG AAAGCGCGGA GAACGTCCGT
CGGCTCACTC AGAACTCCGA GATCGTCGAG TCGCACCGTA ACTGTGACCG GGTGCAGGAC
GCGTACTCGC TTCGCTGTCT CCCCCAAGTA CACGGTGCGG TCCGGGATTC GATCCAACAC
CTCCGTGAAG CCGTCGAGAC GGAGCTCAAC AGCGCGACAG ACAACCCGCT CGTATTCCCC
GCCGACGACG CGGACGACCG CGCAAGCGGC ACTGAGCGGG CTGCCGTCCT TTCAGGCGGG
AACTTCCATG GGCAACCATT GGCCCTTCGG CTGGATTACG TCACGAGCGG TCTCGCGGAA
TTGGCCTCGA TCTCGGAGCG GCGGATGGAC CGAATGCTCA ACCCTAATGT TCAGGAAGAA
CATCTGCCCC CGTTCCTGAC TGAAGGGAGC GGCCTTCGCT CGGGGTATAT GATCGCCCAG
TACACCGCCG CAGACCTCGT GAGTACGAAC CGTTCGCAGG GACGCCCGTC GATGGACAGC
ATCCCCGTCA GCGGGAATCA AGAGGACCAC GTCAGTATGA GCGCACAGAG CGCCCATATC
GCGAGCGAAA CCGTCAACTC TACGCTTCGT GTCGTCGGGA TCGAACTGGC CTGTGCGGCC
CAAGCGCTCG ATTTTATTGA GGATTGTTGC CCCGGCCTCG GGACCCACGC GGCCTATCAC
ACAATTCGCG AACACGTCCC TCACCTCAAC GAAGACCGGC CGATCCATCG AGACATCACG
TCTATGCTGG CGATCCTTCG CTCCGATACG TTATTCGACG CCGTCGAAAC GGCACTCGAC
GAGCCACTGT CGTAA
 
Protein sequence
MTDEPVVVDG ESLTPDAVER VARHGATVRI PEEARERVRE SRERIVDIVE SGQAVYGVNT 
GFGELVQERI PEDDIETLQQ NLVRSHAAGT GRKLDQDEVR AMLVTRLNAL VKGYSGVRER
IVDVLAGMIN EGVHPVVKAK GSLGASGDLA PLAHLALVVT GEGEATVEGE RLPGGKALKR
KNLEPATLHA KEGLGLINGT QLTVGLASLV VCDAERAMRA ADIAGAMTTE ATMSTTASSH
PSIQRVRPHQ GQTESAENVR RLTQNSEIVE SHRNCDRVQD AYSLRCLPQV HGAVRDSIQH
LREAVETELN SATDNPLVFP ADDADDRASG TERAAVLSGG NFHGQPLALR LDYVTSGLAE
LASISERRMD RMLNPNVQEE HLPPFLTEGS GLRSGYMIAQ YTAADLVSTN RSQGRPSMDS
IPVSGNQEDH VSMSAQSAHI ASETVNSTLR VVGIELACAA QALDFIEDCC PGLGTHAAYH
TIREHVPHLN EDRPIHRDIT SMLAILRSDT LFDAVETALD EPLS