Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4040 |
Symbol | |
ID | 8744668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 293461 |
End bp | 295035 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514606 |
Product | histidine ammonia-lyase |
Protein accession | YP_003405553 |
Protein GI | 284167275 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACG AACCAGTCGT CGTCGACGGG GAATCGCTCA CACCGGACGC TGTCGAACGC GTCGCACGGC ACGGTGCCAC CGTTCGTATC CCGGAGGAGG CCCGTGAGCG CGTTCGCGAG TCACGCGAGC GCATCGTCGA CATCGTCGAG TCCGGGCAGG CCGTCTACGG TGTGAACACG GGATTCGGCG AACTCGTCCA GGAACGGATA CCCGAGGACG ACATCGAAAC GCTCCAGCAG AACCTCGTCC GGAGCCACGC TGCCGGAACG GGCCGCAAAC TCGATCAGGA CGAGGTCCGA GCGATGCTCG TCACCCGGCT CAACGCGCTG GTGAAAGGGT ACAGCGGCGT CCGAGAGCGG ATCGTTGACG TTCTCGCAGG GATGATAAAC GAAGGGGTCC ATCCCGTCGT GAAGGCGAAA GGGAGTCTCG GTGCGAGCGG CGATCTCGCC CCTCTCGCCC ACCTCGCTCT TGTCGTCACA GGAGAGGGCG AAGCCACTGT GGAGGGTGAA CGACTTCCGG GTGGGAAGGC CCTGAAGCGG AAGAACCTCG AACCAGCGAC GCTTCATGCA AAGGAGGGGC TGGGGCTCAT CAATGGGACC CAACTGACCG TCGGGTTGGC CTCGCTCGTC GTGTGTGACG CTGAACGTGC GATGCGGGCG GCGGATATCG CCGGTGCGAT GACTACGGAA GCGACGATGA GTACGACCGC GAGCTCACAT CCCAGTATCC AGCGCGTCCG GCCACATCAG GGGCAAACCG AAAGCGCGGA GAACGTCCGT CGGCTCACTC AGAACTCCGA GATCGTCGAG TCGCACCGTA ACTGTGACCG GGTGCAGGAC GCGTACTCGC TTCGCTGTCT CCCCCAAGTA CACGGTGCGG TCCGGGATTC GATCCAACAC CTCCGTGAAG CCGTCGAGAC GGAGCTCAAC AGCGCGACAG ACAACCCGCT CGTATTCCCC GCCGACGACG CGGACGACCG CGCAAGCGGC ACTGAGCGGG CTGCCGTCCT TTCAGGCGGG AACTTCCATG GGCAACCATT GGCCCTTCGG CTGGATTACG TCACGAGCGG TCTCGCGGAA TTGGCCTCGA TCTCGGAGCG GCGGATGGAC CGAATGCTCA ACCCTAATGT TCAGGAAGAA CATCTGCCCC CGTTCCTGAC TGAAGGGAGC GGCCTTCGCT CGGGGTATAT GATCGCCCAG TACACCGCCG CAGACCTCGT GAGTACGAAC CGTTCGCAGG GACGCCCGTC GATGGACAGC ATCCCCGTCA GCGGGAATCA AGAGGACCAC GTCAGTATGA GCGCACAGAG CGCCCATATC GCGAGCGAAA CCGTCAACTC TACGCTTCGT GTCGTCGGGA TCGAACTGGC CTGTGCGGCC CAAGCGCTCG ATTTTATTGA GGATTGTTGC CCCGGCCTCG GGACCCACGC GGCCTATCAC ACAATTCGCG AACACGTCCC TCACCTCAAC GAAGACCGGC CGATCCATCG AGACATCACG TCTATGCTGG CGATCCTTCG CTCCGATACG TTATTCGACG CCGTCGAAAC GGCACTCGAC GAGCCACTGT CGTAA
|
Protein sequence | MTDEPVVVDG ESLTPDAVER VARHGATVRI PEEARERVRE SRERIVDIVE SGQAVYGVNT GFGELVQERI PEDDIETLQQ NLVRSHAAGT GRKLDQDEVR AMLVTRLNAL VKGYSGVRER IVDVLAGMIN EGVHPVVKAK GSLGASGDLA PLAHLALVVT GEGEATVEGE RLPGGKALKR KNLEPATLHA KEGLGLINGT QLTVGLASLV VCDAERAMRA ADIAGAMTTE ATMSTTASSH PSIQRVRPHQ GQTESAENVR RLTQNSEIVE SHRNCDRVQD AYSLRCLPQV HGAVRDSIQH LREAVETELN SATDNPLVFP ADDADDRASG TERAAVLSGG NFHGQPLALR LDYVTSGLAE LASISERRMD RMLNPNVQEE HLPPFLTEGS GLRSGYMIAQ YTAADLVSTN RSQGRPSMDS IPVSGNQEDH VSMSAQSAHI ASETVNSTLR VVGIELACAA QALDFIEDCC PGLGTHAAYH TIREHVPHLN EDRPIHRDIT SMLAILRSDT LFDAVETALD EPLS
|
| |