Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4436 |
Symbol | |
ID | 8745065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 15581 |
End bp | 16516 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514973 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_003405920 |
Protein GI | 284172538 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0722382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGAG CTAGCCTTAG CGAGCGCTTC CAGGACGTCG CCTTCACCAC CGCCGTCCCG TTCAGCGACG ACGGATCGGA CGTGCTATAC GAGGACCTCG CGGACAACCT CGCGAAGCAG TACGACGCCG GCGCGCGCCT GTTCATCCCC TGTGGCAACA CCGGAGAGTA CTACTCGCTG ACCGACGAGG AGCGAACCGA GATCGTCGAG ACCCACGTCG AGGCGACCGG CGACGAGGCG ATGATCGCCG GCGGCGTCGC CGGCAGCCTC GCGGAGGTCG AACGGCTCGC CGACGCCTAC GAGGACGCCG GGGCGGACGC GATCATGGTG ATGCACCCCG ACCACACCTA CCTGCACCAG CGCGGGCTGG CGAACTACTA CCACCGGATC TGCGACGCGA CCGACCTCGG CGTCGTCATC TACAAGCGCG GTCCCGAGGT GCCCCGCGAC GTGATCGTCG ACCTCTCCGA GCGCGAGAAC GTGGTCGCGG TGAAGTTCGC TGTCAACGAT ATCAAGGAGT TCTCCCAGAC CGTCGCGGAC GCCCCGGGCG AGGTCACGTG GGTCAATGGC ATCGCCGAAC GGTACGCGCT CTCCTTCGCC ATCGAGGGGG CGACGGGGTA CACCACCGGT CTCGGCAACT TCGCGCCGGA GGCGACGCTG GCGCTGTTCG ACGCCGTCGA GGACGAGAAC TGGGAGCGAG CCAGATCGAT CCAGCGGCTA CTCCGTCCGA TCGAGGACCT TCGCGAGGAA CCCGGCGAGG ACAACGCGCT CTCCGGCGCG AACAACGTCT CCGTCATCAA ACGCGGGATG GATCTCGCCG GCTATACGGG CGGTTCCCTC CGCGATCCGC TGGTCGATCT CTCCGCCGAC GACGCGGCGC GTCTCGAGGA GTACTACGAA ACCGTACAGT CGACGCCGCT GCTGGAAGCG GCCTGA
|
Protein sequence | MPRASLSERF QDVAFTTAVP FSDDGSDVLY EDLADNLAKQ YDAGARLFIP CGNTGEYYSL TDEERTEIVE THVEATGDEA MIAGGVAGSL AEVERLADAY EDAGADAIMV MHPDHTYLHQ RGLANYYHRI CDATDLGVVI YKRGPEVPRD VIVDLSEREN VVAVKFAVND IKEFSQTVAD APGEVTWVNG IAERYALSFA IEGATGYTTG LGNFAPEATL ALFDAVEDEN WERARSIQRL LRPIEDLREE PGEDNALSGA NNVSVIKRGM DLAGYTGGSL RDPLVDLSAD DAARLEEYYE TVQSTPLLEA A
|
| |