Gene Information |
Locus tag | Htur_2894 |
Symbol | |
ID | 8743511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2965855 |
End bp | 2967120 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646513479 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003404436 |
Protein GI | 284166157 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
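The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon, if counted in the gene length, encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Htur_2894.
length = gene_length(2965855, 2967120)
print(length)                  # 1266
print(protein_length(length))  # 421
```

Under these assumptions the computed values agree with the Gene Length (1266 bp) and Protein Length (421 aa) fields of the record.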
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGAG ATAACAGGCC GCGTGGCCGG AGGCGTGATT TACTAAAGGC AATTACCGCT GGCGGTACGA TCGGGATATC CGGATTGGCC GGCTGCGTCG GCGATCCGGA CGAGTTGGGT GGCGGAGACG ACGAGAACTT CGACACGGTC CAGTTCGGCG TTCTCGAGCC GAGGACAGGC GAGTTTAGCG CTCTCGCGAA GGAACACTAT CAGGGAACCG AACTGGCGAT CCAGCAGATC AACGACAGCG ATGAGTACGA CTTCACGATC GAACACGAGG AGTACGATAC GCAACTCGAC CCGGCGACGG CGACCCAGCA GGCCCAGCAG GCGGTCCAAT CCGACGGCGC ACAGTTCATC AGCGGCTGTA TCTCGAGTTC GGCCGCGCTC GCGATCAACA GTTTCGTCGC CGATAACGAA GTCGTCTACA CGCCGGGAGC GGCGGATATC TCGATCACCG GCGAGAACTG CAACGAGTAC GTGTTTCGGT TCGAGACGAG CACCGCACAG ATCGCGGAGG TGATGGCCCA GTGGACCGCC GACGAACTCG GCGATCAGAT CGTCTATCAC ATCGCGGACT ACGCGTACGG CGAGTCGGTA CTGAACGAGG TCGAGACGCG AATGGAGTCC ACTAGCGACT CCTACGAGCG GGTCAACGTA ACCAGGTCGG ATCAGGGCTC GACGAACTTC GAGGCGTTCA TCAGTCAGAT TTCGGACGTC AGTGACGAGG CCGACGCGCT CGTCGTGGGG ATGACCGGTG CCGACCTCGC GATTTTCCTC TCGCAGGCCA GTTCGCGCGG CCTGCCGGAC GAGATCCCCA TCGTGACGAC GACCGGTTCG TTCCGAGCCG TACGGGCGGG CGGCGGAGAG GGTGTGTACA ACACCTACAG CGGTGTTCGA TACGTTCCGG AGATCGAAAC CGGAGACAAC CAGGAGTTCG TCCAGGCCTA CGAAAGCGAG TACGACGCCC CGCCGGACAA CTTCTCGCGC GTCGGCTACG AATCGATCCG CATGGTCGCC AACGGCATCC GCGAGGCGGG GTCGCGCGAT CCCACGACGG TCAGGGAGTC GCTCTCGGGC ATGGAACACG ACACGATCTT CGGTCCCAAC CGGTTCCGGA AGTGCGACCA GCAGGCGATG AATCCGGTCT GGATGGGCGA GTGCGTCGAA CCGGACTCCG GGGAGCTCGC CGACGTCGAA CTCCTGACCC AACTCTCCGG CGAAGAGGCC GCACCCGACT GCGAGGAAAC CGGCTGTGAA CTGTAA
|
Protein sequence | MARDNRPRGR RRDLLKAITA GGTIGISGLA GCVGDPDELG GGDDENFDTV QFGVLEPRTG EFSALAKEHY QGTELAIQQI NDSDEYDFTI EHEEYDTQLD PATATQQAQQ AVQSDGAQFI SGCISSSAAL AINSFVADNE VVYTPGAADI SITGENCNEY VFRFETSTAQ IAEVMAQWTA DELGDQIVYH IADYAYGESV LNEVETRMES TSDSYERVNV TRSDQGSTNF EAFISQISDV SDEADALVVG MTGADLAIFL SQASSRGLPD EIPIVTTTGS FRAVRAGGGE GVYNTYSGVR YVPEIETGDN QEFVQAYESE YDAPPDNFSR VGYESIRMVA NGIREAGSRD PTTVRESLSG MEHDTIFGPN RFRKCDQQAM NPVWMGECVE PDSGELADVE LLTQLSGEEA APDCEETGCE L
|
| |
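The sequences above are printed in space-separated groups of ten characters. The listed gene sequence starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the grouping could be removed and the GC fraction and translation recomputed. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; the alternative start codons of table 11 are not handled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in TCAG order; translation
# table 11 uses the same assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence from the record.
prefix = "ATGGCTCGAG ATAACAGGCC GCGTGGCCGG"
print(translate(prefix))  # MARDNRPRGR
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.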