Gene Information |
Locus tag | Htur_1414 |
Symbol | |
ID | 8742005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1468262 |
End bp | 1470172 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646511992 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003402975 |
Protein GI | 284164696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
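The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1468262
end_bp = 1470172
gene_length_bp = 1911
protein_length_aa = 636

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not translated.
codons = span_bp // 3
translated_aa = codons - 1

print(span_bp, span_bp % 3, translated_aa)  # prints: 1911 0 636
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.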
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCA AAGAAAATCA TGATGGATCG GCCACCGAGG GAGGTGACCG ACTCACGCGA CGCGGCTACG TCGCCACGGC CGCCGCGGCG CTCGGGACGA GCGTGCTCGC GGGCTGTAGC GGCAGTCGCG GCACCAGCCT CGAGCCGGAC GCGCCCGATG GGGTACCGGA AACGGTCGAG ACGCAGTACT GGCGCGAGTG GGAGACGATC GACGCCGACT CGCCGCCACT GGAGTACAGC GCGACGGCCG GTGCGGTGCT CGATCGGTTC CCCGTCGAGT TCTCGAGCGA GGACGACCCG TGGATGCGCG AACACGCGTT GATGGTCAAG CGGGGGCTCG GCGATTTGGG TATCGCCGTC GAACTCAACG ACCGTCCGCT GAATCAGCTG TACGCCCAGA GCTGGGAAAC GCGCGGACTC GAGGCGATCG TCTCGATGAG TACGCACGGA CCGGACCCGC AGCGGGGGCT CGATCCGAAC CCGCTGTTGA TGCGGCGGAC CGAGGGCTCG CTCTCGAACT ACGATAACTA CTACCATCCG GAACTCCAGG AGGTGCTCAC CGAGCAGGCC CAGACGACCG ATCGGGCCGA ACGCGAGGAG CTCGTCGACC GGGCACAGGA ACTCTTCGCC GAGGACGTCG GGGCGCTCAT CACGCTCTTC CCGGAGATCA TCACGGCGGT GAACACGGAC CGATGGACCG GCTACGTGGA GACGCCGGGG AACGGCCCGA CGATGGACTC GTTCGTCTGG ACGGAGGTCA ACCTCCAGCC CGAGACGGAC AACCGGACCT ACGTCAAGGG CGTCACCACG TCGATGAACT CGCTGAACCT GCCGTGGGCC GCCGGCGGCG CGGAGGCCAA TCGACTTACG TTCATCTACG ACGGGTTATT CGACGCGACG CCCGATTTGG ACGTGGCCCC CGCACTGGCG ACCGGCGGCG ATTTCGTCGA CGACACGACC GTCGAACTCA CGCTGCGCGA GGGCGTCGAG TGGCACGACG GCGAGGCGTT CACCGCCGAG GACGTGAAGT TCACCGTCGA ACTGTACAAG GAGTACTCCT CCACGAGTCA GGTCCCGTTC TACGAGCCGA TCGAGTCCGT CGAGGTACTC GGGGACCACG AGGTCCGGTT CGAGCTGTCG AACCCCGACG CCTCGTTCAT GACCCAACGG GTCGTCCGGA GCGTCATCCT CCCGAAACAC CGGTGGGAGG ACGTCGACAA CCCGTCGCAG CACAACCCGG ACGCCCCCGT TGGCACCGGC CCCTTCCAGT TCGAGAACTG GGAGCAGGGG ACCCGATTCG AGGCCACGCG CAACGACGAC CACTGGATGT TCGACGACGA CTGGCGGGCC GACGCCCTCG GCGAGCAGGC CGAGCGCGGC CCCGGCATCG AGAGCGTCAT CTGGATCAAC GTGAGCAACG TCGACGCGCT GATCGGCTCG CTCCAGAGCG GATCGATCGA CGCCATCGGG ACGACCCTCT CGACCCTGCA GGCCGACCGG GCGGCCAACA CGGACGGGAT CGAGAAGCTG TCGACCGGGA GCTACGCGCC GCTCGACACG AAGCTCATGT TCTCCTGTCC GCCGATCAGG GACAAGGAGT TCCGCGTCGC ACTGGCGAAA GCGGTCGACT CGCAGGGGTT CGTCGACGAC TTCCTCGACG GGCAGGCGAC GGTGCCGGCC GGCGAGAACC CGATCTCGTC GCTCACCCAG TGGCACAACG CCGACACGAC CGACTACAGT TACGACGTCG AGGAAGCCCG GAACGTCCTC GAGCGCGCGG GCTACACCTG GGACGACGAC 
GGCAACCTGC GGTTCCCCAA CGGCGAGGCG TGGGGCGCGT TCGTCGACCG CATTCAGCCC GAGAACACCC ACAAACGCCG CTCGGAGCTC GGCCAGCCCG ACTTCTCATG A
|
Protein sequence | MTTKENHDGS ATEGGDRLTR RGYVATAAAA LGTSVLAGCS GSRGTSLEPD APDGVPETVE TQYWREWETI DADSPPLEYS ATAGAVLDRF PVEFSSEDDP WMREHALMVK RGLGDLGIAV ELNDRPLNQL YAQSWETRGL EAIVSMSTHG PDPQRGLDPN PLLMRRTEGS LSNYDNYYHP ELQEVLTEQA QTTDRAEREE LVDRAQELFA EDVGALITLF PEIITAVNTD RWTGYVETPG NGPTMDSFVW TEVNLQPETD NRTYVKGVTT SMNSLNLPWA AGGAEANRLT FIYDGLFDAT PDLDVAPALA TGGDFVDDTT VELTLREGVE WHDGEAFTAE DVKFTVELYK EYSSTSQVPF YEPIESVEVL GDHEVRFELS NPDASFMTQR VVRSVILPKH RWEDVDNPSQ HNPDAPVGTG PFQFENWEQG TRFEATRNDD HWMFDDDWRA DALGEQAERG PGIESVIWIN VSNVDALIGS LQSGSIDAIG TTLSTLQADR AANTDGIEKL STGSYAPLDT KLMFSCPPIR DKEFRVALAK AVDSQGFVDD FLDGQATVPA GENPISSLTQ WHNADTTDYS YDVEEARNVL ERAGYTWDDD GNLRFPNGEA WGAFVDRIQP ENTHKRRSEL GQPDFS
|
| |
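The gene sequence above is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The sketch below is a hedged illustration, not the database's own procedure: it builds the codon table used by translation table 11 (whose amino acid assignments match the standard code) and translates only the first and last blocks of the sequence. The function name `translate` is hypothetical, and alternative start codons permitted by table 11 are ignored.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence, copied from the record.
first_block = "ATGACGACCA AAGAAAATCA TGATGGATCG"
last_block = "GGCCAGCCCG ACTTCTCATG A"

print(translate(first_block))  # prints: MTTKENHDGS
print(translate(last_block))   # prints: GQPDFS
```

Both fragments agree with the start and end of the protein sequence listed above.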