Gene Information |
Locus tag | Htur_0876 |
Symbol | |
ID | 8741460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 892670 |
End bp | 894349 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646511454 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003402444 |
Protein GI | 284164165 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
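The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon, neither of which the record states. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose span includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(892670, 894349)
print(length_bp)                  # 1680
print(protein_length(length_bp))  # 559
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.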
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGACCGTA TTCAGCGGCT CATCACCGGT GCGGCCAGCG ATCAACTCTC GATAACTATC CTCACGCTGC CCACTGATAA CGACCGACAG GCGATTCAGA TCGCCCGCCA GCTCGAGAAG AACCTCAACG CGGTTGGTAT CAACGTCCGG ATCGAACCCC GCGCCAACGC CGAGTTGCTG CAGACGGTCT TGCTCGAGCA CGACTTCGAT TGTTACATCA GCAGCCACCC GGGCGGCTAC GACCCCGACT TTCTGTACGA GACGTTTCAC TCGAAGTTCG CCCCCGAATC CGGCTGGCAG AACCCCTACG GGTTCGTAGA TACCGAGATC GACGAGGTAC TCGAGCGACA GCGCTCGACA ACGGGGTCGG AACGAAAGGA CGCCGTCGGC GAAGCGCTCG ACCGACTGGC GCAAACGCAC CCGATCGTGC CCATCTGCAT TCCCGACGAG CGGCGACTCG TCAGAACGGA TCGGTTCGAC GGCTGGAACG ACCACCACCT CGGGACCCAA CTCGGCTACC TCGGTCTCGA GCCCGACGAG GACGACTTCG AGGACGAGGT GGTCCTGAAC GCGGTGATCA CCGACTCGAG CCCGTCGAGA AACCTCAACC CGCTTGCGGC GCCGTACCGC TACCGGGGCC CGTTCATCGA TCTCCTCTAT GATTCGATTG CGACCGAAGA CAACGGGGAA CTCCGGCCGT GGCTCGCCGA GTCCTTGGAG TGGGACGGCT CGACCGCGAC GGTCACGCTC CGATCGAACT GTCGGTTCCA CGACGGCGAG CCGGTGTCCG CCGACGACGT CAAATTCACG TACGAATTTC TGGACGACAC GTCACTCGGC GTCCGCGACG GACAGTCTCC GGCGCCGAGA TACCGCGGAC TGGGCGACGC CGTTGAATCG GTGACGGCCG TCGACGATCT GACCCTCCGG ATGCGATTCG GGACGAGCGA CGAAGTCGGC AAACTGGCCT TTACGGTCCC GATTCTCCCG AAACACATCT GGAAGACGGC GGTCGAAGAC CGCCTCGAAA ACGGCGCCGA GCCCATCCAG GGGACGTGGG ATATCGTGAC CACCGACGAG ATCTCGCCGA TCGGCAGCGG TCCGTACGCA CTCGCCGAGC GGGAGCCACG GTCGCACATT CGATTCGAGC GCTTCGGCGA TCATTTCACC AGTCGCGAGT ACGTCGACCT GCCGGAACCC CGCGTCGACG AACTCGTCTT TCACGTCGAG CCGAACAGTA GCGCGGCCAT CGAACAACTC GAGGCGGGGA ACGCCGATGT GACGGCCTCC AGTCTCGGAG CGGAAACGGT CGGGGACGCT CCGTCCGGCC TCGAACTCGT CGAGTCCAAG TCGTGGTCGT TCTATCACAT CGGGTTCAAC GTCCGGAACT CGCCGTTCAG CAACCTCCAC TTCCGACGAA ACGTCGCGCG ATTGATCGAC AGGGAATCGC TCGTGGCGGA CGTCTTCAAC GGGCAGGCGA GCCCGTCCGT CACACCGGTT ACGGAGGAGT GGGTACCGGA CGACCTCGAG TGGGACGGTG CCGCACCATA CGCGCCCTTT TTCCGTGATG AAACGAACAG TGGGGATACG GGAGAACTGG ACGTCGAACG GGCGAAGCGA TCCTTCGAAC GGCACGGATT TCAGTACGAC GAAGAGGGAG AATACATCGT GAGGTCCTGA
Protein sequence | MDRIQRLITG AASDQLSITI LTLPTDNDRQ AIQIARQLEK NLNAVGINVR IEPRANAELL QTVLLEHDFD CYISSHPGGY DPDFLYETFH SKFAPESGWQ NPYGFVDTEI DEVLERQRST TGSERKDAVG EALDRLAQTH PIVPICIPDE RRLVRTDRFD GWNDHHLGTQ LGYLGLEPDE DDFEDEVVLN AVITDSSPSR NLNPLAAPYR YRGPFIDLLY DSIATEDNGE LRPWLAESLE WDGSTATVTL RSNCRFHDGE PVSADDVKFT YEFLDDTSLG VRDGQSPAPR YRGLGDAVES VTAVDDLTLR MRFGTSDEVG KLAFTVPILP KHIWKTAVED RLENGAEPIQ GTWDIVTTDE ISPIGSGPYA LAEREPRSHI RFERFGDHFT SREYVDLPEP RVDELVFHVE PNSSAAIEQL EAGNADVTAS SLGAETVGDA PSGLELVESK SWSFYHIGFN VRNSPFSNLH FRRNVARLID RESLVADVFN GQASPSVTPV TEEWVPDDLE WDGAAPYAPF FRDETNSGDT GELDVERAKR SFERHGFQYD EEGEYIVRS
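The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = clean_sequence("ATGGACCGTA TTCAGCGGCT")
print(len(example))
print(gc_percent(example))
```

Applying the same two functions to the complete Gene sequence field would give values that can be compared with the Gene Length and GC content fields. The result for the full sequence is not asserted here.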