Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0883 |
Symbol | |
ID | 8741467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 900332 |
End bp | 901975 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646511461 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003402451 |
Protein GI | 284164172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACACA CTGGTGAGGG TGATTCTGAC CGAATCCCTC GTCGATCGGT GCTCGCCGGA CTTGGGGGGT TCGGCATGGC CGGTGCGGCC GGCTGTATCA GCACCGATAT CGGGATCGAC CTCGATAGCG ACCCCGAGAG ACTCATCTTC GAGGGCTTTC AGGAGGCGGG CGTCGAACCG CCCGTCGAGA CGACGATTTA CTCGAACGCC GAGACCGAGG GGCGAAAGCG GTGGGCGCAG TTGGTCCAGC ACGAACTGGA CGGCACCGAT CTGTTCGACG TCGAGTTCGA GACCCTCGAG TGGACGTCCT ACATCGATCT GGTCAACAAC ATGGCCGCCA ACGAGGAGAA CGCGCTCGTC TGTCTGGGGT TTATCGGCGG CTGGGATCCG CATCAGTACG TCTATCCTGG CTTTCACTCC GACAGTTTCG CTCCGACCGG CCTCAACATC AACCACTACG AGAACGAGCG GGTCGACGAG CTCATCGACG AAGGCGTTGC GACCGTCGAC GGCGACGAGC GCGTCGCGAT CTACGAGGAG TTGCAGGAAC TCCTCGTCAA GGAATCGCCG CTGTCGTTCG TTCGCGCGCC CGAAGAGATC GTCACCTACC GAGCCGACGC GATCGACGGA TTCCGGACGT ATCCGGTCCC CGGCGATGAG TACAAGTCGA TCTACGCGCC GACGCTCGGC GTGTACACGG AGCTCACCAC CGACGAGACC GAGCTCGTCG GCGACGCAGG GACGAAGATC GACAGCTACG ATCCGGTCCG GGCGGCCGAC GACGTCTCGT ATATGGCCAC CGGCCTGCTC TACGAACAGC TCCTCGAGAT CGACTTCGAC GGGAGCGCTC GGCCGCTGCT CGCGACCGAC TGGCACCGGA TCGACGAGAC GACCTGGCGG TTCGACCTGC GAGAGGGCGT CCAATTTCAC ACCGGCGAGC CGTTTACCGC CGCGGACGTC CGTGCGACCC TCGAGCGATA CGAGGGGTCG CCGAGCGAGC AGGACGTGTA CAACTGGTAC GAGAGCGCAG AGATCCTCGG CGACCACGAG ATCGAAATTT CCCTCCGGCG GCCGTACGGG CCGCTGGAAA CGGCGATCGC GCAGGTGCCG ATCCTCCCGA AGGCCGTCGC GGACGGCACG CACGACATCA CCGAGCGACC GGTCGGAACC GGCCCGTACG CGTTCGAGGA ACACGAGCCC GCCACCCTCT GGCGGCTGGT CGCCAACGAG GACCACTGGC ACGACGGGAG CAACGGCGTC CCCGAGACGC CGCCGATCGA GACCGTGACG ATGCGGATCG TCACCGAATC GTCGGCCCGG CGCGGCGCCC TCGAGGCGGG CGATATTCAC CTGACCACGG GGCTCCCGAA CGAGAGCCTC GAGACGTTCG AAGCCGACGA CGCGTTCGTC GTCGACCGAA CGACCGGCGC CGCCGTCGAC TTCCTCGGCT ATCCGAGCTA CCGCGAGCCG TTTTCCAATC CGAAGGTTCG GCGCGGGATC GGCCAACTCA TCCCGCGCGA GCGGATCATC GAGGACGTGT TCCACGGCGC CGGCACCGTG GCCTACACGC CGATCTCGCC GACCCACGAG ACCTTCGTCG GTCCAGAGTT CGAAGCGCGA ATCGTCGAGG AGTACTTCAG CTAA
|
Protein sequence | MAHTGEGDSD RIPRRSVLAG LGGFGMAGAA GCISTDIGID LDSDPERLIF EGFQEAGVEP PVETTIYSNA ETEGRKRWAQ LVQHELDGTD LFDVEFETLE WTSYIDLVNN MAANEENALV CLGFIGGWDP HQYVYPGFHS DSFAPTGLNI NHYENERVDE LIDEGVATVD GDERVAIYEE LQELLVKESP LSFVRAPEEI VTYRADAIDG FRTYPVPGDE YKSIYAPTLG VYTELTTDET ELVGDAGTKI DSYDPVRAAD DVSYMATGLL YEQLLEIDFD GSARPLLATD WHRIDETTWR FDLREGVQFH TGEPFTAADV RATLERYEGS PSEQDVYNWY ESAEILGDHE IEISLRRPYG PLETAIAQVP ILPKAVADGT HDITERPVGT GPYAFEEHEP ATLWRLVANE DHWHDGSNGV PETPPIETVT MRIVTESSAR RGALEAGDIH LTTGLPNESL ETFEADDAFV VDRTTGAAVD FLGYPSYREP FSNPKVRRGI GQLIPRERII EDVFHGAGTV AYTPISPTHE TFVGPEFEAR IVEEYFS
|
| |