Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4883 |
Symbol | |
ID | 8745513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013746 |
Strand | + |
Start bp | 74727 |
End bp | 76064 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646515358 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003406305 |
Protein GI | 284172924 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTCT TCCTGTGTCA CAGACACATG GCTGGTAATA GCATGCGCAG ATCGTGGCAG AAGTATCGTT CCGTTCCCAT CATCTATCGG ATCGGCGTAG CGTTCATTCT CGGATCGGTT TTAGGACTAA TAGTCGGCGA ACCGGCGACC CGTCTTGAGC CACTCGGCAC TCTTTTTGTC CGACTTCTCA CGATGATTAT CATACCCATT GTGTTCTTCA CTCTGCTGAT GGGGGCACGG CGGCTCTCTC CTTCGAGTCT CGGGAAAATC GGTGCTCAAA CTGTATTTCT ATATATTATT ACGACCGGTG TAGCAATCGG ATTTGGACTA TTAGTTGGGA ATTTGATAAA TCCTGGAACC GGCCTCGAAC TGGCCGATAC AAATGTTGAG CCTGAAGAAG CACCCAGTAT GCTTGAAGTC TTCCTGAACA TCGTCCCGGA GAATCCGGTG GGTTCGATGG CCGAGGGGAG TGTCCTCCCA ACGATCTTTT TCACTATCGT GTTCGGTCTG GCGCTGACAT ACCTTCTAGA CGAATACGAT GCCGGGACAA CCGTTCACGA GGGCGCTCAG ACGGTGTTCA ACATCGCCGA GACCGGTGCG GAGGCGATGT TCAAAATTGT CTGGGGCGTC ATGGAGTATG GCGTTATCGG GGTATTCGCA TTGATGGCAG CGACATTTGG GCAAGCAGGC GTCAGTGCCA TCGTGCCGTA CGCAAAACTG ATTGGAGCGG TTGCCCTCGC AGTTGGACTC CACATCGGAG TCACGTACCT CTTAATTATA CAGGTTGGGC TACTCCGGAG ATCGCCGATA GACTTCCTGC GAGGAGCTAA GGATGCGATG GTGACTGCCT TGAGTATCCG TTCGAGTAGC GGAACGCTTC CAGTTACCAT GGAAGACGCT GACAAGAACT TCGGCGTCAA CGAGGAAGTT TACAGCTTCT CACTGCCACT CGGGGCGACA ATTAACATGG ACGGGACAGC GATGTACCAG GGCATTGCAG CTATTTTTGC TGCCAACATG GTGGGACAGA CGCTTACCCT CGGAGAGCAA CTGACTGTCC TTGTGACAGC CCTCCTCGCA AGCGTTGGAA CCGCTGGCGT TCCTGGGAGT GGGTTAATCA TGCTAACCTT AGTTCTCACC CAACTCGGAC TCCCGCTTGA GGTCGTTGGT ATGGTGGCAG GTGTCGATCC GATACTAGAT CGGATGCGGA CGATGAACAA CGTGACTGGT GACCTCGCGG TGACGACCCT CATTGCCGAC TGGAACGGCA AGATAGACCT CACGGGAACC GTCTGGGAGG TGACAGATAA GGTGAGTTCG GTTACGAGCA CCGACTGA
|
Protein sequence | MALFLCHRHM AGNSMRRSWQ KYRSVPIIYR IGVAFILGSV LGLIVGEPAT RLEPLGTLFV RLLTMIIIPI VFFTLLMGAR RLSPSSLGKI GAQTVFLYII TTGVAIGFGL LVGNLINPGT GLELADTNVE PEEAPSMLEV FLNIVPENPV GSMAEGSVLP TIFFTIVFGL ALTYLLDEYD AGTTVHEGAQ TVFNIAETGA EAMFKIVWGV MEYGVIGVFA LMAATFGQAG VSAIVPYAKL IGAVALAVGL HIGVTYLLII QVGLLRRSPI DFLRGAKDAM VTALSIRSSS GTLPVTMEDA DKNFGVNEEV YSFSLPLGAT INMDGTAMYQ GIAAIFAANM VGQTLTLGEQ LTVLVTALLA SVGTAGVPGS GLIMLTLVLT QLGLPLEVVG MVAGVDPILD RMRTMNNVTG DLAVTTLIAD WNGKIDLTGT VWEVTDKVSS VTSTD
|
| |