Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3995 |
Symbol | |
ID | 8744623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 248735 |
End bp | 250039 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646514569 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003405516 |
Protein GI | 284167238 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.845603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTCGA CAGTCGGTCC GGTATGGCAG CGGTACCGGT CGGTGCCATT GATCTATCGC ATCACACTCG CGTTCCTTCT CGGATCCGCA GCGGGGATCG CATTCGGTGA GCAGATGACG GTCGTCGAAC CACTCGGAGA CCTGTTCTTG CGGCTGCTCA ACATGCTGGT GATCCCGATC ATCGTCTTCA CGCTGCTGAC TGGAATCCGC CAGCTATCGC CAGCTCGACT CGGCAAGATC GGGGGAGCAA CTGTCGGCCT CTACGCAGTA ACGACCACCA TCGCAGGCAT CATCGGACTT GCCGTCGCGA ACGCCCTTCA ACCCGGCCGT GGCGTCGAGT TTACCGGCGG TGAAGCTGAG TCCCAGGCAC CCCCCTCACT CACCGAGGTC GTGCTTGGCA TCGTCCCGAG CAATCCTGTC ACTGCAATGG CAGAGGGGAA CCTGCTCGCG ACCGTCTTCT TCGTGATCAT TTTCGGTATC GCGCTCACCT ACGTGCGTGC CCAACAAGAT GAACTCGCGG ATCGTGTCGA CTCAGTGTTC GAGGCATTTG AGATCGGAGC CGAAGCGATG TTTGTCGTTG TTCGTGGCGT CCTCGAGTAC GGCGTGGTCG GCGTATTCGC CCTCATGGCT GCCGGGATCG GCACTGAGGG AATCGGCGTG TTCTCGTCGC TCGGTGAACT CGTGCTCGCT GTCGCAATCG CAGTTGCCAT CCACATCACG TTCACGTATC TACTGCTTCT CATGGGCGTG GTCGCTGACG TCTCGCCGCT CGCCTTCCTC ATGGGCGCGA AGGACGCAAT GGTGACCGCC TTCGCCACCC GCTCCTCGAG TGGCACACTT CCAGTGACGA TGAACAACGC CGAAGAGGAC CTCCGTATCA AGGAGCGTAT CTACTCGTTC GCGCTTCCAG TTGGTGCCAC AGCAAATATG GACGGCGCCG CCATCCGACA AGCGATTACC GTCGTCTTCG CCGCGAACGT GGTCGGACAA CCACTCGCGT TCTCCGAGCA AGTACTCGTG CTGGTCGTCG CCGTGCTTAT CAGCATTGGC ACCGCCGGCG TCCCCGGAGC AGGGATTGTC ATGCTCACCG TCGTACTCAA TCAGGTCGGT CTTCCGCTTG CGGTGGTCGG ATTCGTCGCC GGTGTCGACC CGATCCTCGG TCGTATCGCG ACGATGAACA ACGTGACCGG TGACCTCGCG GTTTCGACTG TCGTAGGCAA ATGGAACGAC GCGATCGACT TGGACAACGG CGTCTGGACG CAGAAATCAG CGGGCAGTGG AAATATCGTC TCTAGCGATG ACTAG
|
Protein sequence | MYSTVGPVWQ RYRSVPLIYR ITLAFLLGSA AGIAFGEQMT VVEPLGDLFL RLLNMLVIPI IVFTLLTGIR QLSPARLGKI GGATVGLYAV TTTIAGIIGL AVANALQPGR GVEFTGGEAE SQAPPSLTEV VLGIVPSNPV TAMAEGNLLA TVFFVIIFGI ALTYVRAQQD ELADRVDSVF EAFEIGAEAM FVVVRGVLEY GVVGVFALMA AGIGTEGIGV FSSLGELVLA VAIAVAIHIT FTYLLLLMGV VADVSPLAFL MGAKDAMVTA FATRSSSGTL PVTMNNAEED LRIKERIYSF ALPVGATANM DGAAIRQAIT VVFAANVVGQ PLAFSEQVLV LVVAVLISIG TAGVPGAGIV MLTVVLNQVG LPLAVVGFVA GVDPILGRIA TMNNVTGDLA VSTVVGKWND AIDLDNGVWT QKSAGSGNIV SSDD
|
| |