Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0494 |
Symbol | |
ID | 8741075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 519944 |
End bp | 521632 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646511072 |
Product | SSS sodium solute transporter superfamily |
Protein accession | YP_003402065 |
Protein GI | 284163786 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGTA ACGGGCTCGC CGGCTCGGCC GGCATCTGGG TGCTCGGAAC GTTCGCCGTG TATCTGCTCG TCCTCCTCGG CATCGGACTG TACTCCTCTC GGCTCATGGA CACCGTCGAC GACTACGTCA TCGGCGGTCG AAGCGTCGGT CCAGTCGTCA CCGGGTTCTC CGAACGTGCC TCCGAGATGA GCGGCTGGCT CACCCTCGGT GTCCCCAGCG ACGCGTTCGG CACCGGTGTG ATGGCCTTCT ACAACGGCCT CGGGATGATT CCCGCCGACC TGTTCGCCTG GGCCGGGATC GCAAAGCGGC TCCGAAAGTA CACGGAGATC GTGAAAGCGG TCACGCTACC GACCTTCTTC GAGACACGTC TACAGGACGA TACCGGCTAC GTCAAAGGCA CGTCCGCCAT CGTCCTGATG ATCTTCGAGG GCGGTTACGT GGGCGCACAG ATCGTCGCCG CCGGGACGCT CTTGGAGGTT CTCACCGGCG TCTCGTCGCT GGTCGGAATC CTCGTCGGTG GCGTCATCGT CGTCGGCTAC ACCATGCTCG GTGGCTACTT CGCCGTCGCG TGGTCCGACT ACGTGCAGGG CGCGATCATC CTGATCGCGT TCATCATCCT GCCGATCATC GCCTTTACCA ACTACGGGCT CCCGTTCAGC GAACTCGAAT CCGTCGGTAG TTCGTACACG AGCGTCACGG CCGGCATGAC CGGTTGGGCC GCTCTCTTCG GTATCATCAG CTACGCCGCG ATCGGTCTCG GTATCCCCGG CAACCCCCAC GTGATGGTCC GGTTCATGGG GATCGACGAG GTCGAGAACA TCCGTCTGGC GGCGCTGGTC GCCCAGCTGT TCATGTTCGT CGCCTACATC GGCGCCGGCT TCGTCGGACT GTACGCGCTG GTCGTCTTCG GCCAGGGCGG CATCGAAGAC CCGAACAACG TCATGCCGCT GCTCACGCTC GAGTTCTTCC CCGGCGCGAT CGCGGGTATC ATCCTGGCGG CCGCACTCGC CGCGATGATG TCCAGTGCGG ACTCGCAGCT CCTCGTCGCG ACGAGCGCGA TCGTCGAAGA CGTCTACCAC GGCTACATCA ACCCGGACGC GAGCCAGGAG ACGCTCGTTC GCTACTCCCA GTACGTCACG CTCGGACTCG GAGCAGCGAG CGTCGCCTTC GCCTTCCTCG CACAGAACAC GCCGATCTAC ACGCTCGTCC TCGACTACGC CTGGGGCGGC CTCGGCGCGG CCATCGGCCC GACGCTCATC GCGTCGCTCT GGTGGAAGCG CATCACCGCC AAAGGCTCGG TTGCGAGTAT GATCGTCGGG ACCCTGACGA TGATCGTCTG GATCCAGCTT TCGAGCCTCC TCGAAGCCCT CGGACTCATG GGTGTCGTCG AGGGGTCGGC GTTCCTCACG GGACTCATCG GCGTCTACGG TCTCGTCCCC GCGTTCATCC TTTCGACGCT CACGCTCATC GTCGTCTCGC TCGTCACGGA GCCCCCGGAG GGCGTCGACG ACCACTTTGA GTCCTTCAAC AAACCGCTGT CGGCCCTCTC GAGCAGCGAT GATCCGACGG GGACCCCTGA CTACGTGACC GACGGCGGTC AGGACGTCGA TCCGAAGGCC GTCACGGAAA CCGACAACAT CCGTGCACAC GTCACGGCCA GCGACTACTG GGAAACGGGT GACGAGTAA
|
Protein sequence | MASNGLAGSA GIWVLGTFAV YLLVLLGIGL YSSRLMDTVD DYVIGGRSVG PVVTGFSERA SEMSGWLTLG VPSDAFGTGV MAFYNGLGMI PADLFAWAGI AKRLRKYTEI VKAVTLPTFF ETRLQDDTGY VKGTSAIVLM IFEGGYVGAQ IVAAGTLLEV LTGVSSLVGI LVGGVIVVGY TMLGGYFAVA WSDYVQGAII LIAFIILPII AFTNYGLPFS ELESVGSSYT SVTAGMTGWA ALFGIISYAA IGLGIPGNPH VMVRFMGIDE VENIRLAALV AQLFMFVAYI GAGFVGLYAL VVFGQGGIED PNNVMPLLTL EFFPGAIAGI ILAAALAAMM SSADSQLLVA TSAIVEDVYH GYINPDASQE TLVRYSQYVT LGLGAASVAF AFLAQNTPIY TLVLDYAWGG LGAAIGPTLI ASLWWKRITA KGSVASMIVG TLTMIVWIQL SSLLEALGLM GVVEGSAFLT GLIGVYGLVP AFILSTLTLI VVSLVTEPPE GVDDHFESFN KPLSALSSSD DPTGTPDYVT DGGQDVDPKA VTETDNIRAH VTASDYWETG DE
|
| |