Gene Information |
Locus tag | Htur_4843 |
Symbol | |
ID | 8745473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013746 |
Strand | - |
Start bp | 41435 |
End bp | 43069 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646515329 |
Product | SSS sodium solute transporter superfamily |
Protein accession | YP_003406276 |
Protein GI | 284172895 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter |
|
|
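The length fields in this record follow from the coordinates. Assuming the start and end positions are inclusive and 1-based (an assumption about the coordinate convention, though it is consistent with the values listed), the gene length is End bp minus Start bp plus 1, and the protein length is the number of codons minus one stop codon. A minimal Python sketch of that check; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 41435, End bp 43069.
length = gene_length(41435, 43069)
print(length, protein_length(length))  # 1635 544
```

Both results match the Gene Length (1635 bp) and Protein Length (544 aa) fields.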
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGAGA CCATCATCTA CATCGAGTTC ATCCTGTATC TTGTCGCACT CCTCGTCATC GGCGCGTACG GTGGACGTCT CACCGAAACG GTCCCAGACT ACCTCCTCGG CGGCCGGAAA CTGAATCTGT TCACTGGCGC GTTAAGCGAA CAGGCTAGCC TGTGGAGCGG ATGGTTGGTG GTCGGCTTCC CGGCACTTGT CTATGCGAAC GGTATCTCGT CGCTGTGGTG GATGGTCTGG CAGATTCCGC TTGGAATCGT GACCTGGGGG ATACTGGCGA AACGTATCGG CCGCTATTCA CGCGTCTTGA AGTCGCTGAC CGTCCCCGGA TTCCTGTCCG CCCGGTACGG GGATACCAGT CACCTCATCC GCATCACGAG CACGCTCATC ATCGGCGTAT TCATGGCTGG CTACATCGCA GGCCAATTAC TTGCTGCCGC CAGCGCAATC TCTGTCGGCT TCGAACTCTC CTACGAGCTT GGGTTCGTGA TTGCACTTAG CGTCGTTGTC ATCTACACCG TGATGGGCGG ATTCACCGCG TCGGCTTACA CTGATGTACT CCAAGCCCTG CTGATGACCG CGTTCGCCAT CATTGTCCCA ATCGCAGTAT TGGTGGTTAT TGGCGGTCCG AATGAGCTGA TGACCCAGTT CAACAACGCT GCCAGCGACA ACATGACCTC GTTTACCGGT GGCCGGTCGC CATATGAGTT CCTCATCTTC TCGACAATCG CTGTCATCGC CTTGGGCGGG CTCGGCCAAC CGCACGGCGT CGTCCGATAC ATGGGAATGG AACGTCCGTC CAAAGCTGGC TACGCCATGA TCGTCGCCGT CGTATTCATG TTGATTGCGC TAATCGGCAT CCCGATCATC TCACTCGGCG CGGTAGTGAT GCTCCCTGGA ATTGAGAACC AGGATCTCGT CGCGCCGATG ATGATTCTCG AAACGCTCCC GCCGTGGCTC GCCGGCTTCC TCCTTGCGGG CGGCGTAGCA GCGATTATGA GTACCGCCGA CTCGCAGCTC CTTGTCGCTG CCAGCGCGTT TGGCGAAGAC GTATATAGTG GGATTCTCAA TCAAGACGCG AGCGACCGAC AGATTCTCCT CGTAAACCGT ATGTCTGTTC TCGCCATTGG CCTCCTGGCC GCTACCTGGG CTTGGGTTAC ACCAGGATCA GTGCACACGA CAATCCTGTT TGCCTGGGCG GGCCTCGGCG CGAGCATCGG TCCCGTGCTC GCTATGTCGA TATATTGGAA GAAAACAACA GGGCCGGGTG TACTCGCGGG GATGTTCACT GGACTAGTCA CAACCATCAT CTGGAATCAA GCGTCCGGGG GGCCGGGCAT GATCTTCGAC GTCTATGAAC TCCTGCCAGC ATTCACGCTC AGCATGCTGG CGGTCATCAT CGGTAGCTAT CTCTCCGGTC CGCCAAAGCG CGGTGAAGAA GAAATCCAAC GGGAACTCCG CGAAATCTCG AAGCCGCTGC AAGACGAAAT CGACCTCGTC CGTGAACGAC AAGAAGCCGC GCAGGCAGCC TCGAACCAGC ATTCGCCGCA GCTGACGGCG GTAACAGAAG AAGAGATTGC AACAACCTAT CTTGCAGACC GGCAGCTGGA TGATCTCTCT CCGGCCGACA GCTAA
|
Protein sequence | MVETIIYIEF ILYLVALLVI GAYGGRLTET VPDYLLGGRK LNLFTGALSE QASLWSGWLV VGFPALVYAN GISSLWWMVW QIPLGIVTWG ILAKRIGRYS RVLKSLTVPG FLSARYGDTS HLIRITSTLI IGVFMAGYIA GQLLAAASAI SVGFELSYEL GFVIALSVVV IYTVMGGFTA SAYTDVLQAL LMTAFAIIVP IAVLVVIGGP NELMTQFNNA ASDNMTSFTG GRSPYEFLIF STIAVIALGG LGQPHGVVRY MGMERPSKAG YAMIVAVVFM LIALIGIPII SLGAVVMLPG IENQDLVAPM MILETLPPWL AGFLLAGGVA AIMSTADSQL LVAASAFGED VYSGILNQDA SDRQILLVNR MSVLAIGLLA ATWAWVTPGS VHTTILFAWA GLGASIGPVL AMSIYWKKTT GPGVLAGMFT GLVTTIIWNQ ASGGPGMIFD VYELLPAFTL SMLAVIIGSY LSGPPKRGEE EIQRELREIS KPLQDEIDLV RERQEAAQAA SNQHSPQLTA VTEEEIATTY LADRQLDDLS PADS
|
| |
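The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal example under two assumptions: the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA), and the standard codon assignments apply, which translation table 11 shares with table 1 for elongation codons. The `translate` helper is hypothetical, and only the first and last few codons of the record are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in the record
    can be pasted in as they are.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGGTTGAGA CCATCATCTA CATCGAGTTC"))  # MVETIIYIEF
print(translate("CCGGCCGACA GCTAA"))                  # PADS
```

The two outputs agree with the first ten and last four residues of the listed protein sequence. The 3' fragment is a multiple of three bases long, so it is read in the same frame as the full gene.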