Gene Htur_4843 details


Gene Information

Locus tag: Htur_4843
Symbol:
ID: 8745473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013746
Strand:
Start bp: 41435
End bp: 43069
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 58%
IMG OID: 646515329
Product: SSS sodium solute transporter superfamily
Protein accession: YP_003406276
Protein GI: 284172895
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter
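
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 41435
end_bp = 43069
gene_length_bp = 1635
protein_length_aa = 544


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 43069 - 41435 + 1 gives 1635 bp, and 1635 / 3 - 1 gives 544 aa, which agrees with the listed gene and protein lengths.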


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGAGA CCATCATCTA CATCGAGTTC ATCCTGTATC TTGTCGCACT CCTCGTCATC 
GGCGCGTACG GTGGACGTCT CACCGAAACG GTCCCAGACT ACCTCCTCGG CGGCCGGAAA
CTGAATCTGT TCACTGGCGC GTTAAGCGAA CAGGCTAGCC TGTGGAGCGG ATGGTTGGTG
GTCGGCTTCC CGGCACTTGT CTATGCGAAC GGTATCTCGT CGCTGTGGTG GATGGTCTGG
CAGATTCCGC TTGGAATCGT GACCTGGGGG ATACTGGCGA AACGTATCGG CCGCTATTCA
CGCGTCTTGA AGTCGCTGAC CGTCCCCGGA TTCCTGTCCG CCCGGTACGG GGATACCAGT
CACCTCATCC GCATCACGAG CACGCTCATC ATCGGCGTAT TCATGGCTGG CTACATCGCA
GGCCAATTAC TTGCTGCCGC CAGCGCAATC TCTGTCGGCT TCGAACTCTC CTACGAGCTT
GGGTTCGTGA TTGCACTTAG CGTCGTTGTC ATCTACACCG TGATGGGCGG ATTCACCGCG
TCGGCTTACA CTGATGTACT CCAAGCCCTG CTGATGACCG CGTTCGCCAT CATTGTCCCA
ATCGCAGTAT TGGTGGTTAT TGGCGGTCCG AATGAGCTGA TGACCCAGTT CAACAACGCT
GCCAGCGACA ACATGACCTC GTTTACCGGT GGCCGGTCGC CATATGAGTT CCTCATCTTC
TCGACAATCG CTGTCATCGC CTTGGGCGGG CTCGGCCAAC CGCACGGCGT CGTCCGATAC
ATGGGAATGG AACGTCCGTC CAAAGCTGGC TACGCCATGA TCGTCGCCGT CGTATTCATG
TTGATTGCGC TAATCGGCAT CCCGATCATC TCACTCGGCG CGGTAGTGAT GCTCCCTGGA
ATTGAGAACC AGGATCTCGT CGCGCCGATG ATGATTCTCG AAACGCTCCC GCCGTGGCTC
GCCGGCTTCC TCCTTGCGGG CGGCGTAGCA GCGATTATGA GTACCGCCGA CTCGCAGCTC
CTTGTCGCTG CCAGCGCGTT TGGCGAAGAC GTATATAGTG GGATTCTCAA TCAAGACGCG
AGCGACCGAC AGATTCTCCT CGTAAACCGT ATGTCTGTTC TCGCCATTGG CCTCCTGGCC
GCTACCTGGG CTTGGGTTAC ACCAGGATCA GTGCACACGA CAATCCTGTT TGCCTGGGCG
GGCCTCGGCG CGAGCATCGG TCCCGTGCTC GCTATGTCGA TATATTGGAA GAAAACAACA
GGGCCGGGTG TACTCGCGGG GATGTTCACT GGACTAGTCA CAACCATCAT CTGGAATCAA
GCGTCCGGGG GGCCGGGCAT GATCTTCGAC GTCTATGAAC TCCTGCCAGC ATTCACGCTC
AGCATGCTGG CGGTCATCAT CGGTAGCTAT CTCTCCGGTC CGCCAAAGCG CGGTGAAGAA
GAAATCCAAC GGGAACTCCG CGAAATCTCG AAGCCGCTGC AAGACGAAAT CGACCTCGTC
CGTGAACGAC AAGAAGCCGC GCAGGCAGCC TCGAACCAGC ATTCGCCGCA GCTGACGGCG
GTAACAGAAG AAGAGATTGC AACAACCTAT CTTGCAGACC GGCAGCTGGA TGATCTCTCT
CCGGCCGACA GCTAA
 
Protein sequence
MVETIIYIEF ILYLVALLVI GAYGGRLTET VPDYLLGGRK LNLFTGALSE QASLWSGWLV 
VGFPALVYAN GISSLWWMVW QIPLGIVTWG ILAKRIGRYS RVLKSLTVPG FLSARYGDTS
HLIRITSTLI IGVFMAGYIA GQLLAAASAI SVGFELSYEL GFVIALSVVV IYTVMGGFTA
SAYTDVLQAL LMTAFAIIVP IAVLVVIGGP NELMTQFNNA ASDNMTSFTG GRSPYEFLIF
STIAVIALGG LGQPHGVVRY MGMERPSKAG YAMIVAVVFM LIALIGIPII SLGAVVMLPG
IENQDLVAPM MILETLPPWL AGFLLAGGVA AIMSTADSQL LVAASAFGED VYSGILNQDA
SDRQILLVNR MSVLAIGLLA ATWAWVTPGS VHTTILFAWA GLGASIGPVL AMSIYWKKTT
GPGVLAGMFT GLVTTIIWNQ ASGGPGMIFD VYELLPAFTL SMLAVIIGSY LSGPPKRGEE
EIQRELREIS KPLQDEIDLV RERQEAAQAA SNQHSPQLTA VTEEEIATTY LADRQLDDLS
PADS
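
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before use. The following is a minimal sketch, not the database's own tooling: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for sense codons. Alternative start codons are not modeled, which is acceptable here because this gene begins with ATG.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Standard codon table built in TCAG order (first, second, third base).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def translate(dna: str) -> str:
    """Translate a cleaned DNA string in frame 0, stopping at the first stop codon.

    Codons containing unexpected characters are rendered as 'X'.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE.get(dna[i:i + 3], "X")
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence, copied from this record.
first_blocks = clean_sequence("ATGGTTGAGA CCATCATCTA CATCGAGTTC")
first_residues = translate(first_blocks)
```

Applied to the first three blocks of the gene sequence, `translate` yields `MVETIIYIEF`, matching the first block of the protein sequence. The final gene block `CCGGCCGACA GCTAA` translates to `PADS` followed by a TAA stop codon, matching the end of the protein sequence.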