Gene Htur_0114 details

Gene Information

Locus tag: Htur_0114
Symbol:
ID: 8740677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 123865
End bp: 124863
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 62%
IMG OID: 646510677
Product: TRAP transporter solute receptor, TAXI family
Protein accession: YP_003401688
Protein GI: 284163409
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR02122] TRAP transporter solute receptor, TAXI family
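
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic; it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 123865
END_BP = 124863


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 999 332
```

Both results agree with the Gene Length and Protein Length fields of the record.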


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGACA ATCAAAAATG TATAAATAGC TTCGACGGAA TCGCTAACAG TATGAACCAG 
AATATTAATA GACGGACATT CATTGCAGGC ATCGGTGGGA CCGGTATCGC CGCTCTCGCG
GGCTGTTCCG GCGAGGGGAA TGAGAACCAG CTCGCGTGGC ACTCGGGCGG CACCGACGGA
ACGTACTATC CGCTTTCGGG CGAGTTCAAG ACGATCGTCG AAGACCAGAC GGACTACTCT
CTGCAGGTTC AGTCGACCGG TGCGAGCGTC GAGAACGTCA GCAGCCTCAA CAGCGAAGAC
GCCGAGTTCG CGCTGATTCA GAACGACATC GCCTACTTCG CGGTCAACGG CACCGGGATC
GAGGAACTCG AGGGCAACGC GATGGAGAAC ATCCGTGGCG TCGCGACGCT GTACCCCGAG
ACGATCCACG TTATCACGCA GGCCGACTCG GGAATCGACA GTCTCGAAGA CCTCGAAGGC
GCCTCGGTCA ACACCGGTGA CACCGGGAGC GGGACGCAGG TCAACGCCCT GCAGATCCTC
GAGACAGCCG GCATCAGCGA GGACGACTTC GACGAACAGA ACGCCGACTT CGGAACGGCG
GCCGATCAGG TACAGGACGG CGACGTCGAC GCGGCGTTTA CCGTCGGTGG CTGGCCGGTC
GGCTCCGTCG AGAACCTCGC GACCAACCAA GACATCGAAC TGGTCGAGAT CTCGGGCGAC
CTGCGCGAAG ACATCATGGC CGACGCCGAG TGGTTCGCCG AGGACACCAT CCCCGGCGGA
ACCTACGACG GCGTCGACGA CGACGTCGAT ACCGTCTCCG TCCAGGCGAT GATCGCCACC
CACGAGGGCG TCGACGAGGA GACCGTCGAA GGGGTGACGA CGGCGATCTT CGACAACACC
GACGAGATCG GCACGAAGTC GGACTTCATC GACGCCGACT CGGCCCAGGA CGGGATGCCG
ATCGACTTGC ACGCCGGCGC CGAGGCGTAC TTCAACTGA
 
Protein sequence
MIDNQKCINS FDGIANSMNQ NINRRTFIAG IGGTGIAALA GCSGEGNENQ LAWHSGGTDG 
TYYPLSGEFK TIVEDQTDYS LQVQSTGASV ENVSSLNSED AEFALIQNDI AYFAVNGTGI
EELEGNAMEN IRGVATLYPE TIHVITQADS GIDSLEDLEG ASVNTGDTGS GTQVNALQIL
ETAGISEDDF DEQNADFGTA ADQVQDGDVD AAFTVGGWPV GSVENLATNQ DIELVEISGD
LREDIMADAE WFAEDTIPGG TYDGVDDDVD TVSVQAMIAT HEGVDEETVE GVTTAIFDNT
DEIGTKSDFI DADSAQDGMP IDLHAGAEAY FN
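
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, the table named in the record. The sketch below illustrates this on the first and last lines of the gene sequence only. It relies on table 11 sharing its amino acid assignments with the standard genetic code, and it assumes the 10-character grouping in the listing is purely cosmetic. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI codon order (bases T, C, A, G; third
# position varying fastest). Table 11 uses the same assignments as the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the sequence listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATTGACA ATCAAAAATG TATAAATAGC TTCGACGGAA TCGCTAACAG TATGAACCAG"
last_line = "ATCGACTTGC ACGCCGGCGC CGAGGCGTAC TTCAACTGA"

print(translate(first_line))  # MIDNQKCINSFDGIANSMNQ
print(translate(last_line))   # IDLHAGAEAYFN*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final 12 residues followed by the TGA stop codon. The last line is in frame because the 16 lines before it hold 60 bases each.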