Gene Htur_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3887 
Symbol 
ID8744515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp119260 
End bp121065 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content61% 
IMG OID646514471 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003405418 
Protein GI284167140 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.544175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGTA ACGGATACCC CGTGGGAAAG ATGATCAAGG ACCACAGTCA TTTGAGTAGA 
CGTAAATTTG TCGGGGCCAG CGCCGGAACG CTCGCTGCGA CGCTCGCTGG GTGTGTCGGT
GGCGGCGACA ACAGTACGGA GTTCGTCACG GCGTTCGAAG GGGGTCGTCC GCCGACGGAG
GTCCACTTCA ATCCGTGGAA CGCCTCGGAC CACGCACAGA CGTACAGTAT CTACTGGACA
CAGGAAACGC TCGCGACGCA TTCTGACGGG ACCGTCTCGA CCGATTTCTT CGAGGACATC
AGTGTCGACG GCCGCGAGGT CACGATCAAG TTCTCAGACA AATGGAACTT CTGGAACGGC
AACGACATCA CCGCCGAAGA CTACTTCATC GAGGCGGAGC TCTGGCGCTA CCAGGACCCG
GAGGCTTCCC CCCTCGAAGG CCACGAACTG GTCGACGACT ACACCGTCAA ACGAATCTAC
AAGAACGAGG TCTCGCCGGT TATCGCGAAA TCGAACGCGG GTCTCGGGAC GAGCGCCCCG
AAATCGGTCT TCCGAGAGTA CTACGAGCGC TACGAGGACG CGGGCGGAGA AAGCGGCCGC
CAGGCGGTTA CCGAGGATCT CCTTCAGATG ACGATCGATA CCGAGGAGTT CGTCGAGGAG
GGATACGGAA GCTCGCTGTT CAAGATCGAG GACTTCAACT CCTCCGAGAC GCTGGCGACC
AAGTGGGAGG ACCATCCGTG GGCCGACGAA ACGGATATCG AGCAAATTCG GGTCCTTCCG
AACGTCGAAT CGGGGACGCA GGTCGAGCAG CTCGAGAAGA GTGACAAGCT CGACATGACT
CAGTACATCA CCGAGAGCCA GCGCCCGGAC TACCCCGACA ACATCGAGAA TATCTACGAG
TTGAGCCACT ACAACTGCCA GAAGTTCATG CTGAACTGGA ACAACGAGCA CCTCGCGCGG
CGGCCGGTTC GCCGCGCGAT CATCTCCGCG ATCGACATTC CCGCGATCAT CGACGCCGCG
ACGCAGACGG GAATGCTCGC GAGCCCGACG CAGGTCCAGA CGGGAATCCG AGAGACCATC
GAAGAGGAAT ACCTCGGTGA GGACTTCGTC GACCAGCTCA TCGACTACCC CGTCGAGGCC
GACGAGGAGA CGGCGATCGC CTACATGGAG GAGGCCGGCT ACTCGCGGGA GGGCGACGAG
TGGATCAGTC CCGACGGCAA CGCGACTGAC TTCACCATCA TCACGCAGTC CGCCGTTTCG
CAGTCCCAGC CGACGAAAGT CTTCACCGAC CACCTCAACG AGTTCGGGCT GAACGCGGAG
ATGGAGGCCA TCGGTCAGGA CTACTACTCG CGGGTCCAGG AGTGGGAGTT CGACATCGCC
TGGATGTGGC ACGTCGCACT GCCGTACTGG CATCCCATGG CGTACTTCTC GAACAACTTC
TACGGCCTCC TCGCCGGCGA CGTCAACAGT GATAGCGACA CGGGACCGAC CGGCGTGCCG
TTCTCGCTCG AGATCCCCGA GGAGGTCGGC GCGACGGAAG TCGAGGGTAA CGGCGTCGAG
ATCAACCCGG CCCAGCTCAT GGTCGACCTC GAGGGCGCAT CGTCCGAGGA GGAGACGAAG
GAACTGACCC GAACGCTCGT CCAGTGGGTC AACTTCGACC TACCCGCGAT CATCCACTTA
CAGGAGAGCC GCGGCTTCGC CGGCGACGTC GAGAATTTCG ACTTCCCGAG CGAGGACGAG
TTCCGAATGG ACCGTCCGAA CCCGGGACCG TTCGCGCTGC TGCGAGGACG TATTTCGACG
AATTAG
 
Protein sequence
MGRNGYPVGK MIKDHSHLSR RKFVGASAGT LAATLAGCVG GGDNSTEFVT AFEGGRPPTE 
VHFNPWNASD HAQTYSIYWT QETLATHSDG TVSTDFFEDI SVDGREVTIK FSDKWNFWNG
NDITAEDYFI EAELWRYQDP EASPLEGHEL VDDYTVKRIY KNEVSPVIAK SNAGLGTSAP
KSVFREYYER YEDAGGESGR QAVTEDLLQM TIDTEEFVEE GYGSSLFKIE DFNSSETLAT
KWEDHPWADE TDIEQIRVLP NVESGTQVEQ LEKSDKLDMT QYITESQRPD YPDNIENIYE
LSHYNCQKFM LNWNNEHLAR RPVRRAIISA IDIPAIIDAA TQTGMLASPT QVQTGIRETI
EEEYLGEDFV DQLIDYPVEA DEETAIAYME EAGYSREGDE WISPDGNATD FTIITQSAVS
QSQPTKVFTD HLNEFGLNAE MEAIGQDYYS RVQEWEFDIA WMWHVALPYW HPMAYFSNNF
YGLLAGDVNS DSDTGPTGVP FSLEIPEEVG ATEVEGNGVE INPAQLMVDL EGASSEEETK
ELTRTLVQWV NFDLPAIIHL QESRGFAGDV ENFDFPSEDE FRMDRPNPGP FALLRGRIST
N