Gene Htur_4667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4667 
Symbol 
ID8745268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp249708 
End bp251471 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content59% 
IMG OID646515176 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003406123 
Protein GI284172741 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACA ATAACCGTGA TTACGTTAGT CATATCGATC GCCGTGCGTT CCTGCAGGGC 
GCAGGCGCCT CAGGCGCCGC TGCACTGGCG GGCTGTACGG GCGGAACCGG ATCCGGGGAC
GGCACCCATG TCAGCTATAC GAATCAGGTT CCGACGAAGA TTCAGTATAA CCCGCTCAAT
CCAACGAGTT ACTCTCAGTA CTCGCATTAC CTGCTTTTCG ACCGATTTGC GAATTTCAAC
TTCGCAAAAG GCGAGTTCAC TCCCTACCTC ATCCAGGATT GGGAGTTCGG CGACAAGACG
TTCGAAATGA CCGTTCGCGA CGGCGTCACC TGGGAAGACG GCGACGACGT CACGGCCGGT
GATGTCGCGA CGCAGCTGCG CCTCGCGAGG TTGACCGGCG GAACGATCGA TCAGTTCACC
GAAGATATCG AAGTAGCGGA CGACCAGACC GTCGTTCTCA ACCTCGCTGG AGACGTTAAC
CCCCGAATTG TCGAATTCAA TTCGTTGGGA CAGCGGTTCA TGACCCTGAA AGAGAGCGAG
TTCGGGGAGT ACGTCGACAT GTTCGAGGAT GATGCTGAGC AGGCCCAAAG CGAAATCCAG
AGCCACGCCT ACAAGGATGT CATCGCCAAC GGCCCGTTCA CTGTTTCAGA GCGCGGGAAT
CAGCAGATTC TTCTCGAGAA GCGGGACAAC CACCCCGACT CAGGCAACAT CAACTTTGAT
CGGGTCGCGT TCCGCTACCT CGACGGGAAC ACAGCCGTTC ATCAGGCGAT CGGCGCGAAC
GAACTTGACT CCGTGATGGT GTTCGCCCCG CCGAACGTCG TGAATAACTT CCCCGATCAC
ATCCAGATGG AGACCATTCC GGGGAAGGTC GGATACGGAT TGATTCCGCA GCATGACCAC
GAGCACACGG GCGACAGAGC CGTTCGTCAG GCGATCGCGC ACGTGCTAGA CCGAGAGGCC
ATCGTCAAGA ACGTTGGCGA GACGCTGAAG CAGGCGCCAC CGCTGCTCAC TGGAATTCCC
TCGGACGACC AGGAACGGTG GCTCGACGAC GCCTACGACT CGTTCGAGGA CTATGGCGTG
AACGAACGAC AGACCGACGA TGCCAACCGG ATCCTCCGTG ACGCCGGCTA CTCGAAGGAC
GGCGATACGT GGGTCGATAA TAGCGGAGAC GTAGTCGAAC TTCCCATCAC CGTCCCGTCT
GGGTGGACCG ACTGGGTCAC CGCGACCCAG ACGATCGTCG ATCAACTGAA CGACTTCGGA
TTCGAGTCTC AGGTCGATTC TCGGAATTTC AGCGCGCTGA ACGGAACGGT CTGGCCGAAT
GGCGACTTCG TCCTCTCGGC CGGCGGGTGG CTTCCCGGTG GGGGTCGGGC GTCGTTCCCA
TACTTCTCGC TTCACCACCA GCTCCTCGAG CACTACCGCA ACTTCTCGTA CAACTACGAC
CCCGCCAACG AGAACCGCGG CGGGAGCAAC GGCGACGTCA CCGTCCCCTC TCGGACCGGG
TCAGAAACGA TGACGGTGAA CCCTAGCGAT CGCCTCAAGG AACTGTCCGA AACCTCCGAC
GAAGCGACAA CCCGCGAGAT CTCGATCGAA CAGGCGTGGG TGACCAATGT GGACCTTCCC
ATGATCCCCG TCATGGAAAA GCAAGAACAG GCGTTCCTTG CCACCGACGA GTGGTCCGTT
CCCGAACAGG GTGCCGAGGT TTCGCAGGTT CGGTGGCCGA ACCTCTGGCT CATTCGTCAG
GGAGAACTGC AATACGACGG ATAG
 
Protein sequence
MTDNNRDYVS HIDRRAFLQG AGASGAAALA GCTGGTGSGD GTHVSYTNQV PTKIQYNPLN 
PTSYSQYSHY LLFDRFANFN FAKGEFTPYL IQDWEFGDKT FEMTVRDGVT WEDGDDVTAG
DVATQLRLAR LTGGTIDQFT EDIEVADDQT VVLNLAGDVN PRIVEFNSLG QRFMTLKESE
FGEYVDMFED DAEQAQSEIQ SHAYKDVIAN GPFTVSERGN QQILLEKRDN HPDSGNINFD
RVAFRYLDGN TAVHQAIGAN ELDSVMVFAP PNVVNNFPDH IQMETIPGKV GYGLIPQHDH
EHTGDRAVRQ AIAHVLDREA IVKNVGETLK QAPPLLTGIP SDDQERWLDD AYDSFEDYGV
NERQTDDANR ILRDAGYSKD GDTWVDNSGD VVELPITVPS GWTDWVTATQ TIVDQLNDFG
FESQVDSRNF SALNGTVWPN GDFVLSAGGW LPGGGRASFP YFSLHHQLLE HYRNFSYNYD
PANENRGGSN GDVTVPSRTG SETMTVNPSD RLKELSETSD EATTREISIE QAWVTNVDLP
MIPVMEKQEQ AFLATDEWSV PEQGAEVSQV RWPNLWLIRQ GELQYDG