Gene Htur_0876 details


Gene Information

Locus tag: Htur_0876
Symbol:
ID: 8741460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 892670
End bp: 894349
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 63%
IMG OID: 646511454
Product: extracellular solute-binding protein family 5
Protein accession: YP_003402444
Protein GI: 284164165
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
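
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed 1680 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(892670, 894349)
print(length, protein_length_aa(length))  # 1680 559
```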


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCGTA TTCAGCGGCT CATCACCGGT GCGGCCAGCG ATCAACTCTC GATAACTATC 
CTCACGCTGC CCACTGATAA CGACCGACAG GCGATTCAGA TCGCCCGCCA GCTCGAGAAG
AACCTCAACG CGGTTGGTAT CAACGTCCGG ATCGAACCCC GCGCCAACGC CGAGTTGCTG
CAGACGGTCT TGCTCGAGCA CGACTTCGAT TGTTACATCA GCAGCCACCC GGGCGGCTAC
GACCCCGACT TTCTGTACGA GACGTTTCAC TCGAAGTTCG CCCCCGAATC CGGCTGGCAG
AACCCCTACG GGTTCGTAGA TACCGAGATC GACGAGGTAC TCGAGCGACA GCGCTCGACA
ACGGGGTCGG AACGAAAGGA CGCCGTCGGC GAAGCGCTCG ACCGACTGGC GCAAACGCAC
CCGATCGTGC CCATCTGCAT TCCCGACGAG CGGCGACTCG TCAGAACGGA TCGGTTCGAC
GGCTGGAACG ACCACCACCT CGGGACCCAA CTCGGCTACC TCGGTCTCGA GCCCGACGAG
GACGACTTCG AGGACGAGGT GGTCCTGAAC GCGGTGATCA CCGACTCGAG CCCGTCGAGA
AACCTCAACC CGCTTGCGGC GCCGTACCGC TACCGGGGCC CGTTCATCGA TCTCCTCTAT
GATTCGATTG CGACCGAAGA CAACGGGGAA CTCCGGCCGT GGCTCGCCGA GTCCTTGGAG
TGGGACGGCT CGACCGCGAC GGTCACGCTC CGATCGAACT GTCGGTTCCA CGACGGCGAG
CCGGTGTCCG CCGACGACGT CAAATTCACG TACGAATTTC TGGACGACAC GTCACTCGGC
GTCCGCGACG GACAGTCTCC GGCGCCGAGA TACCGCGGAC TGGGCGACGC CGTTGAATCG
GTGACGGCCG TCGACGATCT GACCCTCCGG ATGCGATTCG GGACGAGCGA CGAAGTCGGC
AAACTGGCCT TTACGGTCCC GATTCTCCCG AAACACATCT GGAAGACGGC GGTCGAAGAC
CGCCTCGAAA ACGGCGCCGA GCCCATCCAG GGGACGTGGG ATATCGTGAC CACCGACGAG
ATCTCGCCGA TCGGCAGCGG TCCGTACGCA CTCGCCGAGC GGGAGCCACG GTCGCACATT
CGATTCGAGC GCTTCGGCGA TCATTTCACC AGTCGCGAGT ACGTCGACCT GCCGGAACCC
CGCGTCGACG AACTCGTCTT TCACGTCGAG CCGAACAGTA GCGCGGCCAT CGAACAACTC
GAGGCGGGGA ACGCCGATGT GACGGCCTCC AGTCTCGGAG CGGAAACGGT CGGGGACGCT
CCGTCCGGCC TCGAACTCGT CGAGTCCAAG TCGTGGTCGT TCTATCACAT CGGGTTCAAC
GTCCGGAACT CGCCGTTCAG CAACCTCCAC TTCCGACGAA ACGTCGCGCG ATTGATCGAC
AGGGAATCGC TCGTGGCGGA CGTCTTCAAC GGGCAGGCGA GCCCGTCCGT CACACCGGTT
ACGGAGGAGT GGGTACCGGA CGACCTCGAG TGGGACGGTG CCGCACCATA CGCGCCCTTT
TTCCGTGATG AAACGAACAG TGGGGATACG GGAGAACTGG ACGTCGAACG GGCGAAGCGA
TCCTTCGAAC GGCACGGATT TCAGTACGAC GAAGAGGGAG AATACATCGT GAGGTCCTGA
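
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. A minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; the helper names are hypothetical, and only the first two blocks are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as sample input.
sample = clean_sequence("ATGGACCGTA TTCAGCGGCT")
print(len(sample), round(gc_percent(sample), 1))  # 20 55.0
```

Pasting the whole gene sequence into `clean_sequence` should give a value that can be compared against the GC content field once it is rounded.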
 
Protein sequence
MDRIQRLITG AASDQLSITI LTLPTDNDRQ AIQIARQLEK NLNAVGINVR IEPRANAELL 
QTVLLEHDFD CYISSHPGGY DPDFLYETFH SKFAPESGWQ NPYGFVDTEI DEVLERQRST
TGSERKDAVG EALDRLAQTH PIVPICIPDE RRLVRTDRFD GWNDHHLGTQ LGYLGLEPDE
DDFEDEVVLN AVITDSSPSR NLNPLAAPYR YRGPFIDLLY DSIATEDNGE LRPWLAESLE
WDGSTATVTL RSNCRFHDGE PVSADDVKFT YEFLDDTSLG VRDGQSPAPR YRGLGDAVES
VTAVDDLTLR MRFGTSDEVG KLAFTVPILP KHIWKTAVED RLENGAEPIQ GTWDIVTTDE
ISPIGSGPYA LAEREPRSHI RFERFGDHFT SREYVDLPEP RVDELVFHVE PNSSAAIEQL
EAGNADVTAS SLGAETVGDA PSGLELVESK SWSFYHIGFN VRNSPFSNLH FRRNVARLID
RESLVADVFN GQASPSVTPV TEEWVPDDLE WDGAAPYAPF FRDETNSGDT GELDVERAKR
SFERHGFQYD EEGEYIVRS
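
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration rather than the database's own procedure. It builds the codon table from the usual compact 64-letter layout (bases ordered T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG. The function name `translate` is hypothetical, and the input is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGGACCGTA TTCAGCGGCT CATCACCGGT"))  # MDRIQRLITG
```

The printed residues match the first block of the protein sequence; the final codons of the gene (GTG AGG TCC TGA) likewise give VRS followed by a stop.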