Gene Htur_4095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4095 
Symbol 
ID8744723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp356177 
End bp357880 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content61% 
IMG OID646514655 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003405602 
Protein GI284167324 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAATC AGCCGGATCA GATCAACTAC AACCCCTTCG GACAGCAGAC CCCCGGCGTC 
TTGAACAAGA TCATCTTCGA GCAAGGCGCG CGGTGGCATG CCGACGAGGA AGAATGGGTG
TCACTACTCG TCGAGAACTG GGAATACCCG TCGACGGTCC AGTCTGGGGC GACCGTTCGG
TTCGACCTGT CTGATACGTA TACGTGGTTC AACGGAGACG CGTACACGGC CGAGGACTTC
GTGGGCCAAA TGCGCTGGCG GAACGTGAAC AACGATGCCG TCTGGGACTT TCTCGACGAT
GTCGAGCAGA CGGGCGAATA CAGCGTCGAG ATGACGCTCG CAGAGAAGAT CGACACCCAA
CTGTTCGAGA ACGCGCTGTT CGGGATGGGC GACGCCCCGA CCAACTGGAC GTTCAAGTTC
GACGTCTTCA GGGACTACCT CGAGCGCATG GAAGATGCTG GGTCCGACGA AGACCGGAGT
ACGATCCTCC AAGAACTCGC CGAGTGGAAC GTTTCTCTCA ACGAGGCTCG TGAGAAAGGG
CTCGGGAACG GGCCGTTTAT GCCGTCGGAG GCGACGGCGA ACCAGCTCCT CCTAGAGAAG
TACGAGGACT ACCAGAACCC CCACATCACG GCGGACGACA TCGCGTTCGA TACGATGGAG
GCGCTGCCGC TCCAGGGGCC TCAGGAGAAG CTCCGATCGC TGCGCAATAA CGAGGTGGAT
GCGCTTCACA ACGTGGCGTT CAACTCGGCG CAGGCAGACC AGATCCCGGA CAACTACGAG
TCCGTCAGGT TCTACAGCCA TAGCGGCGAG TCGATCTCAT TCAACTGTCG CCGCGAGCCT
CTCGACAACC AGCAGGTCCG GTGGGCGCTC TCGAACGTGC TGCAAGCGAG CCACGACACG
CTGATGCAGA ACCTGCCACT CTCGGACGTG AACAAGGAGC GCGTCAATCT GTCCGCGGGG
ATGTCACAGC CGCTTATCGA TGAGTGGCTC GGAGACGTCA AAGGTCAGTT CATGCAGTTC
GACGGCGGGA CCGAACGAGC TACCGAACTC CTTCGGGACG AAGGGTTCAC CCAGGAGAAC
GGTACGTGGT ACAAGCCCGA CGGCGAGCAG TTCACGCTCA CGTTCAGAGA CGCCGGTTTC
CACAGCAACC GGACGGAAAC CGCGTCGCGG ATCCTCAGCG ACTTCGGTAT CGAGACCGAA
GCGATCATCG TTGAGGACAC CACGTACTTC GGACAGACGA TCCCCGAGCG GGACTACGAT
CTCACTAACT GGTGGGTCGG CCAGTCCGCA CCGCTTCCGT ACGAGGGGTT CCAGAATCAC
CTCGTCAACG AGGCATGGGT GACCGCGTAT CCGCTCGGCG TACCCTCCGT CTCGGAGTGG
AACGGCGAGG GTACCAGCGA GTTCATCGTC GAAGTTCCGC CGATCGGTGA GCCCGACGGA
GAGCTTCGAG AGATGGACAT CCGGGAACGC CTCCAGGCGA TCGCGCGAGG CCAGAGCAAG
GAAGAACAGC GGCCGCACAT CCAGCAGCTA GCGTGGTCCT GGAACTGGAT GGACGCTTCC
TGGGGACCAT GGACACTGTA CATCGCCTCT GAGTACTACA ACACCGAGAA CTGGAACTGG
CCCGCAAACG ACAGCGCGAT CATGAAGACA CCGAGTGTGC AAGACTGGCC CGTCCGCCAG
GGCCAGCCGA CGCCCAACGA ATAA
 
Protein sequence
MANQPDQINY NPFGQQTPGV LNKIIFEQGA RWHADEEEWV SLLVENWEYP STVQSGATVR 
FDLSDTYTWF NGDAYTAEDF VGQMRWRNVN NDAVWDFLDD VEQTGEYSVE MTLAEKIDTQ
LFENALFGMG DAPTNWTFKF DVFRDYLERM EDAGSDEDRS TILQELAEWN VSLNEAREKG
LGNGPFMPSE ATANQLLLEK YEDYQNPHIT ADDIAFDTME ALPLQGPQEK LRSLRNNEVD
ALHNVAFNSA QADQIPDNYE SVRFYSHSGE SISFNCRREP LDNQQVRWAL SNVLQASHDT
LMQNLPLSDV NKERVNLSAG MSQPLIDEWL GDVKGQFMQF DGGTERATEL LRDEGFTQEN
GTWYKPDGEQ FTLTFRDAGF HSNRTETASR ILSDFGIETE AIIVEDTTYF GQTIPERDYD
LTNWWVGQSA PLPYEGFQNH LVNEAWVTAY PLGVPSVSEW NGEGTSEFIV EVPPIGEPDG
ELREMDIRER LQAIARGQSK EEQRPHIQQL AWSWNWMDAS WGPWTLYIAS EYYNTENWNW
PANDSAIMKT PSVQDWPVRQ GQPTPNE