Gene Htur_4466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4466 
Symbol 
ID8745095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp54153 
End bp55847 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content61% 
IMG OID646515003 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003405950 
Protein GI284172568 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0117733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGCG CGATGGCAAT GGCCGGTTGT CTCGGGGGCA CATCGGGGGA GACAAACACG 
TTCGTCGAGC CGATAACGGT GGATCCGACG AACTATCAGT TCAACCATCT CAATCTGCAG
AACCCGTTCT CGCACGCGAT GCAGAGCGAC CAGCTCCAGC GGTACTTCAT CAACGACAGC
TCGTTCGAGT CGTACGCGGT CTCCATCGAG GAGTTCGAGG GCGAGACTGC CGTTCTGAAG
GTTCGGGACG GTCTCACGTG GCACAACGGT GACCCGGGAG ATCCGGTCGA CGCCGACGAC
CTCTACACGA AACTCGTCAC CGACGCGATC ATCGGTGACA CGATCTCGAC TCTCTGGACG
GACATCGAGC GGGTCGGTGA CAAGTCCGTC GAACTCACCC TCGACGGAAC GATCAACGAG
CAACTGTTCC GGGACGGACT GAACTACTAC TGGCTCGAGA CGCCGTTCCG ACTGTACAAG
GACTACGTCG AGCGGTGGGA GGACGCGACG AGCGACGACG AGATCAGCGA CATCCGGAGT
GACCTCCGTA ACGACTCGTT CGACGAGTCG AAGGTGAAGG GGAACGGTCC CTTCCAGTTC
GAGGAGCGGG ACAACCGCCA CCTCAAGATG ACCCTGTACG AACACCACCC CGACGCGGAC
AACATCAACT GGGAGAACTG GGAGCTCCAC AAGGTTTCGA CCGACACCGC GAGCGTGCTA
CTCGGCGGCG AAGTCGACGG AATTCGGAAT TTCTCCGCAC CCGAAACGGT CTTCGAGCAG
GCACCCGACG ACCTCCTGAA CGCGCAACTC CCCGCGCTGT GGGGGCACTC GCTTCCCTTC
AACCACGAGG ACGACGACTT CAGTAACATC CGGGTTCGAC AGGCCGTCAG CGAGTTCATC
GACAGGCAGG CCTGCGCGGA CAACTACGGC CGGCTCGGAC AGCCGGTCGA GGCGCCGAGC
GGGCTCGTCG GAAACATCGA CGGCCAGAAC GAACAGACCA ACCGATGGGA GGACAAGGTC
TCCGACGAGA TGGCCGAACA GCTCTACCGC TACGACGATC CGGAGCGCGG TCGACGACTG
CTCCGCAACG AAGGGTACCA GAAGGACGAC GGTACGTGGT ACCGTCCCAA CGGTGACCCG
TTCGAACTGA CGATCAAGGT CCCTGCGGGA TATACCGATT GGCATCCGGT CTACGAGACG
ATCGTCGATA ATCTCACGAA CGAAGGTATC GGTGCCGAAC TGCAGCGGAT CGAGGCGTCT
GCGTACTGGG CCGACCACTA CACCGCGGCG AACTTCAAGG TGGCGTCGAC GGGATGGACG
CTCCAGCGAT CCAGTCCGTA CTACGTCTTC GACATGTACT ACAACATCGA CACGGAGTTC
ATGAACCTCG ACCCCGAGAA CCTCGAGGCT CCGCCGATGG GCGAACCCGA TGGCGACCTC
CAGTCGGTCG ATCCGCGCGG TCTGATGGAC GAGTTGCTCG TCGCACAGGA CGAGGAGAGA
GCGACCGAGC TCACCGATCA GCTCGCGTGG ATCACTAACC AGACGGTCCC CATGCTCCAA
CTCAACGAGA TCAACGACCC GGTCTGGTTC TACACGGACG ACTGGGAGGT CCCGGAGCCG
GACTCCCCGA AGTACCAGTC GAAGTGGCCC CTCTGGTGGT TCCCGCGGAA CGGTGAGCTG
CAGGCGAAGG ACTGA
 
Protein sequence
MGGAMAMAGC LGGTSGETNT FVEPITVDPT NYQFNHLNLQ NPFSHAMQSD QLQRYFINDS 
SFESYAVSIE EFEGETAVLK VRDGLTWHNG DPGDPVDADD LYTKLVTDAI IGDTISTLWT
DIERVGDKSV ELTLDGTINE QLFRDGLNYY WLETPFRLYK DYVERWEDAT SDDEISDIRS
DLRNDSFDES KVKGNGPFQF EERDNRHLKM TLYEHHPDAD NINWENWELH KVSTDTASVL
LGGEVDGIRN FSAPETVFEQ APDDLLNAQL PALWGHSLPF NHEDDDFSNI RVRQAVSEFI
DRQACADNYG RLGQPVEAPS GLVGNIDGQN EQTNRWEDKV SDEMAEQLYR YDDPERGRRL
LRNEGYQKDD GTWYRPNGDP FELTIKVPAG YTDWHPVYET IVDNLTNEGI GAELQRIEAS
AYWADHYTAA NFKVASTGWT LQRSSPYYVF DMYYNIDTEF MNLDPENLEA PPMGEPDGDL
QSVDPRGLMD ELLVAQDEER ATELTDQLAW ITNQTVPMLQ LNEINDPVWF YTDDWEVPEP
DSPKYQSKWP LWWFPRNGEL QAKD