Gene Htur_4449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4449 
Symbol 
ID8745078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp32951 
End bp34765 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content63% 
IMG OID646514986 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003405933 
Protein GI284172551 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAGG GTAACAACTG GCGGAACAGC GGTCCTAGCC GACGAGATGT CCTGAAATAC 
AGCGCTGTGG GTGGCACAGC CCTCGTTGCC GGATGTCTCG GTGGGAGCAG TAGCACAGAT
CGATTCCGAG TGTTCGACCC GGAGACGAGC GGGACGCTGC CGTCCCAGCG TCACTGCAAC
CCCTACAACC CGACCCAGCG CGGGACGTGG CACCCAGGGG CGCTCATCTT CGACCGACCC
GTGATGTACA GTCCGGCGGA GGACGCGGTC TATCCCCTGA TCGCGACCGA CTGGGAGATG
GCCGACGACA CCACGCTGGA GATGACGTTC AGCGAGGACT GGACCTGGCA CAACGGCGAC
GACCTCACCG CCGAGGACTG GGTCATGCAG CTGCAGATGT CCCTCGCGAT CCTCGAGTAC
CAGGCGGAGG ACGGGGCGCG ACCACACCAG TTCATCGAGT CCGCCGAGGC GCCCGACGAG
TACACGGCGC AGATCAACTT CCACGACCCG CTCTCGGAGA CAGTCGCGGT CCAGAACGCC
ACCGCCGATC TCGTGGGTGA CGAGAGTCGC GGCATCTTCA CCAATTCCTC GGACGACCAG
TGGACCGACT GGCACGAGCA GTTGCAGAAC GCCGACGACT CCGGGATGGA GTCCATCCTC
GAGGAGATCA CGTCGGAGGG GTATCCGAAC CTCGAGGACG CGAACGGGAA CGGTCCGTTC
CAGGTCGCCG ATATCGGGGA CAACGTGATG GTCTTCGAGA AGTACGAGGA CCACCCGAAC
GCCGACAACA TCAACTTCAG CGAGTACTCG ATGCACCTCT ACGAGAACAA CAACCCGACC
CAGCCGTACG CAAACGGCGA GGTCGACGCC GCGCACACCC AGTTCCCCGT CGAGGACGAC
GTCAAGAGCC AGCTCCCCAG CGGGCACACG CTCATCAAGG AGAGCTTCTC GACCAACAAA
CTGTTCACGT TCAACTGCGG CCACGACGTC TCTTACGACA CGCCCTTCTC GAGCGTGAAC
GTCCGCAAGG CGGTCTGTCA CGTCTTCGAC CGCCAGCAGG TCACCGAGGT CCTCGAAGGC
GTCAACCGGA TGTTCGACTG GGCGCCGTGT CGCGTTCCCG GAAACGTGCT GGACAGCGGT
ACTCACGACG CCGCGGAGTG GGTCCAGGAC TTCCCCAAGT ACGGCCAGAA CGACACCGAG
CGCGCTACCG AACTCCTCGA GGAGGAGGGC TACACGCTCG AGGACGGGCA GTGGTACACG
CCGGACGGAG AGGAGTTCGA GATCAACATC ATGAACGGCT CCGAGCGGAA GGACTTCGGC
GTTCTCAAAC AGAACCTGAA CGACTTCGGC ATCAAGACGA ACCAGGAGCA GGTCGACGAC
GCGACGTTCG ACGAGCGCCG ACAGAACGGC GAGTACGACA TGATGCCCGA CGGTTCCTCG
GCCAACGGCG TCCGCGCGAT GTGGGCGCTC GATCTCGTGC CGAACTGGAT CCAGTCGATC
TCGCACTTCG ACCCCAACGC GGAGATCCCG ATGCCCGTCG GCGACCCGGA GGGCTCCGAG
GGGACGAAGG AGATCAACGT CGAGGAGCAC ATCCGTCAGT GGCAGGTCAC CGACGATGAT
CAGTACCACA AGGAACTGAT GTGGTGGTGG AACCAGACCC TCCCGGAGAT GGAAGTGATG
TTCCAGCCCG ACGCCGGCGC CTACAACGCC GACAACTGGG AACTCGACGC GCCGGACGGC
ATCATCGACG GCACCGAGGA CGCGCTCTAC CTCATCCCAA AGACGGACGA GGCGAGCATG
GAGTACATCG GGTAA
 
Protein sequence
MPEGNNWRNS GPSRRDVLKY SAVGGTALVA GCLGGSSSTD RFRVFDPETS GTLPSQRHCN 
PYNPTQRGTW HPGALIFDRP VMYSPAEDAV YPLIATDWEM ADDTTLEMTF SEDWTWHNGD
DLTAEDWVMQ LQMSLAILEY QAEDGARPHQ FIESAEAPDE YTAQINFHDP LSETVAVQNA
TADLVGDESR GIFTNSSDDQ WTDWHEQLQN ADDSGMESIL EEITSEGYPN LEDANGNGPF
QVADIGDNVM VFEKYEDHPN ADNINFSEYS MHLYENNNPT QPYANGEVDA AHTQFPVEDD
VKSQLPSGHT LIKESFSTNK LFTFNCGHDV SYDTPFSSVN VRKAVCHVFD RQQVTEVLEG
VNRMFDWAPC RVPGNVLDSG THDAAEWVQD FPKYGQNDTE RATELLEEEG YTLEDGQWYT
PDGEEFEINI MNGSERKDFG VLKQNLNDFG IKTNQEQVDD ATFDERRQNG EYDMMPDGSS
ANGVRAMWAL DLVPNWIQSI SHFDPNAEIP MPVGDPEGSE GTKEINVEEH IRQWQVTDDD
QYHKELMWWW NQTLPEMEVM FQPDAGAYNA DNWELDAPDG IIDGTEDALY LIPKTDEASM
EYIG