Gene Htur_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1456 
Symbol 
ID8742047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1513870 
End bp1515468 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content66% 
IMG OID646512032 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003403015 
Protein GI284164736 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCTG ATCAGAGGGA CAAGCGAACG CACAGAACGA CGCGACGCGA CCTGCTCGTC 
GCTCTCGGCG GTGCGAGCAC GGCAGCGCTG GCGGGCTGTT CGACGACGCT GGGAACGGAC
TCGGGCTCGA CGCTCCGCGT CGGAACGCTC CGTCCGCCGC TCTCGCTCGA TCCGATCACC
GCGCGGGCGA TCGGCTCGGA GCAGGCGATC GATCGGATTT TCGAGGGGCT CTACGGCTAC
GGCGAGGGAA CCGATATCGT TCCCGCGATC GCGGCCGGCG AGCCGGAAAT CGCCGACAAC
GATCGAGAAG TCGTCGTCGA ACTCGACGAC GGCGCGCGAT TTCAGAACGA CCGAGCGGTG
ACCGCCGAGG ACGTGGTCTA CTCCTACACC GCGCCGCTCG AGGAGGACGC GCCGACGGAA
TGGCTGGCGA GCCCGTTCGA CTCGGTCGAG TCCGACGGCG AGCACACCGT TCGGTTCACG
CTGGCGGAGC CGTACCCGGC GCTCGAGCAC GCGCTGACAC ATCCGATCGT GCCCCGACAG
GAGCGCGAGG ACGACAGGGA GGCGTTCGCC ACGAACCCGA TCGGCGCCGG CCCGTTCGAG
GTCGCGTCGT TCAGCGCGGA GAAGAAGACC ACGCTCCGTC GCTGGGACGA CTACTGGGGC
GAGACTCCAC CCGCGATCGA TCGGTTCACG ATGGTCTACG TCGAGTTCCC GGTGACTCAG
CTGACCAGCC TCCGGACGAA TCGCAACGAT CTGATCGAGC CGGTCTCACC GCTGATCGTC
GATCACGTCA GCGACGTCGC GAACGCGTCG GTGAAGCGCC AGCAGGGATA CACGTCGTTT
TACTTCGGCT TCAACTGCAA CGAGGGGCCG ACGACCGACC CCCGAGTGCG AGAGGCGATC
AGCTACTGCA TCGACCTCGA GAAGGCGGTC TCCGAGTTCG TCGAGCCGAT GGGCCAGCGT
CAGTACAGCC CGCTGCCGCC GCAGGTCGCC GAGGAGTGGA ACATGCCGAC CGACGAGTGG
GCCGAACTCG CGAACGAGCA GAACCCCGAA CGCGCCCGTG ACCTCTTTCG CGAGGCCGAC
GCGGCCAGCG GTCAGCTTCG CATCCTGACC TCGACGGATC CGAAACACAA AGAGTTCGGC
GAGGCGCTCG CCGGCGGCCT CCGGGATGCC AGCCACGGCG CGCTCACCAT CTCGACGTCC
GAAACGAAAT TCCTCGAGCG GCACGTCACC GGCTCCGAGC GCGACTACTC GGTGTTCGTC
GGGGAGATCA CGGGGACGCC CGATCCGGAC ACCCATCTCT ACCCGACGTT CCACGAGAAC
ATGACCGGCG TGACGAACGG GACCTTCTAC CGCGAGGACG CGGTCATGGA ACGGCTCGCG
TCGGCGCGAA CGACGACGGA TCGCGAGCAG CGGCGCGATC TCTACGAGAC GGCGATCACC
CGATTGCTCG AGGATCGCGT CTGCCTGCCG ATCTGCTCGT TCGAGAACAG CTTCGCCGTG
GATGCGGGCG TCGAGAACTT TCGCGTCCAC CCGATCGCGC GGGTCAATCC CCGGCTCGTG
TGGGAGGACG GCGTCGTGAC AGTGGGGTCG GAATCATGA
 
Protein sequence
MMADQRDKRT HRTTRRDLLV ALGGASTAAL AGCSTTLGTD SGSTLRVGTL RPPLSLDPIT 
ARAIGSEQAI DRIFEGLYGY GEGTDIVPAI AAGEPEIADN DREVVVELDD GARFQNDRAV
TAEDVVYSYT APLEEDAPTE WLASPFDSVE SDGEHTVRFT LAEPYPALEH ALTHPIVPRQ
EREDDREAFA TNPIGAGPFE VASFSAEKKT TLRRWDDYWG ETPPAIDRFT MVYVEFPVTQ
LTSLRTNRND LIEPVSPLIV DHVSDVANAS VKRQQGYTSF YFGFNCNEGP TTDPRVREAI
SYCIDLEKAV SEFVEPMGQR QYSPLPPQVA EEWNMPTDEW AELANEQNPE RARDLFREAD
AASGQLRILT STDPKHKEFG EALAGGLRDA SHGALTISTS ETKFLERHVT GSERDYSVFV
GEITGTPDPD THLYPTFHEN MTGVTNGTFY REDAVMERLA SARTTTDREQ RRDLYETAIT
RLLEDRVCLP ICSFENSFAV DAGVENFRVH PIARVNPRLV WEDGVVTVGS ES