Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3887 |
Symbol | |
ID | 8744515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 119260 |
End bp | 121065 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514471 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405418 |
Protein GI | 284167140 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.544175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGTA ACGGATACCC CGTGGGAAAG ATGATCAAGG ACCACAGTCA TTTGAGTAGA CGTAAATTTG TCGGGGCCAG CGCCGGAACG CTCGCTGCGA CGCTCGCTGG GTGTGTCGGT GGCGGCGACA ACAGTACGGA GTTCGTCACG GCGTTCGAAG GGGGTCGTCC GCCGACGGAG GTCCACTTCA ATCCGTGGAA CGCCTCGGAC CACGCACAGA CGTACAGTAT CTACTGGACA CAGGAAACGC TCGCGACGCA TTCTGACGGG ACCGTCTCGA CCGATTTCTT CGAGGACATC AGTGTCGACG GCCGCGAGGT CACGATCAAG TTCTCAGACA AATGGAACTT CTGGAACGGC AACGACATCA CCGCCGAAGA CTACTTCATC GAGGCGGAGC TCTGGCGCTA CCAGGACCCG GAGGCTTCCC CCCTCGAAGG CCACGAACTG GTCGACGACT ACACCGTCAA ACGAATCTAC AAGAACGAGG TCTCGCCGGT TATCGCGAAA TCGAACGCGG GTCTCGGGAC GAGCGCCCCG AAATCGGTCT TCCGAGAGTA CTACGAGCGC TACGAGGACG CGGGCGGAGA AAGCGGCCGC CAGGCGGTTA CCGAGGATCT CCTTCAGATG ACGATCGATA CCGAGGAGTT CGTCGAGGAG GGATACGGAA GCTCGCTGTT CAAGATCGAG GACTTCAACT CCTCCGAGAC GCTGGCGACC AAGTGGGAGG ACCATCCGTG GGCCGACGAA ACGGATATCG AGCAAATTCG GGTCCTTCCG AACGTCGAAT CGGGGACGCA GGTCGAGCAG CTCGAGAAGA GTGACAAGCT CGACATGACT CAGTACATCA CCGAGAGCCA GCGCCCGGAC TACCCCGACA ACATCGAGAA TATCTACGAG TTGAGCCACT ACAACTGCCA GAAGTTCATG CTGAACTGGA ACAACGAGCA CCTCGCGCGG CGGCCGGTTC GCCGCGCGAT CATCTCCGCG ATCGACATTC CCGCGATCAT CGACGCCGCG ACGCAGACGG GAATGCTCGC GAGCCCGACG CAGGTCCAGA CGGGAATCCG AGAGACCATC GAAGAGGAAT ACCTCGGTGA GGACTTCGTC GACCAGCTCA TCGACTACCC CGTCGAGGCC GACGAGGAGA CGGCGATCGC CTACATGGAG GAGGCCGGCT ACTCGCGGGA GGGCGACGAG TGGATCAGTC CCGACGGCAA CGCGACTGAC TTCACCATCA TCACGCAGTC CGCCGTTTCG CAGTCCCAGC CGACGAAAGT CTTCACCGAC CACCTCAACG AGTTCGGGCT GAACGCGGAG ATGGAGGCCA TCGGTCAGGA CTACTACTCG CGGGTCCAGG AGTGGGAGTT CGACATCGCC TGGATGTGGC ACGTCGCACT GCCGTACTGG CATCCCATGG CGTACTTCTC GAACAACTTC TACGGCCTCC TCGCCGGCGA CGTCAACAGT GATAGCGACA CGGGACCGAC CGGCGTGCCG TTCTCGCTCG AGATCCCCGA GGAGGTCGGC GCGACGGAAG TCGAGGGTAA CGGCGTCGAG ATCAACCCGG CCCAGCTCAT GGTCGACCTC GAGGGCGCAT CGTCCGAGGA GGAGACGAAG GAACTGACCC GAACGCTCGT CCAGTGGGTC AACTTCGACC TACCCGCGAT CATCCACTTA CAGGAGAGCC GCGGCTTCGC CGGCGACGTC GAGAATTTCG ACTTCCCGAG CGAGGACGAG TTCCGAATGG ACCGTCCGAA CCCGGGACCG TTCGCGCTGC TGCGAGGACG TATTTCGACG AATTAG
|
Protein sequence | MGRNGYPVGK MIKDHSHLSR RKFVGASAGT LAATLAGCVG GGDNSTEFVT AFEGGRPPTE VHFNPWNASD HAQTYSIYWT QETLATHSDG TVSTDFFEDI SVDGREVTIK FSDKWNFWNG NDITAEDYFI EAELWRYQDP EASPLEGHEL VDDYTVKRIY KNEVSPVIAK SNAGLGTSAP KSVFREYYER YEDAGGESGR QAVTEDLLQM TIDTEEFVEE GYGSSLFKIE DFNSSETLAT KWEDHPWADE TDIEQIRVLP NVESGTQVEQ LEKSDKLDMT QYITESQRPD YPDNIENIYE LSHYNCQKFM LNWNNEHLAR RPVRRAIISA IDIPAIIDAA TQTGMLASPT QVQTGIRETI EEEYLGEDFV DQLIDYPVEA DEETAIAYME EAGYSREGDE WISPDGNATD FTIITQSAVS QSQPTKVFTD HLNEFGLNAE MEAIGQDYYS RVQEWEFDIA WMWHVALPYW HPMAYFSNNF YGLLAGDVNS DSDTGPTGVP FSLEIPEEVG ATEVEGNGVE INPAQLMVDL EGASSEEETK ELTRTLVQWV NFDLPAIIHL QESRGFAGDV ENFDFPSEDE FRMDRPNPGP FALLRGRIST N
|
| |