Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0076 |
Symbol | |
ID | 8740639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 81526 |
End bp | 83427 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646510639 |
Product | hypothetical protein |
Protein accession | YP_003401650 |
Protein GI | 284163371 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACTGA ATAGCAATGG CAACCGAACG CTCCCGCGAG GCGTGTCTCG CCGACGATTC GTGGCGACGG CGGCCGGCAG CGCGGGGCTG CTGTCGATCG CCGGGACCTC GTCGGCCCAG GACGACGACG GCGGGTTCTC CGACGCTCGA CTCGTTTCGA CTCGGGATTT CGACGCCGCG ACGATCCAGA CGCTCTACAC CTATCGGAAG CGGCGCGCAC TCGAGGTGCT CCTCTCCGAC CCCGAGGTCC ACGCGGTCGC CGAGGACATG CGCTCGAGTT ACGAGGCGTA CGATCCGTAC ACGAACTCCC TCGACGCGGT CAGCGTGCAG GGCAGTCCGG ACGTCGAAAT CGAGGGCGAT CTCGACGAGG GGGTGTTCGA CGTTACGGCT GTCGACAGAC AGATCGCGTA CGGACTCGTC GATCGCCGGA CCGACTCCCT CGCCGCGCTG ACTATCACCG ACCCGCAGGA CGTCTCGTGG CGGGCTTGGG AGGTTGGCGA CGCGGGACTC GAGGAAGCGC GACTCCGACG GGTCTTCGAA GACGCCCGCG TACAGGAGTA CGTGGACGGC AACGACTGGT TTCCGTCGCT GCCGATCGCC GAATCGATCA CGGCGACCGG GGAGATCGAA CGCGGTGGCG TGATCCCGGT CGCCCTGTTC GTCGACGAGG GCGAAGCGAT CACCGCCGTG GTCGTCGATC TCGACGTGCG GAACGACGAC GTCGGTTCGG TGAGTGACGT GACGCGGGTC GAACGGTTCG TCGAGGTTCC GCCGCACGAA CTGGCGGCGA CCATCGTGCC GGCGGACGAC ACCGTCCTCG GAACCGTCCC GGCGGTCCCG CTCGAGCGGC GACCGTGGTA CACCGCGGTC GACGGCGGCC ACCGGATCGA GGAACCGCCG GAGCCGTTCG ATCGGGCCGG GTGGCGAATC GGGTGGGACG ACTCGGGCAA CCATGGCGTC GAGATCGCCG CCGAGTTCCG GGATCGCCCG GTGTTCGCGA GCCTCGGTTC ACCCGCGACC CTCAGCGGCT ACGGCCTCCC GGAACGCGAC GGCGAGAACA CGCTGGAGTG GTTCTTTCCG GACGACGAAT TCGCCTTCAG TGGCGACTTA CTGGTCTGGG ACGTCCATAG CGCCGCGCTC GGCGGACCGG GGCTGCTCGG CGTCGTGACG TATCCCGCAG GCGCGGATCG ACCGGCCGGA TTCCGGTTCA GATCCCACTA TCAGACCGGC GCTCGCGGCG CCGAGGGTCG AGACCACCGC TCCGGCTACC GGTTCGGGCA GTCCAGTCAC GAACTCGCCA CCGAGTTCTG GAACGACGGG ACGATCGTTC CGATCTGGCG CCGCCAGGGC CCGGGGTTCG TGACCGACTA TGCGTCCACG AGAACTGACA GCGTGTTTGC AGACGCCGAA ACGGATCTCG AGACCGATCC GGACACCGGC ACCGACGGGG ACGCGAGCGA GAACGGCGTT CCCCATCACT CGATCGCGAC GGTCGCGATG GACGTCACCC CCGGAACGAT CGACGGCGTC GAGATCGCAC GCTACGACGG CGACGAGTGG ACGACGCCGG AAACGGAGTT CTATCTGGTC GGCGAGCCCG GTACGGCCGT CCGGTTTTCC AACCCCGAGG GCCCGGAAAC GATCGGCGTT CCGCTCGAAG ACGGGATGGA GGTCGTCGTC GTTCGCCGGA GCGCCGGGGA AGTCCCGGGC GCCCAGCGAC TCGCGGATCG AGGGATCGAA ACCGCGTTCG TCCATCCGGC GCAGTACATC GGCGACGAAC CGATCCAGGG CGAACGCGTC GTCGCCTGGC TCCTGTTGGA GGCCGCGACG GGGCGACTCC CCCATCCGAC CGGGTCCACG TCCTTCGAAA CCCAGGCGAC GATGCGACTG TCCGGCTACT GA
|
Protein sequence | MVLNSNGNRT LPRGVSRRRF VATAAGSAGL LSIAGTSSAQ DDDGGFSDAR LVSTRDFDAA TIQTLYTYRK RRALEVLLSD PEVHAVAEDM RSSYEAYDPY TNSLDAVSVQ GSPDVEIEGD LDEGVFDVTA VDRQIAYGLV DRRTDSLAAL TITDPQDVSW RAWEVGDAGL EEARLRRVFE DARVQEYVDG NDWFPSLPIA ESITATGEIE RGGVIPVALF VDEGEAITAV VVDLDVRNDD VGSVSDVTRV ERFVEVPPHE LAATIVPADD TVLGTVPAVP LERRPWYTAV DGGHRIEEPP EPFDRAGWRI GWDDSGNHGV EIAAEFRDRP VFASLGSPAT LSGYGLPERD GENTLEWFFP DDEFAFSGDL LVWDVHSAAL GGPGLLGVVT YPAGADRPAG FRFRSHYQTG ARGAEGRDHR SGYRFGQSSH ELATEFWNDG TIVPIWRRQG PGFVTDYAST RTDSVFADAE TDLETDPDTG TDGDASENGV PHHSIATVAM DVTPGTIDGV EIARYDGDEW TTPETEFYLV GEPGTAVRFS NPEGPETIGV PLEDGMEVVV VRRSAGEVPG AQRLADRGIE TAFVHPAQYI GDEPIQGERV VAWLLLEAAT GRLPHPTGST SFETQATMRL SGY
|
| |