Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5035 |
Symbol | |
ID | 8745841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 21550 |
End bp | 22806 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646515649 |
Product | hypothetical protein |
Protein accession | YP_003406596 |
Protein GI | 284176320 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGAG CGTTCGCGTT GCTTGCGGTC TGTCTCGTCC TCCTGAGTGC CGCCGTGCCG GCCACAGCGG CCGCGGCGAC GACGGCCGAC GCTGGGGCCG CCGACGCATC GATGCCGGCC TACGCACTCC AAGAGGACAA CGAGAGCGAC GCCAACAGCA CGCCCGATAC CAGCGCACCG ACCAGCGCTG AACAGGTGCG GATCAATCCT GCCGGCCCGG ACGTCGAGTA CCAGTCGACC GAGGTCACCG AAGAGGATTC GACGTTCAAC ACGACCGGCG AGTTCGCCTA TTTCAGTCTG ACCGAACCCG TCGACGCCGT TCGGATCTCC CAATCGAAGG CGGAAGCGCG CGTACTCGAG GGCGGCCAGA CGGTCCAGGT CTCCTACGAG CCCGACGCGG CGCCGCCGGA CCAGACCTCG CTGTACACGC TCGAGGTGTT CTTCGAGGAC GGCTCCGAGA AAGACGTCGA GTTGTACGCC AGCGAGACCG ACCAGAGCGT CGAAGCCGCG GAACTGAAAG ACTGGGAGCC GACCATTGAG ACGCTGAAAG ACAAAGCCGA GGAAAACGGC TACGAGAAAA CACCGGAGGG CGCCGAGTCC TATGTGACGT GGGTCGACGA TCGTGCCCAG CTCGTCGATG GCTTCCTTAC GGAACTCGCC GCCCAGACGA TTGCCTGGGT CATCGCCGGC CTGATGAACC CGCTGAATAT CATCATCGGC CTCTCGCTGT TTGCGCTGGC AATGTGGCGG CGTCGTTCGA AACACGGCGA TATCGTCGAC GCCCTCTCGA GTATGACCGG GCGGTACGAA CAGGAACTCA CGAAACTGCG TAACGGCCGG CAGACGGCCA AACGCACCGC CGACGACGAC AAACTCTCGG AGGTGCCAGC GATCGGCTCG GAAGCGGACT ACTACGAGGA CGCGTTCGGG ACGAAAAGCC CGGCCCAACT CGCTCACCTT ACCGCGACCG GCGAAGCGCG AGCGACCAAC GACGGGCTCG AGATGGTCCA TCACGGTGTG GACGACCTCA ACACCGACGA CCTCCATGGG ACGTGGCTCG AGCCCGTCCT CCGGCATATC CCGAACGAAC GGCGAGTCCT GAACCACTTA CTTCAGAACA TCAAGTACAT GGAAACAGAA CATAATCTCG GATCGAACTA CCGAGAGACG CGCAACGAAC TCGAGGTTAT GCTCGACGAC CTCGAGCGCA AGCAAACGCA ACTGACCGGC ACGCCGGCCG CCGCGGGGGA TGATTGA
|
Protein sequence | MRRAFALLAV CLVLLSAAVP ATAAAATTAD AGAADASMPA YALQEDNESD ANSTPDTSAP TSAEQVRINP AGPDVEYQST EVTEEDSTFN TTGEFAYFSL TEPVDAVRIS QSKAEARVLE GGQTVQVSYE PDAAPPDQTS LYTLEVFFED GSEKDVELYA SETDQSVEAA ELKDWEPTIE TLKDKAEENG YEKTPEGAES YVTWVDDRAQ LVDGFLTELA AQTIAWVIAG LMNPLNIIIG LSLFALAMWR RRSKHGDIVD ALSSMTGRYE QELTKLRNGR QTAKRTADDD KLSEVPAIGS EADYYEDAFG TKSPAQLAHL TATGEARATN DGLEMVHHGV DDLNTDDLHG TWLEPVLRHI PNERRVLNHL LQNIKYMETE HNLGSNYRET RNELEVMLDD LERKQTQLTG TPAAAGDD
|
| |