Gene Information |
Locus tag | Htur_4474 |
Symbol | |
ID | 8745103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 64016 |
End bp | 65926 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515011 |
Product | protein of unknown function DUF1680 |
Protein accession | YP_003405958 |
Protein GI | 284172576 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
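The length fields above can be cross-checked from the coordinates. The sketch below is a minimal illustration, not part of the source record: it assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the Gene Information table for Htur_4474.
length = gene_length_bp(64016, 65926)
protein = protein_length_aa(length)
print(length, protein)  # 1911 636
```

Under these assumptions the computed values agree with the reported Gene Length (1911 bp) and Protein Length (636 aa).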
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0254713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCCAC GAGATGCGGT TACGGACGGG ATTTCGCTCT CCGAAGTCGC GATCGACGAC GGATTTTGGT CGCCGCGACG GGAGCGAAAC AGGGACGTGA CGATCGAGTA CCAGTACGAA CAACTCGAGG AGTCGGGCTC CCTCGAGAAC TTCCGGCGCG TCGCCGCCGG GGAGTCCGGC GACTTTCAGG GGATGTGGTT TCAGGACACG GACGCGTACA AGTGGATCGA AGCCGCCTCG TACGTGCTCG CCCAGCGCGA CGATCCGGAG CTGGAAGCGA AGGTCGACGG GGTCATCTCG CTGATCGCCG ACGCACAGCA GCCCGACGGC TATCTGAACA CGTACTTCTC GCTGGTCGAA CCCGAGAACC GGTGGACGAA CCTCCACATG ATGCACGAAC TCTACTGCGC CGGGCATCTG ATTGAGGCCG CGGTCGCCCA CTACCGTGCA ACGGAGAAGG AGACGCTGCT CGAGGTGGCG GTCGACTTCG CAGATCTTGT CGACGACGTC TTCGGCGACG AGGTCGAGGG CGTTCCCGGC CACGAGGAGA TCGAACTGGC GCTCCTGAAG CTCTACCGGG TCACCGACGA GACGCGGTAT CTCGAGCTCG CGAAGTACTT CATCGACCTC CGCGGGAAGG ACGACCGACT GGCGTGGGAG ATCGACAATC CGGAGACGCT CGGCGGCGGC GAGTACGAGG ACGGTTCTAT CATTCCTGCC GCACGGGACG TCTTCACCCA CGAGGACGGA ACGTACGACG GGCGGTACGC CCAGGCCCAC GAGCCGCTCC GGGACCAGGA GACCGTCGAA GGCCACTCCG TCAGGGCGAT GTACCTGTTC GCGGCCGCGA CGGACCTCGC CATCGAGACG GGCGAAGACG AGTTGATCGA GTCCCTCGAG CGCCTCTGGA CGAACATGAC GACAAAGCGG ATGTACGTCA CCGGCGGGCT CGGTCCCGAG GAAGCCCACG AGGGGTTCAC GACGGACTAC GACCTCAGGA ACGACGCCTA CGCCGAGACC TGCGCCGCGA TCGGGAGCGT CTACTGGAAC CAGCGGCTGT TCGAACTCTC CGGCGAGGCG AAGTACGCCG ACCTCATCGA GCGGACGCTG TACAACGGCT TCCTCGCCGG CGTCTCGCTG GACGGGACCG AGTTCTTCTA CGAGAATCCC CTCGAGAGCG ACGGCGACCA CCACCGAAAG GGTTGGTTCA CCTGCGCGTG CTGTCCGCCG AACGCGGCCC GGCTGCTCGC GTCGCTGGGC GAATACGTCT ACAGCCAGCG GGACTCGGCG ATCTACGTCA ATCAGTACCT CGGCAGTAGC GTCACGACGG CGGTCGACGG CGCGACCGTC GAGCTATCGC AGGACAGCTC CCTCCCGTGG TCGGGCGAGG TGACCGTCGA CGTCGACGCC GACGGGGCGT CGGTGCCGCT CCGACTCCGC ATCCCGGAGT GGGCCGAGTC GTCGACGGTG ACGGTAAACG GCGAGTCGGT CGAGACGCCG TCCGAGGGCT ACCTCGAGAT CGAACGGGTG TGGGACGACG ACCGGATCGA GCTGACCTTC GAGCAGACGG TCACGCGGCT GGAAGCCCAT CCTGACGTGG CGGCCGACGC CGGTCGCGTC GCCCTCAAAC GCGGGCCGCT CGTCTACTGC CTCGAGGCGA TCGACAACGA CCGGCCGCTC CACCAGTACG AGGATCCGTC GCCCACCTCG ACGACGCACC GACCGGACCT ACTCGAGGGC GTCACGGTCA TCGAGGGCGA AGCGAGCGTG CCGGACCGGG CGGGTTGGGA CGGCCGACTG 
TACCGGCCGG CCGACGAAAC GGCTCGAGAA CGAACCGAGT TCACGGCGGT CCCGTACTAC GCGTGGGACA ACCGCGAGCC GGGAGCGATG AGAGTCTGGA TTCGGTCGTA G
|
Protein sequence | MGPRDAVTDG ISLSEVAIDD GFWSPRRERN RDVTIEYQYE QLEESGSLEN FRRVAAGESG DFQGMWFQDT DAYKWIEAAS YVLAQRDDPE LEAKVDGVIS LIADAQQPDG YLNTYFSLVE PENRWTNLHM MHELYCAGHL IEAAVAHYRA TEKETLLEVA VDFADLVDDV FGDEVEGVPG HEEIELALLK LYRVTDETRY LELAKYFIDL RGKDDRLAWE IDNPETLGGG EYEDGSIIPA ARDVFTHEDG TYDGRYAQAH EPLRDQETVE GHSVRAMYLF AAATDLAIET GEDELIESLE RLWTNMTTKR MYVTGGLGPE EAHEGFTTDY DLRNDAYAET CAAIGSVYWN QRLFELSGEA KYADLIERTL YNGFLAGVSL DGTEFFYENP LESDGDHHRK GWFTCACCPP NAARLLASLG EYVYSQRDSA IYVNQYLGSS VTTAVDGATV ELSQDSSLPW SGEVTVDVDA DGASVPLRLR IPEWAESSTV TVNGESVETP SEGYLEIERV WDDDRIELTF EQTVTRLEAH PDVAADAGRV ALKRGPLVYC LEAIDNDRPL HQYEDPSPTS TTHRPDLLEG VTVIEGEASV PDRAGWDGRL YRPADETARE RTEFTAVPYY AWDNREPGAM RVWIRS
|
| |