Gene Information |
Locus tag | Htur_4581 |
Symbol | |
ID | 8745400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 167292 |
End bp | 168329 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646515098 |
Product | hypothetical protein |
Protein accession | YP_003406045 |
Protein GI | 284172663 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
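The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 167292
END_BP = 168329
REPORTED_GENE_LENGTH_BP = 1038
REPORTED_PROTEIN_LENGTH_AA = 345


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length (bp):", computed_gene_length)
print("protein length (aa):", computed_protein_length)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length rows of the table.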
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00340082 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTCC GGACATATCC GGTGCGCACC AATGACACGT TCCCGGTCCG ACTCGAAACC GCGTCCGATG TAGCCGAGGC AATGGTCACT GTCGCCGGTG TTGACCCGTG TGATTCCCAG TACGTGTCTC CTGATCGTGA CCAGACACTA ACGGTTCAGC CAAGGACAAC GGTCATCTTC GAGGTGGATG CGGAACTGCT GAAGCAACAG GGAACTACGC ACTGGTACGT TGACGGCGAT TACGTCACAA CCCCGGCAGG TCCGTGGCCT TCAACGTATT TCGCAGAGAT TGGCCGGGAG ATCTTCACGC ACACGTTTGA CTCGGAAGGG ACGCACCTCG TAGATACGGC CGTGGTAGCT GACGAGGGGA ACTCTGCCTC ACGGTGGGAG GTGACGGTGG CTGACAACGG CGCTGCACCG CCAACCGTCA ATGCTTCCCG GCCAGCGACC AGTGAACTTG CTGCCGACGA AACGACCACG CTCGAACTTG AGGTCTTGAG CCACAGCACC GAACTGAATC GTGTCGTGTG GTGGATGACG CAGTCGGACG TGATCCTTGA TGTCTCCGAT ATCGAAGGAA GGAGTGACAC CGCGTCGGTC ACGGTTGATG GTGGCTGCCA TACGTGTCAG ATTGAGGCGT GGGTGATCGA TGAGAACAAT ATGTTTACGG CGGTGAATCC ATGGGTGTTT GAAGGCTTTG ACGCGGCTGA TGACGGTGGT GGTGGGAACG AGGGCAATGT TGCTGTAAGT ATTCAGGGAA CGAATAGCCC GGTGACCGGC GGTGAGGTGC TGGAGGTGAC TGCTGCAATC GAGAATACGG GGTCGTCGGA GGTGACACGC ACGGTTGATC TCGTGGTCGG TGAAGATCCG GAGACCGTGG ACAGTCGGAC GGTGACGATT CCTGCCGGTG GGACAAAACG GTTCACTCTC GTGTTCGAGA CGTATCCGGT GAAGCAGGAC GATTCGTTCC CTGTCCGCGT GGAGGCTGAG GGGAGTTCCG ATGTACGTAC TGTCACTGCG TATGGAACGG AGTCATAG
|
Protein sequence | MQFRTYPVRT NDTFPVRLET ASDVAEAMVT VAGVDPCDSQ YVSPDRDQTL TVQPRTTVIF EVDAELLKQQ GTTHWYVDGD YVTTPAGPWP STYFAEIGRE IFTHTFDSEG THLVDTAVVA DEGNSASRWE VTVADNGAAP PTVNASRPAT SELAADETTT LELEVLSHST ELNRVVWWMT QSDVILDVSD IEGRSDTASV TVDGGCHTCQ IEAWVIDENN MFTAVNPWVF EGFDAADDGG GGNEGNVAVS IQGTNSPVTG GEVLEVTAAI ENTGSSEVTR TVDLVVGEDP ETVDSRTVTI PAGGTKRFTL VFETYPVKQD DSFPVRVEAE GSSDVRTVTA YGTES
|
| |
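The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch: it builds a codon table using the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). Only the first three and last two 10-base groups of the gene sequence are used, and the helper names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove the grouping spaces used in the record and upper-case the bases."""
    return "".join(sequence.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups of the gene sequence (bases 1-30).
head = "ATGCAGTTCC GGACATATCC GGTGCGCACC"
# Last two groups (bases 1021-1038); base 1021 starts a codon because 1020 is divisible by 3.
tail = "TATGGAACGG AGTCATAG"

print(translate(head))  # MQFRTYPVRT
print(translate(tail))  # YGTES
print(gc_fraction(head))
```

The two translated fragments match the first ten and last five residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would give a figure to compare with the GC content row.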