Gene Information |
Locus tag | Huta_0545 |
Symbol | |
ID | 8382812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 550037 |
End bp | 551707 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644971607 |
Product | hypothetical protein |
Protein accession | YP_003129465 |
Protein GI | 257051632 |
COG category | [S] Function unknown |
COG ID | [COG3390] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
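The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. Neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(550037, 551707)
print(length)                  # 1671
print(protein_length(length))  # 556
```

Both results agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.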
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0168127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCG AGAACGGAGA CGGCGATGCT GACGACGGCG GGGACGACAG TCCAGGCCGT CGCGAGGTCG CCTATCGGGT GTTCGCTGCC GAGTTCGACG ACAGCGACTT CGATTACTCC GAGAGCGACG CCGACCGTGC GCCCAACTAC GTCGTGACGC CGACCGGCGC GCGCGTCAAT CGCCTGTTCG CCGTGGGCGT CCTGACGGAG GTCAGCCCGG CCGGTGAGGA CGTCCTCCGG GCGCGGATCG CAGACCCGAC CGGAACCTTC GTCGTCTACG CCGGCCAGTA CCAGCCCGAC GCCCAGACGT TCCTCGAACG CGCCGAACCG CCGGCCTACG TCGCCGTGAC GGGCAAGGCC CGGACCTTCC AGCCCGACGA TAGCGACGTC GTCTACACTT CGATCCGTCC GGAGAGTCTC AACGAGGTCG ACGAGGTGAC TCGCGACCGC TGGACGGTCG GGGCCGCCGA GGCGACGCTC GAACGCGTCG GGACGATGGC GACGGCGATG GAACTCGACG TCCGCGGCGA CGAACTCCGG GCGGCACTCG ACGAACGAGG TGTCGAACCA GGGCTGGCGG CCGGGATCCC GCTGGCGCTG GACCACTACG GGACGACACC GGGGTACCTG GCGGCGATGC GAACGCTGGC CACGTCGGCA CTGGAAGTCG TCGCGGGCGA GCGTAGCGAG GTCGAATCCC TGTCGCTGTC TCCAGATCAG GGCGGCGACG CTGATCTGGA CGCGCTTGCT GAGACCGCGT CGATCGGCGA ACCGGCGGGG GAGACTGTCG GGACGGCCGA ACAGGGGGAG AGTCCGGCGA CGGAGAACGA GGAGACGCCA TCGACCGGGG AGACGACATC GACAGTCGAT ACATCGGCGT CGACAGCTGA CGCGTCGCCA TCGGCCGATG ACACCGCTTC GACGACCGCG GACAGTGACG CAGGGGCGGC GTCTGAAGAG AGCGCGTCTG CGTCCACGGC TGACGTGTCC GGGGAGAGCG ACGCGGCCGA AGGCGATGGA TTCGACAGCG ACGAAACCCC TGAAGCCGAA TCTGCGACCG AGACGAGTGA GGAAAGGAGC GAAACGGATA CGGACACCCA AGCGGCAGAG ACGGATACCC CCACCGACGA TCCGAGTGAG GACACGGCCG ACCTCGAAGA CTTCGATGGC GAATTCGAAC TCGACGAGGA CGAGCGTGAG CAGATCAAAG ACGAGTTCGG GACCGAGTTC ACGTCCGGCG CGGAGGTCGA CGAGCCGGGC GAAGCCGGTA TCGAGACGCC CGACGAACCG ATCGAACCGG AGACGGCTGG GGACGACGAG GGGACTCCTG CAACCGAAGA GACGGCCGAC GAAGCGGCTC CTGCAGCCGA GAAAACGGAT ACGACGACCG AAGGCGAGCC GGAACCGGCT GCCGAGGAAC CGGGATCCGA AGAGCTGGCG AGTGGAGCAG ACGAATCCGC GGCTGAGCCG GAGGACATCG ATCTCGAAGA CGCGGCGATG GAGGCGATGG CGACGCTCGA CGACGGTGAC GGGGCCGACC GTGAGGCGGT CATCGAGCGC GTCCAGGACG CCCACGGCGT CGATGCGGCC GATGTCGAGG ACGCGATTCA GGATGCGCTG ATGAACGGCC GGTGTTACGA ACCCAGCGAC GGCCGGCTCA AGTCGATCTG A
|
Protein sequence | MSAENGDGDA DDGGDDSPGR REVAYRVFAA EFDDSDFDYS ESDADRAPNY VVTPTGARVN RLFAVGVLTE VSPAGEDVLR ARIADPTGTF VVYAGQYQPD AQTFLERAEP PAYVAVTGKA RTFQPDDSDV VYTSIRPESL NEVDEVTRDR WTVGAAEATL ERVGTMATAM ELDVRGDELR AALDERGVEP GLAAGIPLAL DHYGTTPGYL AAMRTLATSA LEVVAGERSE VESLSLSPDQ GGDADLDALA ETASIGEPAG ETVGTAEQGE SPATENEETP STGETTSTVD TSASTADASP SADDTASTTA DSDAGAASEE SASASTADVS GESDAAEGDG FDSDETPEAE SATETSEERS ETDTDTQAAE TDTPTDDPSE DTADLEDFDG EFELDEDERE QIKDEFGTEF TSGAEVDEPG EAGIETPDEP IEPETAGDDE GTPATEETAD EAAPAAEKTD TTTEGEPEPA AEEPGSEELA SGADESAAEP EDIDLEDAAM EAMATLDDGD GADREAVIER VQDAHGVDAA DVEDAIQDAL MNGRCYEPSD GRLKSI
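The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for a sequence written in space-separated 10-base blocks, as in this record. The helper name `gc_percent` is hypothetical, and the example is run only on the first block of the gene sequence, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence shown above.
print(gc_percent("ATGAGCGCCG"))  # 70.0
```

Passing the full gene sequence string to the same function would give a value to compare against the reported 68%.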