Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2758 |
Symbol | |
ID | 8385064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2830246 |
End bp | 2831706 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973833 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_003131652 |
Protein GI | 257053819 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0799133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCT ACGACGCCGG TGAGTTCACC CTGGAGAAGG TACGGGAGTA CGTCTGGGAG ATGCCACAGG AAGGCGACAT GCGCGTTCCC GCGCGGGTCC TCGCCAGCGA GGAACTCCTC GACGAGATCA GCGACGATCT CTCCTTGCAG CAACTCAAGA ACACTACCCA CCTCCCCGGG ATCCAGCAGT ACGCCCTGGC GATGCCCGAC GCCCACCAGG GGTATGGGTT CCCGGTCGGC GGCGTCGCCG GCATCGACGC CGAAACCGGC GTTATATCGC CTGGAGCGGT TGGTTACGAT ATTAATTGCG GCGTAAGGAT GTTGAAAACT AACCTTACGT ACAGCGACAT TCAGGGTCGC GAAGAGCAAC TCGTCGACGC GCTGTTCGAG GCGATCCCGT CGGGGTTGGG CGGCGGTGGT GTCGTCCAGA CCGACGTCGA CACACTGGAG GAAGTCCTCA CACGCGGCGT CGAGTGGGCG CTCGACGAGG GCTACGCCGT CGAAGACGAT CTCGCCCACT GCGAGGACGA GGGGGTCCGG CCGGGGGCAG ACCCCGACGC AGTCTCGAAG AAAGCAAAGG ACAGGGGCCG CCAGCAACTC GGGAGTCTGG GCAGCGGGAA TCACTTCCTC GAAGTCCAGC GCGTCACGGA CGTCTATCGC GAGGACGTCG CCGAATCTTT CGGTCTCAGC GAAGACCAGA TCGTCGTCCT GATTCACTGT GGCTCACGCG GCCTGGGCCA CCAGGTCTGT ACCGACTACC TTCGGGATAT CGAGCAGGCC CATCAGGGAC TCCTAGAGCA GTTGCCGGAC AAGGAACTCG CGGCCGCGCC CGCCGGGAGC CATCTCGCCG AGGCGTACTA CGGGGCGATG AACGCCGCGA TCAACTTCGC GTGGGTCAAC CGCCAGTTGA TCATGCACCG AACCCGGGAG GTCTTTGCCG ACGTCTTCGA CCGCGACTGG CGCGATATGG AGATGGAACT GTTGTACGAC GTTGCCCACA ACATCGCCAA GAAGGAGACC CATACCATTG AGGGCCAGGA TCGCGAACTC TTCGTCCACC GGAAGGGCGC GACGCGCGCG TTTCCCGCCG GTCACCCGGA AATCCCGGCC GCCTACCGCG ACGTCGGCCA GCCGGTCATC ATCCCCGGGA GCATGGGGGC CGGGAGTTAC GTCCTCCGTG GCGGCGAGCA TTCGATGGCG GAGACGTTCG GTTCGACGGC CCACGGCGCG GGCCGGCTCA TGTCCCGGAC GGAAGCCAAG AACACCTACT GGGGCGAGGA TGTCCAGGAC GACCTCCGCG ATCAGGAGCA GATCTACGTC AAAGCCGAGA GCGGCGCGAC CGTCGCCGAG GAGGCCCCGG GCGTCTACAA GGACGTCGAC GAGGTCGTGC GCATCTCCGA CGAACTCGGG ATCGGCGATC GGGTCGCCCG GACGTTTCCG GTCTGTAATA TCAAAGGGTG A
|
Protein sequence | MTTYDAGEFT LEKVREYVWE MPQEGDMRVP ARVLASEELL DEISDDLSLQ QLKNTTHLPG IQQYALAMPD AHQGYGFPVG GVAGIDAETG VISPGAVGYD INCGVRMLKT NLTYSDIQGR EEQLVDALFE AIPSGLGGGG VVQTDVDTLE EVLTRGVEWA LDEGYAVEDD LAHCEDEGVR PGADPDAVSK KAKDRGRQQL GSLGSGNHFL EVQRVTDVYR EDVAESFGLS EDQIVVLIHC GSRGLGHQVC TDYLRDIEQA HQGLLEQLPD KELAAAPAGS HLAEAYYGAM NAAINFAWVN RQLIMHRTRE VFADVFDRDW RDMEMELLYD VAHNIAKKET HTIEGQDREL FVHRKGATRA FPAGHPEIPA AYRDVGQPVI IPGSMGAGSY VLRGGEHSMA ETFGSTAHGA GRLMSRTEAK NTYWGEDVQD DLRDQEQIYV KAESGATVAE EAPGVYKDVD EVVRISDELG IGDRVARTFP VCNIKG
|
| |