Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0921 |
Symbol | |
ID | 8383194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 887466 |
End bp | 888797 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644971985 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003129837 |
Protein GI | 257052004 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAACG TCGCGCTCTC GGCAGCACAA CTTGTCTTGG CGCTGATTCT GGTGGTTTTA AACGGCTTCT TCGTCGCTGC GGAGTTCGCC TTCGTTCGGG TTCGGGGGAC GTCGGTCGAC CAGCTCGCCG AGGAAGGGCG GCCCGGCTCG GCGACGCTGC AAGAAGTGAT GACGAATCTC GATAACTACC TCGCCACGAC GCAACTCGGC ATCACCATCG CCTCGCTCGG ATTGGGGTGG GTCGGCGAAC CCGCCGTGGC GGCGCTCATC GAGCCAATAC TGGAATCGGT TCTCCCGGCG AGTCTCATCC ATCTCGTGGC GTTCGCAATC GGATTTAGTA TCATCACGTT CCTCCACGTC GTCTTCGGTG AACTCGCGCC GAAGACGATC GCAATCGCCC AGACCGAGCG ACTCTCGCTG TTTCTCGCCC CGCCGATGAA GTTCTTTTAT TTCATACTCT ATCCGGGTAT TGTCGTCTTT AACGGGGCGG CTAACGCGTT CACGCGGTCG CTCGGTGTGC CGCCCGCTTC CGAGACGGAT GAGACACTCG GTGAGCGAGA GCTCCTCCGG GTACTCACAC GATCGGGTGA GGTGGGGGAC ATCGACCTGG CAGAAGTGAC GATGATCGAG CGGGTCTTCG ATCTCGACGA CATCGTGGTG CGGGAGGTCA TGGTTCCACG GCCTGACGTG GTGAGCGTTC GGGCGGATGC CGCGCTCTCG GACCTCCAGT CGATCGTCCT CGAGGCTGGT CATACCCGCT ATCCAGTGCT TGCCGCCGAG GACGGCGATC AAGTGATCGG ATTCGTGGAC GTCAAGGACG TGTTACGAGC GGAGGTGGAG GGTGGGGACG CCGAGTCAGT CGGCGACATC GCCCGCGAGA TCGCGATCGC CCCCGAGACG ATGGCACTCA GCGATCTCCT GAGACAGTTC AGGGAAGACC AACAGCAAAT GGTTGCAGTT ATCGACGAGT GGGGGGCGTT TGAGGGGATC GCAACGGTTG AAGACGTCGT CGAGGCACTC GTCGGGGACC TCCGGGATGA GTTTGATATG GACGAGCGCG AACCCTCGAT TCGCCCGCGT GATGATGGGG GATACGACAT TGATGGGGGC GTCCCGTTGT CAAAAATCAA CGACATGATC GAGGGGGAGT TCACGAGTGA CGAGGTCGAA ACGATCGGTG GGCTGGTACT CGAGCAACTC AACCGTGCGC CGGAACGTGG CGATCGCGTT GCGGTCGCCG GGTACGTCGT CACGGTGACG AGCGTCGAGG GGTCCCGAAT TTCGACGATC CGGGTCCAGG AACGTCAAGA GGGCGACTCA GCAGTAGACT GA
|
Protein sequence | MVNVALSAAQ LVLALILVVL NGFFVAAEFA FVRVRGTSVD QLAEEGRPGS ATLQEVMTNL DNYLATTQLG ITIASLGLGW VGEPAVAALI EPILESVLPA SLIHLVAFAI GFSIITFLHV VFGELAPKTI AIAQTERLSL FLAPPMKFFY FILYPGIVVF NGAANAFTRS LGVPPASETD ETLGERELLR VLTRSGEVGD IDLAEVTMIE RVFDLDDIVV REVMVPRPDV VSVRADAALS DLQSIVLEAG HTRYPVLAAE DGDQVIGFVD VKDVLRAEVE GGDAESVGDI AREIAIAPET MALSDLLRQF REDQQQMVAV IDEWGAFEGI ATVEDVVEAL VGDLRDEFDM DEREPSIRPR DDGGYDIDGG VPLSKINDMI EGEFTSDEVE TIGGLVLEQL NRAPERGDRV AVAGYVVTVT SVEGSRISTI RVQERQEGDS AVD
|
| |