Gene Information |
Locus tag | Huta_3001 |
Symbol | |
ID | 8385310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 3091344 |
End bp | 3092933 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644974079 |
Product | hypothetical protein |
Protein accession | YP_003131895 |
Protein GI | 257054062 |
COG category | |
COG ID | |
TIGRFAM ID | |
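
The length fields above are consistent with the coordinates, assuming (this is conventional for such records but not stated here) that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Huta_3001.
length_bp = gene_length(3091344, 3092933)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the result can be compared directly against the Gene Length and Protein Length rows.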
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.303389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCGG TCCCGTTGCA GTTCTCCGTC CGAACGAGCA CCGTGCTGGC TGGCGCTGCG GAGATCGGCG GGCTCGCGGT CCTGGCCGCG GCGGTGGCCG GAGCCGCCGC CATCGCATAT CGGTGGTACG CTCGCGAGCG CGTTTCGAAC CCGCCGGCAG TGCTGATTGC GCTCGCCACA GTCGCCTTCT TCCTCCACGC GATGACTGCC CTCGGACAGG TCATGGACGG GGCCGATCCG CTGACCGTTC GGGCCGCCGC CTTCAACGTG AGTGCACTCG CGGTCGGCAC CGTCGCGGCC ATGCTCGGCA TCGCGGTAGG TGATCATCTG GCAAGGACTG TTCTCGGCGG GACCGAGCAG GTCGCTCTCG ACGATTCGGT AAGCCGAGTC GTTCGGGCCG TCGGTCGGGT CATCACTGTC GAACTCCCCG CGGAGATCGA CGACGTTCCG GGGTACGATC CCGTCGACGC CGATACGAAG GCCGCTCTCG AGGGGAAGAC GTTCGTCTTT CCGCGCGGCC TCACCGTCGA CGCTTTGCGC GACCGACTCA CCCGCCGTCT GCGGGAGGAC TACCGCGTCG GGCACGTCGA CATCGAGATC GGCGAAGACG GGACGGTCAC CCACCTCGGT CTCGGCGCCC GTGCGGCGGG ACTCGGCCCC ACGCTTCCGC CCGAGAGCGC GGCGATGGCG ATCCGCGCCG ATCCCGCCTA CGCAGCCAGC GCCGGCGACC TCGTCCAGGT GTGGGAGCGC GGCCCAAAGC GCCGACTCCT AAACGCCGAG GTCCGGGGAA CGGCCGGCGA CGTCGTCACG CTCGCGATCG ACGCCGCCGA CGCCTCGAAG CTGTCGACCG ACGACCGATA CAAACTTGCC ACCCTCCCCG TCGAGGAACG GGCCGATCGT GAACTCACGG AACTCCTTCG GGCAGCCGAG GAGACGCTCG GCGTCGTCGA GATCGCCGAC GGGAGCCCAC TGGCCGGGGT GCCGGTCGGA GCGCTCGAGG CCACCGTCAT CGCGATTCAC GCCGGTGGCC CGAACGATCA CATCGAGACA CTACCGGCCA GCGATCGCGT CATCTCGGGT GGGGATTCGG TGTACGTCAT CGCCCGGCCG GACGCTATCC GCCGGATCGA GGCGGCCGGG ACATCGGATG GCGGCGGTGA GCACGCTTCG AGCACGATAT CCGAAGATGG TGCTGGAAAC GTGAATCAAG CGGTCGATAC CGGCAGAGCC GGCCAAGCGG TCGCTGCCGG TGGGGAGGAC GAGCAGGCCC ATGCCAGCGG AGAGGACCAA ACAGTCGACA CTGGGCGTGA GACTCCTGAG GGTGATCCGA CGGAGGTGGG GCAGGCGGAT ACTGAACGGG CTGCTCACGA GGATGAAATC GATGGGAAGG ATATGACGGA CGATACAGCG GACGAGGGAG AAGCGGTTGC TACCGAGGGC GAGGAGATAC CGAACGGTAC CGATGGCGAA AACGAGGTCG ACGATGGCGA GCAAGAGAGT CGGAAGGAAG ATACTGGTGG AGATGATCCG GAGGCCGATG ATTCGGCCGA GCCGTCCGAT AATGGGTCCG TCGACCGTAG CCCCGAATGA
|
Protein sequence | MIAVPLQFSV RTSTVLAGAA EIGGLAVLAA AVAGAAAIAY RWYARERVSN PPAVLIALAT VAFFLHAMTA LGQVMDGADP LTVRAAAFNV SALAVGTVAA MLGIAVGDHL ARTVLGGTEQ VALDDSVSRV VRAVGRVITV ELPAEIDDVP GYDPVDADTK AALEGKTFVF PRGLTVDALR DRLTRRLRED YRVGHVDIEI GEDGTVTHLG LGARAAGLGP TLPPESAAMA IRADPAYAAS AGDLVQVWER GPKRRLLNAE VRGTAGDVVT LAIDAADASK LSTDDRYKLA TLPVEERADR ELTELLRAAE ETLGVVEIAD GSPLAGVPVG ALEATVIAIH AGGPNDHIET LPASDRVISG GDSVYVIARP DAIRRIEAAG TSDGGGEHAS STISEDGAGN VNQAVDTGRA GQAVAAGGED EQAHASGEDQ TVDTGRETPE GDPTEVGQAD TERAAHEDEI DGKDMTDDTA DEGEAVATEG EEIPNGTDGE NEVDDGEQES RKEDTGGDDP EADDSAEPSD NGSVDRSPE
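
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below is one hedged way to do that and to compute a GC percentage that can be compared with the GC content row; the helper names are hypothetical, and the demo string is only the first two blocks of the gene sequence, not the full gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First two display blocks of the gene sequence in this record (demo only).
demo = join_blocks("ATGATCGCGG TCCCGTTGCA")
print(len(demo), gc_percent(demo))
```

Pasting the full Gene sequence field into `join_blocks` gives a string whose length and GC percentage can be checked against the values reported in the record.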