Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2346 |
Symbol | |
ID | 8384645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2388477 |
End bp | 2389904 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973419 |
Product | hypothetical protein |
Protein accession | YP_003131245 |
Protein GI | 257053412 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCC GTTCGCAGCT GGCCGCCCTG GCCCTGGTTG TCCTCCTCGT GACGGCAGGG TGTACGACCA CCACGCAGCC GACACCCTCA CCGGCCCAGG AGATATCGCC GACCGAAACC CCGACCGAAG CGCCTGAGAC GCCGGAGACG GGATCGGCCG AGCGAGCAAC CACGTCCGAG AGAGACCAGA CGACAGCAAC TGAGACGACA GCGCCAACGC CGACGGTGAC CGAGACACCA ACCCAGACGA CAACTGAAAC CCGGCCAGAA AGCCCGACGG AATCAGCGAC GCCGACCGAC ACACCGACAG ATAGCGATAC AACGACACCC GAACCGACCG AGACCGAAAA CACCGCCACG GAACAGAACT CCGTGACCAT CGAAGGATCA CTCCCGGTCG ACGCCGATAC CGTGTTTCGG CGCGTCGAGA CGTTGATGGG GGAGTCGGTC GAGGAGCCGC GTGTGGTCGT CGAGCGGGGG CCGTTTTCGG TCCCGAACAG GGTCGCCGAC CGGACGGTCC CACAGGTGTT CGGCCTGACT GGCTGGGAGG CGAAGAACAC GACCCACGGA ATAACGAGAG CGGGCGGCCG ACTCGTCCAG CTGTTCCCCG GGAACGCGAG CGAAATGTCG ATCGAACCGG TCCTGGCACA CGAGTACGCC CACGTCATCC AGTTCCGGAG CGCACTGCCA CTGATCGAGT TGCAACGCGA TGCTGAGACG ACTGACGAGC GGATCGTCGC CGGCGCACTC ATCGAGGGCG GGGCCGTCTA CGTCGCCGAC GAGTATATCG CGACGCACAT GCCCGAGGAA CCGAGCCAGT TGTCCGTCCT CGCCGAGCAG TATCGCCGGG CGACACCGGG GTATCGCTTC TTCCGGTCGC GGTATTACTT CGGCGCGCAG TACCTCCACG ACAGGCTTGA CTCACCGCGG AATCTCTCGG ATGCATACCA CAATCCACCC GAATCAACCG AGGCACTCAT CCACAACGGC AGTGTCGACC CCGAACCGGA CCTCACCGTG AATCTCAACA CGTCCGATTC CTGGAACCGT GCCCTGTTCC AAAACCGCAA CTGGAACGAC ACGATGGGTG AACTGCTGGT CCGGAACGTG CTCCGGAGCG AGTTGAACCG CTCGCGAGCG GCCGCCGGTG CCGAGGGGTG GGGCACGGAC CGGCTGTTCG CCGTCTCGAA CGGATCCCGA AATGCCATCG CCTGGGGGCT CCGGTGGGAT ACACCCGATG ACGCGGCTGA ATTCGCCGAC GCGTTCGGTG CCTACCGCGA GCGCCAGGGG CCGACGACGC CGTACGCCTA CCGGCTGAAG CGATTGGGAT CGGAGACGAC CGTCGTCATC GCCGGGCCCC CGGACGTGCT TGGGGCCATG ACCGTCTCAG GGTCGAATGC CACGGCGACT GTGACGATTG CCGGCTGA
|
Protein sequence | MSPRSQLAAL ALVVLLVTAG CTTTTQPTPS PAQEISPTET PTEAPETPET GSAERATTSE RDQTTATETT APTPTVTETP TQTTTETRPE SPTESATPTD TPTDSDTTTP EPTETENTAT EQNSVTIEGS LPVDADTVFR RVETLMGESV EEPRVVVERG PFSVPNRVAD RTVPQVFGLT GWEAKNTTHG ITRAGGRLVQ LFPGNASEMS IEPVLAHEYA HVIQFRSALP LIELQRDAET TDERIVAGAL IEGGAVYVAD EYIATHMPEE PSQLSVLAEQ YRRATPGYRF FRSRYYFGAQ YLHDRLDSPR NLSDAYHNPP ESTEALIHNG SVDPEPDLTV NLNTSDSWNR ALFQNRNWND TMGELLVRNV LRSELNRSRA AAGAEGWGTD RLFAVSNGSR NAIAWGLRWD TPDDAAEFAD AFGAYRERQG PTTPYAYRLK RLGSETTVVI AGPPDVLGAM TVSGSNATAT VTIAG
|
| |