Gene Information |
Locus tag | Huta_0473 |
Symbol | |
ID | 8382740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 468510 |
End bp | 470507 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644971535 |
Product | hypothetical protein |
Protein accession | YP_003129393 |
Protein GI | 257051560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
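The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 468510
end_bp = 470507

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length_bp % 3 == 0
codon_count = gene_length_bp // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1998 bp) and Protein Length (665 aa).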
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACC GTGGGTTCAC CAAGTGGCTA TCTATGGATC ACGACGTAGA TATTTCCGCC GCGAAGGTCG GGATCGCCGC CTCGATCCTC CTGCTCGGAC TTCGGACGCT GGCCGCGCAG GCGTTCCTCC TGGTGATCCC TCTCGCGGCG GGTGTCGGGA GCGTCGTGTA TCTCATCAAC CGGCCGGATC GGGTGTCGAA CCTCGTCAAC TGGCGGAGCC GGCGACTCGA AACCGCGTCA GTCACGCTCC CGAAGTCCGT CGCCCAGTTG CTTCCCACGC TCACGTTCGT GGTGCTCGCC GGGTTCGTCG TGGCGATTCA CCGCGCTGGT ACCCGAACGA ACCTCGTCTA CCTATTGACC GCTACGGTGG GTGTGGCCAT CCTGGCCCAG TTACTGTTCG TCGACGACGG CCATCTCACC CCGGCGATCA TTCTGTTCGA GATACTCGTC GCCGGCACCG TCCTTCGGCT CGCTCCGCTG TACGTGACGC CGGGGTACGT CGGCATCGAC AACTGGACGC ACGCCACCGT CTTCATCGAC GGGATCGTCC GCACGGGATC GCTCGGCCCG CTCGCCGAAA GCAAGTATAT CATGGCCCCG ATATATCACA TCATCGGGGC GACCGGCGAA CTCTTCTTCG GCAGCACCCG CGACGGTCTC TACCTGACTC TTGGGCTGTT TGTCCCGCTC TCGGCGCTGT TCATCTACGG GACCGGCAAG CTCCTCATCC CCGAGCGATG GGCGCTGCTC GCGACGGCGT TTCTCGTCTT CTCCGAACAA TTCATCCGGT GGGGGATGTA CATTATTCCG ACAAGTCTCG GACTTGCGTT GTTCCTCGTG GTCTTTTACG CCGTGACGAG GATTTTCGTG GGATACACGG AACGGTGGGT CGTGGCCCTG CTGCTCGCGG CGAGCCTTGC GATCGTGTTC ATCCACCAGG TATCGACCGC GGTCACGATC GTGTTCCTGG GGATTGCCAC GCTCGTCGCC GTCACGCTCG GTCTGACAGG ACGGTCTGCG TTCGAGGGCG ACCGAACGGC TGGTGCACTC GGTGGTGTCT TTCTCGTCAC GCTCGTGGTG ACTGTCGCGT CGTGGGCGAA CACGCCCTTC CCGGGGCAGG GAACGTTCCT CTGGACGGAA ATCTCCGTCG TCCAGGAGGC TATGTCGACG AAGGCTGGGT TCTTGAACCT TGCCAGCACG GGGTCTGAAG CGTCCCAGAT GATCGGGGGG CCTGTCGAGA CCGGGACGTT GTTGGCCAAG ACCGTTCCGT ATATCGAACT GCTCGGCTTC GGACTCCTGT TGCTGGCGGC GGTCGTCGGC GGGCTTCACA TGCTCGGCTG GAAACACGTC CCGGACCTCA CGTACACCTA TCTGCTGGCC GGGGGCGGAC TGTTCGTGGC CGTCTTCGGC CTGTCTTTGT TCGGCTTTCG CGCACTCCTG CCCACTCGCT GGATCGCGTT CCTGTACGTC TCGATGGCCC TGCTCGGTGC CATCGGGCTG TACTATCTTT CCCGTTCCGG ACATCGTCGG GTTGTCCTCG TGGTGTTTGT CCTCGTCTCG GTGGGCTACC CGACATCGAT GGCCGTCGCC GAGAAAGCCA CGCTCGATAA CCCGGTCTTC GATGACCAGT TCAAGCGCTT TGCCTACACG GAAGCGGAGA TCGCGACTGT GGATACCATC CGTGAGATGA AGCCGCCTGC CTCCGGCGCG ACGGTCGCTT CGGATCACCC TTACATCGGT TTAATCGAAC GGTATGGGCG ATACGAAGAG CGTGCTATCA ATCTCGAACT GACTCAGAAG 
GGTGCAGCAA CGACGGCGGA CGCGGTCATC TACCGGGAGT ACCAATCAAG CGGTCCAGTC ACGTTCCACC GGGCGGAGGG GTCCGATCTG ACACAAGTAC CGGCAGCTGT CGAGACTGCT GTTTGTCCGC CCGACTGGAA CGTCGCCTAC GCGAACGACC AGGCCAAAAT CTGCACGCCC ACTGGAGGAA CACCGTAA
|
Protein sequence | MSNRGFTKWL SMDHDVDISA AKVGIAASIL LLGLRTLAAQ AFLLVIPLAA GVGSVVYLIN RPDRVSNLVN WRSRRLETAS VTLPKSVAQL LPTLTFVVLA GFVVAIHRAG TRTNLVYLLT ATVGVAILAQ LLFVDDGHLT PAIILFEILV AGTVLRLAPL YVTPGYVGID NWTHATVFID GIVRTGSLGP LAESKYIMAP IYHIIGATGE LFFGSTRDGL YLTLGLFVPL SALFIYGTGK LLIPERWALL ATAFLVFSEQ FIRWGMYIIP TSLGLALFLV VFYAVTRIFV GYTERWVVAL LLAASLAIVF IHQVSTAVTI VFLGIATLVA VTLGLTGRSA FEGDRTAGAL GGVFLVTLVV TVASWANTPF PGQGTFLWTE ISVVQEAMST KAGFLNLAST GSEASQMIGG PVETGTLLAK TVPYIELLGF GLLLLAAVVG GLHMLGWKHV PDLTYTYLLA GGGLFVAVFG LSLFGFRALL PTRWIAFLYV SMALLGAIGL YYLSRSGHRR VVLVVFVLVS VGYPTSMAVA EKATLDNPVF DDQFKRFAYT EAEIATVDTI REMKPPASGA TVASDHPYIG LIERYGRYEE RAINLELTQK GAATTADAVI YREYQSSGPV TFHRAEGSDL TQVPAAVETA VCPPDWNVAY ANDQAKICTP TGGTP
|
| |
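The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how one might strip the block spacing and compute a GC fraction to compare against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used as sample input only.
first_blocks = "ATGAGCAACC GTGGGTTCAC"
print(clean_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content. The result for a short fragment like the one here is not expected to match the whole-gene figure.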