Gene Information |
Locus tag | Huta_2391 |
Symbol | |
ID | 8384690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2438843 |
End bp | 2441089 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973464 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003131290 |
Protein GI | 257053457 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
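The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, but both reproduce its numbers. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Huta_2391.
length = gene_length_bp(2438843, 2441089)
print(length)                     # 2247
print(protein_length_aa(length))  # 748
```

Both results match the Gene Length (2247 bp) and Protein Length (748 aa) fields.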
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.999424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCAG ACACGAACGA CGACATCGAC ACATCGACAG CCGGCCCCGT CGAGGCTGAC GATTCTGTCG GCTCGATGGA CCGGCGAGAC TATCTCCAGA CCGTCGCGGC AGCGGCCGCG GCGGCCGGAC TCGGCGCTGC CACAACGGGT GGCGCAGCCG CCGAGACGGC TGACACTTCC TTGAGTATCG ACGAACGGAT CGAAGAACAC CGGACAGGCA CTCTTGAGGT TGTCGTCGAG AACCCCGACG GCTCGACGGT CTCGGACGCC GAGGTCTCGA TCGCCCAGCA GGAACACGCG TTCAGCTTCG GCACCGCCGT CAACGCCGAC AGGCTCGTCA ACGAGAGCGA CCCCGGCGAC AACTACCGCG AGTACGTCCC GGAGCTGTTC AACACGGCGG TGCTCGGCAA CCACCACAAG TGGCGGTTCT GGGAGAACAA CCGGGAGGTC GCCGACGAGG CGACGAACTG GCTGCTCGAT CAGGGACTGG ACATGCGCGG ACACGTCTGT CTCTGGGGTC GGGAGGACGT CGCGGCGATC CCGGACGATA TCTTGACGGC GATCGAGGAA CGCGACGCCG AGACGATCCG CGAGCGTTCG ATGGCCCACA TCGAGGAGAT CATCACCCAC TACGGCGAGG ACATCACCGA CTGGGACGTC GTCAACGAGG CGATGCACGC CTACCAGCTC CAGCTCGGTG TCTACGGCGA CCGGATCGAC ACCGAGGAGC CCTGGAACGG TGAGATCGTC CCGTGGACCT CCCCACTCCT GGCGGCGTGG TACGAGCAGG CCGCGTCGGT GATCGCGGAG CACGACCTCG ACGTCGGCAT CGCGGTCAAC GACTTCAACC AGTTCCCCTA CGCCTACACG GACAACCGCT ACGAGTCGGA GATCGATCAC ATCAACGCCA ACGGGGCACA GCTGGACACG GTCGGCCTCC AGGCACACAT TGCCGCCCGA GAGGGCGAGT TCAATTCCAA CGACGATCCG GACGGCCGGA TCGACGCCGA CCAGGTCGTC TCGGAGATCA ACACGTGGGC CGACCACGGC GCACGCGTGA AGATCACGGA GTTCGACACG TACAACGGCG ACGACTGGAA CTCAGATGAG GAACGCGCCG ACGTGACGGA GAACTACCTC CGGGGTGCGT TCAGCCATCC TGGCGTCGAC GCGTTCATCA TGTGGGGCTT CTGGGACGGC GACCACTGGG AAGACGAAGC GCCGCTGTTC TACGAGGACT GGTCGCAGAA ACCCGCATAC GATGTCTGGA CCGGCCTGGT CTACGACGAG TGGTGGACCG ACGACTCCGG CACGACTGAC TCCAGAGGGG CCTACACCAC GACGGCGTTC CTGGGTGACC ACGAAGTCAC CGTCAGTACC GATAGCGCAG AGACGACCGA GTCAGTCGAA GTCACGGATG CCTCGGGCAC GACGACGGTC ACGGTCACCG TCGCGGGTGA CGGCAGCGCC GCGGACGACA CCCAGCCGCC GTCGGTGCCG ACGAATCTCT CGGTGTCGAC GACGACCGAC TCGACGGTCA CCGTCTCCTG GGACGGCGTG ACGGACAACG GGACCGCCGG GCTGGACCAG TACGTCGTCT CCGTGGGCGG CTCACAGGAC CAGACGATCG GTGCCGGCAT GACGACCGCG ACGGTCGAGG GGCTCGACGC GGCGGCGACC TACGAGATCG GCGTCTCGGC AGTCGACAGT GCGGGCAACG AGTCCGACGC CGCGACCGTA CAGGCCACGA CCGCGGAAGC CGACGACGGC GAAGACGATG AGGGCGACGG CACTGACGAC 
GAGACGCCAG CCGAGGCACT CGTCGTCAAC GACTACGACG GCGACCCGGC GTGGGCGTCC AATCGAAACG ATCTCGGCCA GTGGTGCGGG GCTGGCTCGT TCGAGAACGG TGGCGGCGAG GTCGAGGACG GGGCGCTGGT CCTCGAATAC GACAATGCCG GCTGGTTCGT CGAGCAGCTC AACCAGGACG TCTCGGAGTA CTCGGAACTG GTGTTGGTCC TGGCCGGTGA CGACGTCCAA GCGGACGAGT TCCTGCTGGA CGTGGGTGGC GCTCGCGGGC TCCTCTCTGC GTTCACCGAC GACGCCATCG GAACGTCGGC CTCGACCGTC ACCGTCGACA TGGAATCGGC AGGTATCGAC CCGTCGACCG GGGGGCTCTC GGTTCGACTG AACTTCTGGC AGGGCGGCAG TGGCACGCTC GAAATCGAGG AGATCCGCTT CCAGTAG
|
Protein sequence | MTPDTNDDID TSTAGPVEAD DSVGSMDRRD YLQTVAAAAA AAGLGAATTG GAAAETADTS LSIDERIEEH RTGTLEVVVE NPDGSTVSDA EVSIAQQEHA FSFGTAVNAD RLVNESDPGD NYREYVPELF NTAVLGNHHK WRFWENNREV ADEATNWLLD QGLDMRGHVC LWGREDVAAI PDDILTAIEE RDAETIRERS MAHIEEIITH YGEDITDWDV VNEAMHAYQL QLGVYGDRID TEEPWNGEIV PWTSPLLAAW YEQAASVIAE HDLDVGIAVN DFNQFPYAYT DNRYESEIDH INANGAQLDT VGLQAHIAAR EGEFNSNDDP DGRIDADQVV SEINTWADHG ARVKITEFDT YNGDDWNSDE ERADVTENYL RGAFSHPGVD AFIMWGFWDG DHWEDEAPLF YEDWSQKPAY DVWTGLVYDE WWTDDSGTTD SRGAYTTTAF LGDHEVTVST DSAETTESVE VTDASGTTTV TVTVAGDGSA ADDTQPPSVP TNLSVSTTTD STVTVSWDGV TDNGTAGLDQ YVVSVGGSQD QTIGAGMTTA TVEGLDAAAT YEIGVSAVDS AGNESDAATV QATTAEADDG EDDEGDGTDD ETPAEALVVN DYDGDPAWAS NRNDLGQWCG AGSFENGGGE VEDGALVLEY DNAGWFVEQL NQDVSEYSEL VLVLAGDDVQ ADEFLLDVGG ARGLLSAFTD DAIGTSASTV TVDMESAGID PSTGGLSVRL NFWQGGSGTL EIEEIRFQ
|
| |
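The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content and Protein sequence fields could be re-derived from the gene sequence. All helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Table 11 also reads alternative start codons as M, which this sketch does not handle; that makes no difference here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGACACCAG ACACGAACGA CGACATCGAC"))  # MTPDTNDDID
# Last blocks of the gene sequence, ending in the TAG stop codon.
print(translate("GAAATCGAGG AGATCCGCTT CCAGTAG"))     # EIEEIRFQ
```

The two fragments reproduce the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` should give a value close to the 66% reported in the record.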