Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2394 |
Symbol | |
ID | 8384693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2445728 |
End bp | 2447683 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973467 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003131293 |
Protein GI | 257053460 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.253992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAG ACCGTTCGAC GGAACGAACC GAAACTGACG AATCGACGAC TGAACGAGAC GAATTCACGC AGGAGGGCCC CGAGACGTAT CGGGCAGGGA TATCCAGACG GTCGTTTCTG CAGACGACCG CGGCTGCGGG ACTGGTCGGC CTGGGCGTCG GGAGTGGCGC TGTCGGCTCA GCGGCTGCAG CCGGTATTCC AACGCCGTGG CTCGAAGTCG ATGGCAATCT CCTGCGGGAT CCCGACGGCA ACAAGGTGAT CCTGCGGGGT GTGAACGTTA TCGACCCCGC GCGGGCGGCC AAGGAGTGGC GCAAGAACAT CGAGCCGCTG ATCGAGTTGG CGACCGATCC GGGCGAGGGC TGGCACGCCC ACGTCATCCG GCTCCCGATG CAGCCTCAGG ACATCGGCGA TCATGGTCCG GGGACGGCGG CCCCGACGCC GGGATTCACG CAGGACGAAC TCCAGAATTA TCTCGCGGAG TACGTCGATC CGGCGGTCGA CGCGGCCGAG GACGTCGGCG CGTACATCAT GCTGGATTAC CATCGCCACT ATCCGGAGGG GCCCGACTGG GACTCGCCGG AACTCGACGA GGAGATCCGG TTGTTCTGGA ACGAGGTCGC CCCGCGCTAC AGCGATCGTT CCCACGTCAT CTACGAACTG TACAACGAAC CGAACACGCC GTATCCGGGG GCCGGCGATC CGACCGACGA CGTTGGCGTC ACGGACGCTC GTGCCGAGGA GAACTACCTC TACTGGCGCG AGACGGCCCA GCCGTGGGTC GATCTCATTC GGGAGCACGC GTCCCGGAAC CTGATCGTCA TCGGGTCGCC GCGCTGGAGC CAGTTCACCT ACTGGGCGGG CGAACACGAG TTCGAGGGCG ACAATCTCGC GTATGCGGGC CACGTCTACG CCCACGAGAA CCTCCGGCCG CTATCGACGT ACTTCGGCGA GCCCTCAGAG GAGGTTCCGG TGTTCATGAG CGAGTTCGGG TACGGGACCG AGGGCTCGCC CTACCTCGTC GGGACCAACG AAGTCGAGGG CCAGCAGTTC CTCGACCTCT TCGACGCCCA CGACATCCAC TGGCAGGCCT GGTGTTTCGA CCACACGTGG TCGCCCGGCA TGTTGAATCG GGATTACGAG GTCGACAGTC CCCACGGTCG GCTGTTCAAG GAGCGACTTC GCGAGAAGCG CAACGACGAC CTGCCGGCGA GCGCCGGCGG TGGCGACGAG ACGCCGCCCT CGGCCCCGTC GAACCTCGCC GTGACCGAGA CGGGCAGCGA GAGTGTCGGT CTGGCGTGGG ACGCCGCGAG TGATTCCGGC GACTCCGGCC TCGCCACTTA TGCCGTCTAC CTCGACGGCG CGCTGGATCA TCGGGTCACT GCCGGGACGA CCGCTACAGA AGTCAGCGGC CTGCTGCCGG AGACGACCTA CGAGTTCGCC GTCAGTGCCG TCGACGGCGC GGGCAACGAG TCCGACAGGA GCGGGGTCGT CACCGCCACC ACCGATCCGC CGGCCAGCGA GCGCCTGGTC CTCAACGACT TCGACGGCGA CCCGGCGTGG GCTGACAGTC GCAACGAACT CGGGAACTGG TGCGGTGCCG GTTCCTTCGC AAACGACGAC GGCGAGGTCG TGGACGGCGC ACTCGTCCTC GAATACGACG GCGGCTGGCT GCAGTCGTAC GTTCGCCAGG ATGTCTCGTC GTTTTCGACG CTGAATCTGC AGGTTCGCGG TGCCGACGGT GGCGAGGAGT CGGCCTTCGC GGTGGAACTC GGCGGCGGGG GCGGCGTGCT CGCCGAAATC ACCGACGACA CGATCGGCAC GTCGTTCTCG ACAGTATCGA TCGACATGGC CGCCGCCGGG ATGGACGGGG CGAGTCCCGG CGCGGTATAT CTCGACTTCT GGTCGGGTGA CGGAACGAGT GGAACGATCG AGATCGACGA AATCTGGTTC GAATAG
|
Protein sequence | MTKDRSTERT ETDESTTERD EFTQEGPETY RAGISRRSFL QTTAAAGLVG LGVGSGAVGS AAAAGIPTPW LEVDGNLLRD PDGNKVILRG VNVIDPARAA KEWRKNIEPL IELATDPGEG WHAHVIRLPM QPQDIGDHGP GTAAPTPGFT QDELQNYLAE YVDPAVDAAE DVGAYIMLDY HRHYPEGPDW DSPELDEEIR LFWNEVAPRY SDRSHVIYEL YNEPNTPYPG AGDPTDDVGV TDARAEENYL YWRETAQPWV DLIREHASRN LIVIGSPRWS QFTYWAGEHE FEGDNLAYAG HVYAHENLRP LSTYFGEPSE EVPVFMSEFG YGTEGSPYLV GTNEVEGQQF LDLFDAHDIH WQAWCFDHTW SPGMLNRDYE VDSPHGRLFK ERLREKRNDD LPASAGGGDE TPPSAPSNLA VTETGSESVG LAWDAASDSG DSGLATYAVY LDGALDHRVT AGTTATEVSG LLPETTYEFA VSAVDGAGNE SDRSGVVTAT TDPPASERLV LNDFDGDPAW ADSRNELGNW CGAGSFANDD GEVVDGALVL EYDGGWLQSY VRQDVSSFST LNLQVRGADG GEESAFAVEL GGGGGVLAEI TDDTIGTSFS TVSIDMAAAG MDGASPGAVY LDFWSGDGTS GTIEIDEIWF E
|
| |