Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2396 |
Symbol | |
ID | 8384695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2451361 |
End bp | 2454513 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644973469 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003131295 |
Protein GI | 257053462 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3934] Endo-beta-mannanase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGAG ACAACCATAC TTACGCCGGC GGCGGTGCAG ACCGTCCCGA CGGACGGACG TATCGTCCGG ACGACCGACG GTCGGCACTC GCGGCGTCGC GACGGGACGT CCTCCGCACC ATCGGTGCCG GGGCGCTGCT GGGCTCGATC GGGACGGCAC GCGTTCAGGC AGCACCCGGG GACCGCGAAT TCGTCGCCAC CGACGGCCCG GAGTTCACCG TCGGCGGTGA GCCGATCTAC TTCAGCGGGA CGAACAACTT CTGGGTGACC GATCCCTACA GCGATCGCTC GCGGATCGAC GACGTCCTCG CGCTGTGTGC GGACCTGGAT CAGAATCTGC TGCGGACCTG GGCGTTCTGT GCGGGCGAGG GCGGCCAGTG TCTCCAGCCC GAACCCGGCG TGTTCAACGA GGCGGCGCTG CAGCACCTCG ATTATCTCGT CGCGAAGGCC GGCGAACACG GGGTACGACT CATCCTCTCG CTGGTCAACA ACTGGGACGA CTACGGCGGG ATGGCCCAGT ACATCGAATG GGCGGACGGC GCAAGCGAGC ACGGCGACTT CTACGTCAAC GAGGCGTGTC GTGAACTCTA CCGGACTCAT GTCGAGACGC TCCTCACGCG GGAGAACTCG ATCACCGGCG TCGAGTACCG CAATGACCCC GCGATCGCGA TGTGGGAACT CGCCAACGAA CCCCGACTGG AGGACGACGA CACGGAGACC ATCGACGACC GGGAGGCTGC CCTCACCGAA TGGTTCGCCG ACATGTCCGG GTTCATCAAG GATTTCGACG ACAACCACCT GGTGACGACC GGGCTGGAGG GCTTTTACAC CCGCGCGGAC GGGCCAAACT GGATGTACGG TGACTGGACC GGCCAGAACT TCATCGCCCA CCACGAGATC GACACGATCG ACGTGTGTTC GTTCCACCTC TATCCGTACC ACTGGCCCGG CATGGGACTG GCGGGCCAAC TCGCCGAGGA CGACGTCGTC AGTGCCGTCG AGTGGATCCG CGAGCACGCC GCCGACGCCC GCGAGACGCT CGAAAAGCCC GCGCTGCTTG GCGAGTTCAA CGTCAACGTC CAGGAACACG ACCTGGCGAC GCGAAACGAT CGGTTGCGGG CGTGGTACGA CGCCCTCGAC AGCCAGGACG CGGGCGCGGC GGCGATCTGG CAACTGGTGC TCGAGGACAC CGAGGACCAC GACGGCTTCC AGGTCTACCG GAGCGAGTCC GGTGACATCC TCTCGGGGTA CGCATCGACG ATCCGCGAGA AGTCCGGGCA CAGCGACGGG ACGCCGACGG CCGACGCGAC GGCACCTTCT TCGCTCCGAA TCGGCGAGTC CGGCGATTTC AGCGGCACCT ACTCCTTCGA CCCGGACGGC TCGATCGCCG CCTACGACTG GGCCTTCGAC GACGGCGCGA CGGCCACCGG CGAGCGGGTG GCCCATCGCT TCGCCGAGAC CGGGTCCCAC GAGGCCGAAC TGACCGTTAC CGACGACAGC GGCGCGACTG ACGCCGACAT CGAGTCGGTT TCCGTCGAAG GCATCCCGGA AGACTCGTTC CTCGTCGAGG GCGCGGGAGA GACGTTCCAC CGCGACACCA AGCAGTGCCA CTTCGCGTCG ATGCCCGCGT CGGGCGACGT GGCGGTCACG GCCCGCGTCG CGGATCTCGA ACCGGTCGAT CCCGAAACCC AGGCCGGTGT GATGGTGGCC GACGATCCGG ACGCGCCCGG CGCGCTCGGT GCCGCCACGA TCACGCCCGG CGAGGGGAGC GAACTGACGC GGGCTTACGA CTCGACGGTG TGGCGCGAGC GTGCCGGCGA CGATCGCACG CCGCCGATCT GGTTGCGCGT CAAGCGGTCG GGATCGACAG TGTCGGCCTC GGTCTCGCCG AACGGCTCGG ACTGGACGGA GATCGGCTCC GGCGACGTCG ATCTCCCCGA TGATGTCCAC GTCGGGCTGT TCGTCAGCAG CAACGCCGCC GGCGAACTCG CCGCCGCGCG CTTCGACGAG GTTGATTGGC TGGAGGACTG GACGGCGACC GACGTCGGCC CCGTTTCGGT GGCCGGCGCG ACGACCGCCG GCGACGGCAC CACTGACGAC GGTGATGGCG ACGAGGACAC GACGCCGCCG ACGGCGCCCG GCGATCTGAC AGTGACCGAG ACGACGGACT CCTCGATTTC GCTCTCGTGG GACGCCGCCA CCGACGACGG TGGGTCGGGC CTTGCCCACT ACGATGTCTC CGTCGACGGC GCGCTCGACC AGCAGGTCCC CGCTGGCACG ACGACCGCGA CGGTTGAGGC CCTCGATCCC GGGACGGCCT ACGACATCGG GGTGTCAGCT GTCGACGGCG CGGGCAACGA ATCCGGGACC GTGACGGTGA CGGCGACGAC CGGGGACGGC GACGACGAGG CACCGACGGC GCCCGCCGAC CTGACGGCGA CCGAAACAAC GAGTTCCTCG GTCTCGCTCT CTTGGGACGC CTCGACGGAT TCGGGCGGCT CCGGGGTCGA GCAGTACGTC GTTGCCGTCG ACGGCGAAAC GGCCCACACC GTCGAGGCCG ACACAACGAG TACGACCGTC GAGGAACTGG ACGCCGAGAC GACCTACGAG CTCGGCGTCT CGGCGGTCGA CGCGGCCGGA AACGTGTCCG ACCCGGCCGT CATCGAGGTG GCGACCGCCG AGGGCGACGA TAGCGATGAG GAACCGCCAG AAAATGCCCT GGTCGTCAAC GACTACGACG GTGATCCGGC GTGGTCCAGC AATCGCAACG ACCTCGGGAA CTGGTGTGGG GCCGGCTCCT TCGCAAACGG CGGTGGCGAT GTCGAAGATG GCGCACTCGT CCTCGAATAC GACAACGCCG GGTGGTTCGT CGAGCAACTC AACCAGGATG TCTCCGCGCA CTCCGAACTG GTGTTCGTCG TCAGTGGTGC GAGTGGCGGC GAGGGCGATC ACTTCGTCGT CAGCGCCGGC GGTGTCCGCT CGCGGTTCAG CGACGTGGCG GACGGGTCGA TCGACACCGA TCCGAAGCCG ATCGCGATTG ACATGGAATC GGCAGGGATC GACGCCACGT CGCCCGGGGA ATTGCGTTTG AACTTCTGGC AGGGTGGGTC CGGAAGTGGG GCCCTCCGCA TCGAGGAGAT CAGACTGGAG TAA
|
Protein sequence | MARDNHTYAG GGADRPDGRT YRPDDRRSAL AASRRDVLRT IGAGALLGSI GTARVQAAPG DREFVATDGP EFTVGGEPIY FSGTNNFWVT DPYSDRSRID DVLALCADLD QNLLRTWAFC AGEGGQCLQP EPGVFNEAAL QHLDYLVAKA GEHGVRLILS LVNNWDDYGG MAQYIEWADG ASEHGDFYVN EACRELYRTH VETLLTRENS ITGVEYRNDP AIAMWELANE PRLEDDDTET IDDREAALTE WFADMSGFIK DFDDNHLVTT GLEGFYTRAD GPNWMYGDWT GQNFIAHHEI DTIDVCSFHL YPYHWPGMGL AGQLAEDDVV SAVEWIREHA ADARETLEKP ALLGEFNVNV QEHDLATRND RLRAWYDALD SQDAGAAAIW QLVLEDTEDH DGFQVYRSES GDILSGYAST IREKSGHSDG TPTADATAPS SLRIGESGDF SGTYSFDPDG SIAAYDWAFD DGATATGERV AHRFAETGSH EAELTVTDDS GATDADIESV SVEGIPEDSF LVEGAGETFH RDTKQCHFAS MPASGDVAVT ARVADLEPVD PETQAGVMVA DDPDAPGALG AATITPGEGS ELTRAYDSTV WRERAGDDRT PPIWLRVKRS GSTVSASVSP NGSDWTEIGS GDVDLPDDVH VGLFVSSNAA GELAAARFDE VDWLEDWTAT DVGPVSVAGA TTAGDGTTDD GDGDEDTTPP TAPGDLTVTE TTDSSISLSW DAATDDGGSG LAHYDVSVDG ALDQQVPAGT TTATVEALDP GTAYDIGVSA VDGAGNESGT VTVTATTGDG DDEAPTAPAD LTATETTSSS VSLSWDASTD SGGSGVEQYV VAVDGETAHT VEADTTSTTV EELDAETTYE LGVSAVDAAG NVSDPAVIEV ATAEGDDSDE EPPENALVVN DYDGDPAWSS NRNDLGNWCG AGSFANGGGD VEDGALVLEY DNAGWFVEQL NQDVSAHSEL VFVVSGASGG EGDHFVVSAG GVRSRFSDVA DGSIDTDPKP IAIDMESAGI DATSPGELRL NFWQGGSGSG ALRIEEIRLE
|
| |