Gene Information |
Locus tag | Huta_2388 |
Symbol | |
ID | 8384687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2432896 |
End bp | 2435202 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973461 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003131287 |
Protein GI | 257053454 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.52789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGATA ACGACACGTA CGACGGCGGC GAATCGACGA CGAACGACAG TCGAATCATC GACGATGTCT CGCGGCGAGA CGTTCTCAAG GCGGCGGGGG CGAGCGCACT CACGGCCGGG TTCGCAAGCA GCATCGTCGG CTCGGTTTCG GCGGCCGGCA TCCCGACGCC GTGGCTCGAA CGCGACGGCA ACCTGCTTCG GGATCCCGAC GGCAATCAGG TCATCCTTCG CGGGGTCAAC ATGGCCGATC CGGCCCGGCT GGCGCGGTCC TGGCGGAGCA AGGATTCGAT GGGCGTTTTC GACAAAGCCA CGAACACTGA CGAGTCAAAC GACGGTGGCT GGCACAACAA CATCCTCCGG GTTCCGACCC AGCCACAGGA CATCGGGGAC GCCGGGTCCG GGAGCATCGG CAGTATGCCC CACGGAGACG ACTGGGGCCC GCTGCTGCCC GGCCAGATCG ACGAGTCGGA TCTGGAGACC TACTTCTCGG ATTACATCGA CCCGATCGTC GACGCCGCCG AGGAGGAAGG CCTCTACGTG ATGATCGACT ATCACCGCCA CTTCCCGATC TTCCACCAGC CTCAGCACGA GGAGGATCTC GGTGACTATC AGTGCGGGAA CGAGTCCTTC GAAAACGACA TCGGCTTCTG TGGCGAACGT GGTGTGCTCT GGCACTCAGA GGAGCAGGCC TCCCAGCTCG ATGGCTACAC CGAGGAGTAC GCCGCCGAGC TCAACCAGGA GCTCCAGATG TACTGGAACT TCGTCGCGCC GCGGTACAAC GACCGCTCGC ACGTGGTCTA TGACATCTAC AACGAGCCGA CTGGTCCCTA CGCCGGTGAC TGGGGCTCGC CGACCGAACT GCCCGCGACC GGCGAGGAGG GTGAAGAAAA CCCCTCATAC GACGCCGATG CGAACCAGGA GTACTGGGAC ATGTGGGTCG ACCGTGCCCA GCCGTGGGTC GACACGGTCC GCGAACACGC GCCCGACAAC CTCATCACGA TCGGCTCGCC GCGGTGGAGT CAGCTCACCT ACTGGGCACC GACCAACGAG TTCGACGGCG AGAACATCTG TTACACCGGC CACGTCTACA CCCACGAGGG GATGCGACCC CTGTCGGACT CCTTCGGCAC GGCCGCCGAG GAGGTACCGA TGTTCTTCAG CGAGTTCGGC TGGGCCGAGG GCGGCGGTCG CGACGGCTTC AGCTTCCTCG AAGGGACGAC CTCCGAGTAC GCCGACGGCT TCGAGACCTT CCTCGATGAG TACCCGGTCC ACCCGATCTG CTGGAACTTC GATCACACCT GGGAGCCCTC TTTCTTCGTC CACGACGAGA GTCAGGACGG CGACTGGGTC ATCCACGACT ACGAGGCCCG TCCCGCACAG TGGTGGCAGG AGTATCTCTA TGAGAACCGG AATGACGACC TGCCGGGCAG TGGCGGCGAC GATGACGACA CGACGGCACC ATCGATCCCG TCGAACCTCA CCGTGACCGA CGAGACGAGT TCCTCGATCA CGGTCTCCTG GAGCGCTTCC ACGGATTCGG GCACGGCTGG ACTCGCGCAG TACAACGTCC TCGTCGACGG CTCGCTGGAG CAGACGGTCT CGGCCGGCAC GACGAGTGCG ACTATCTCCG GACTGGCCGC CGACACGTCC TACCAGATCG CGGTCTCGGC CGAGGACGGT GCCGGTAACA CGTCCGGCAC GACGACGATC ACGGCCGACA CCGACGCCGG CAGCGACGAC GGTGACACGC AAGCGCCGTC GGCCCCGTCG AACGTCTCGG TCGAGTCGAC GACGGAAACC 
TCGGTCGAGG TCTCCTGGAG CGCATCGACG GATTCGGGCG GTTCCGGTCT CGACAGCTAC GTCGTCTCCG TCGACGGCTC CCAGGACCGG ACGGTCCCGG CCGGCACGAC GAGTGCGACG GTCGACGGCC TCTCCGCTGG AACGTCCTAC CAGATCGGGG TCAGCGCGGT CGACGGTGCG GGCAACGAGT CCGCCGCGAC GACCGTCGGG GCCACGACGA GCGAATCCGA CGATGACGAC GGGACGTCCG GGGAGCCGAT TGCGACGATC GATCCGGGGA CGACCTCGGC CTCGACCGGC GACCTGGTCC AGTTCTGGAT CTCCGACGAG ACCGGTAACC AGACCTGGAT CACCGGGCTG GAGTGGGAAC TCGGCAACGG CACCACCGGG CGCGGCTGGT ACACCGACGA GCGCTACCAG TCGACAGGTA CCTACACGGT CACGCTGACC GCGACCAACA ACGAGGGCGA GACGTCCACC GACGAAGTCG AGGTGACCAT CTCCTGA
|
Protein sequence | MTDNDTYDGG ESTTNDSRII DDVSRRDVLK AAGASALTAG FASSIVGSVS AAGIPTPWLE RDGNLLRDPD GNQVILRGVN MADPARLARS WRSKDSMGVF DKATNTDESN DGGWHNNILR VPTQPQDIGD AGSGSIGSMP HGDDWGPLLP GQIDESDLET YFSDYIDPIV DAAEEEGLYV MIDYHRHFPI FHQPQHEEDL GDYQCGNESF ENDIGFCGER GVLWHSEEQA SQLDGYTEEY AAELNQELQM YWNFVAPRYN DRSHVVYDIY NEPTGPYAGD WGSPTELPAT GEEGEENPSY DADANQEYWD MWVDRAQPWV DTVREHAPDN LITIGSPRWS QLTYWAPTNE FDGENICYTG HVYTHEGMRP LSDSFGTAAE EVPMFFSEFG WAEGGGRDGF SFLEGTTSEY ADGFETFLDE YPVHPICWNF DHTWEPSFFV HDESQDGDWV IHDYEARPAQ WWQEYLYENR NDDLPGSGGD DDDTTAPSIP SNLTVTDETS SSITVSWSAS TDSGTAGLAQ YNVLVDGSLE QTVSAGTTSA TISGLAADTS YQIAVSAEDG AGNTSGTTTI TADTDAGSDD GDTQAPSAPS NVSVESTTET SVEVSWSAST DSGGSGLDSY VVSVDGSQDR TVPAGTTSAT VDGLSAGTSY QIGVSAVDGA GNESAATTVG ATTSESDDDD GTSGEPIATI DPGTTSASTG DLVQFWISDE TGNQTWITGL EWELGNGTTG RGWYTDERYQ STGTYTVTLT ATNNEGETST DEVEVTIS
|
| |
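The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; if the stop codon is part
    of the CDS (assumption, see lead-in), it is not counted as a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values copied from the record for Huta_2388.
start_bp, end_bp = 2432896, 2435202

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 2307, matches "Gene Length"
print(length_aa)  # 768, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2307 bp) and Protein Length (768 aa) fields of the record.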