Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2131 |
Symbol | |
ID | 8384425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2170771 |
End bp | 2173716 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973200 |
Product | Laminin G sub domain 2 |
Protein accession | YP_003131031 |
Protein GI | 257053198 |
COG category | [R] General function prediction only |
COG ID | [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACGG GCCGCGACGA GGTCGAGTCG TTCCTCGAAG CCAATCCAGA GCTGGAAGCC GAACTCGAGG CACTCCTCAC AATCGACGCC CGCGGCCCGT GGGAGTTCGA CGACATCCCG CTCGATTCAG GCGCGTTCGG TGAATGCGTG TCCCGGGGGA TCGCGGTCGA ACACGACGAC GGGGGCTATC GCCTCGCCAA CCCTGACGCG GTGCGGGCCG CACTCGACCA CGACGTCGAT TCCGCGGCCG AGTCCGAACA GTCGTCCAGG ATCCCGGACG TCAATGTTAC ACTCGATGTC CCGCGGGAAA CGGTGCTGGC CTTGCTCGGC GCCCTCTCGC TACTCGTCGC ATTCCGTGTG GTCTTCGTCT ACCAGGGAGT TTTCCGGGAG GTGATCACGC TGCTGGGGAA CGACCCGTAC AAGCGACTGT ACTGGGTCGA ACAGCTCCAG GCGTTCCCGG CATTCGATCC AGGTGCGCTC GGCTCGATTC CGGAGGGCGT CCGCCTCACG GGTGATACGA TGATGCTCGT GACGCTCTGG TGGGTGTCAG AGCTGTTCGG TGGGACCCCG GCGGCGGCTC GCGTTACGCT CGCGATCTAT CCGGTCGTCG CTGCCGTCGT GACTGGGCTG TTCGTCTACG CGACGACGAA GCTCCTGACG GACGACATTC GGATCGCGCT TGCGAGTGTG CTCATGCTCG CGGTGACGCC CATCAACGGA TACCGGATGG CACTCGGGTT CGGCGATCAT CACGCCTTCG ATTACGTCTG GATCGCGCTG ACAGTGCTGC TCGTCGTCTG GTGGGAACGC TCGGCCGACG GGATGGCCAC TGGCCCGATC CGGAACCGAC TCCGGTCGCG ACGCTTGTGG GTAGCCATAG CAGCACTCGG GGGCACGCTT GCGGTCCAGA CGATGAGCTG GGTCGGTTCC CCGATGGTGT TGGTCCCCTT CGGTCTGCTG ATGTTCGTTC GGGGGGCAGT TGCGTTCCGG CAGTCGCGTT CTCCGGCAGT TGCGGCACTG CCGTACCTCG GCGGAATCGG CGTCGCCAGC ACGATCGTCG GCACAATGCA TCTCGCGTTC GGGTGGTTGC CGGTCTCGCG GGTGATCGTG CCGCTAGCCG TGTTCGTCCT CGGGGCGGGA GCAGTCGGGT ACTTCGAGGC CTGTCGGTCA CTATCGTTGT CAGCGACAGT GACGGTGGCA AGTAGTGTCG TCTTCTCGGT AGTTGGGACG GTCGGGTTCG CCCTCGTTGG ACCCGGAATG GACGCCGTCG TCGAACGAGC AGTTGGGTTG TTCAGGGGGA AACAGATCGC GGAGTCAGCC GGTCTGTTCA GTGCCGAATC CGGACTGGTC GTCCAGCCGA TCCTGTTCTT CGGATTCGTG CTGTTTCTCG CGTTGCCGTT CATCGTCCTC GGCGCGTGGC ACGTCTTCGA GAACAACGAC CCACGCTGGA CGGCACCCGT GGTCTATGCG GTGTACTTCC TGTTCTGGGC TGGCGTGAAA GTTCGGTTCG GGGGGCCGCT GACCATATTC ACGGCAATCT TTGGCGGCAT CGGGTTCGTC AAAGCTGCCG CATGGGTCGA CCTTGCCAGG CCAGTCACCA GCTTCACCGA GAAAACGCCG ATCGCCCGTT TCGAACGACC CGAGGGCGGG CAGCTGGTGT CGCTGTTCCT GCTGTTTCTG CTGGTGGGTA GTCTCGGGAT GGTCCAGCTC CCCATCAAGC AAAGTCAACT CGTCGTTTCC GAGGATACCT ACGAGACGGC ACAGTGGATC GACGGCTACA GCGACGAACG GAACCTGGAG TATCCCGAAA ACGGCGTCTT CACCGGGTGG AGTTCGACGC GCATCTACAA TTACTTCGTG AACGGCCACG CGGACTCCTA CTGGTTCGAA CAACAGTATT TCGAGTCGTT CCTCGGGTCG ACTACTCCGG ACGCGTGGTA CGAGCGACTG CGGGACCGGT ACGGGTTCGT CGTCTACAGC AGGCCGATGA ATGGGTCGAG CGGCCCGACC GTCGAAAGAC AGCTGGAAAC CGGGGACTCC ACGCCGGGTT TCGCCCAGTA TCGTCTGGTC CATACGAGCG GTCCGAAACG GGTGTTTCAG CTGGTCGAAG GAGCGACGAT CGTCGGGATC GACCGGACCA GTGACACCGT CACGGCCGAG ACGAGTCCGA CCGTTTCGGG GAAATCCCAT ACCTACGAGC GGAACGCAGC ACCCAACCCC TACGGAACGT ATGCGGTGAC CGTCCCGTAC CCTGGATCGT ATTCGATCGC CGGCGACCAG GTCGACGTTG CTGCCTCCGC CGTCGAGAAC GGGACCAGAG TGGTACGCCA CGCCAGAGAC GGGCTTGCCC ACTGGCCCTT CGATGCGACC GACGGAACCG TCGCCTACGA CCGGGTCGGC GGCATTCAGG GGGACATATC GAACGCGACG GTAGCCGAAA ACGGCGTCAA CGGCACTGCC CTCGAATTCA CCCGCGAGAA CGACAGCCAG GTCCGGGCGG CCGTCGAGTC GCCCCCGGAG TTCACGGTAA GCATGTGGCT CAAGCCCCAG GCGCTGGACA CGACCGAGGC GAACGATTAT CGCATCCTGG CGCGGAGTGG ACGCGGACTG GTGCTCAACG TCGAGGAGAG TGGACGCCTT ACCTTCCGGC TGCCGGGAAC TGACGCGAAG GCACTGGGTG GTGGGTCCGT CCCGGTCGGC AACTGGACGC ACGTGGCCGC GACCTACGAC GGCAGCCAGC GAACGCTGTA TGTCGACGGG GCGGCCGTTG CGACCGACAC CGTCGATGTC GGGCCTCCCT CCTGGGGTGG GCAACTGACG TTCGGCGGAG GTGGTGACCC GACGCATACG TTCGACGGGA CGATCGACGA GATCCGGCTC TATGAGCGTG CGTTGAACGA CACGGAACTT TCGGCCCAGG CAGTACAGTC CCGGAACGAT CAGTGA
|
Protein sequence | METGRDEVES FLEANPELEA ELEALLTIDA RGPWEFDDIP LDSGAFGECV SRGIAVEHDD GGYRLANPDA VRAALDHDVD SAAESEQSSR IPDVNVTLDV PRETVLALLG ALSLLVAFRV VFVYQGVFRE VITLLGNDPY KRLYWVEQLQ AFPAFDPGAL GSIPEGVRLT GDTMMLVTLW WVSELFGGTP AAARVTLAIY PVVAAVVTGL FVYATTKLLT DDIRIALASV LMLAVTPING YRMALGFGDH HAFDYVWIAL TVLLVVWWER SADGMATGPI RNRLRSRRLW VAIAALGGTL AVQTMSWVGS PMVLVPFGLL MFVRGAVAFR QSRSPAVAAL PYLGGIGVAS TIVGTMHLAF GWLPVSRVIV PLAVFVLGAG AVGYFEACRS LSLSATVTVA SSVVFSVVGT VGFALVGPGM DAVVERAVGL FRGKQIAESA GLFSAESGLV VQPILFFGFV LFLALPFIVL GAWHVFENND PRWTAPVVYA VYFLFWAGVK VRFGGPLTIF TAIFGGIGFV KAAAWVDLAR PVTSFTEKTP IARFERPEGG QLVSLFLLFL LVGSLGMVQL PIKQSQLVVS EDTYETAQWI DGYSDERNLE YPENGVFTGW SSTRIYNYFV NGHADSYWFE QQYFESFLGS TTPDAWYERL RDRYGFVVYS RPMNGSSGPT VERQLETGDS TPGFAQYRLV HTSGPKRVFQ LVEGATIVGI DRTSDTVTAE TSPTVSGKSH TYERNAAPNP YGTYAVTVPY PGSYSIAGDQ VDVAASAVEN GTRVVRHARD GLAHWPFDAT DGTVAYDRVG GIQGDISNAT VAENGVNGTA LEFTRENDSQ VRAAVESPPE FTVSMWLKPQ ALDTTEANDY RILARSGRGL VLNVEESGRL TFRLPGTDAK ALGGGSVPVG NWTHVAATYD GSQRTLYVDG AAVATDTVDV GPPSWGGQLT FGGGGDPTHT FDGTIDEIRL YERALNDTEL SAQAVQSRND Q
|
| |