Gene Information |
Locus tag | Htur_4440 |
Symbol | |
ID | 8745069 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 20789 |
End bp | 22489 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514977 |
Product | Ricin B lectin |
Protein accession | YP_003405924 |
Protein GI | 284172542 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0151299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA CACGACGAAC CTACCTGAAA GGAACGGCGG CATCGGCACT GATCGGAATC GGCGCGCTTA GCGGCCTCTC CGGGTCGGCG GCGGCGGAAT CGAACTTCGA CCTCGAGGCC GGCTTCGCGG ACACGTCGTG GCTCGACGAC GACGTCGACG TCCACACGAT CACCGAACCG ACGCGGAGCG CGGTCGAATC GGCGTTCAGC GCCAGCGGAG CGCGCGTGGT CGTCTTCGAG ACGAGCGGAA CTATCGACCT CGGTGGAAAC GATCTGGCGA TCACCGAAGA CTACTGCTGG GTGGCCGGCC AGACCGCGCC GTCGCCCGGT ATCACGTTCA TCAACGGACA GGTCCGGATC AGCGCGAACA ACTGCGTCGT CCAGCACATC CGCTCGCGAA TCGGCCCCGG TTCCGACGGC TCGATCCAGA GCAACGACGC GTTCAACACC GCCGACGGTA CCCAGAACAA CGTCGTCGAT CACGTCAGCG CCTCGTGGGG CACCGATGAG TGCCTCTCCG TCGGCTACGA CACGCAGGAT ACGACGGTAA CCAACTGTCT CATTTACGAG GGGCTGTACG ACCCCTACGG CAACGAGGCG GACCACAACT ACGGGAGCCT GATCGGCGAC GGCGCCTCGA ACGTCACCCT CGCGGGCAAC GTCTGGGGGA AGGTCCGCGG TCGCGCGCCG CGACTCAAGA GCGACACCGA GACCGTCGTC GTCAACAACC TCCTGTACTT CTTCGACGAG TCGGCCAACG CCGACGACTC TGCGGTCACG AGCTTCGTCG GTAACGCGGC GATCTGTGCG GACGACGATG ACGCCATTCT CGAGGGCAGT CCGACCGCGT ACCACGCCGA CAACATTGCG TACGATCCGC CGATGGTCGA CGAGCAGCCG ATCGCCGAAC CGGAGTCGAC GAGTTCGCCG CCGCTGTGGC CGAGCGGCCT CAGCGAGATG CCGTCGGGTG ACGTCGAGAG CCACAACCTC ACCAACGCCG GGGCGCGGCC GGCCGATCGA ACGCAAAACG ACGCGCGAAT CGTCCAGGAG ATCGCCGACC GCGCCGGGCT CGACTACCTC GACTCGCCGT ACGACTACTG GGTCGGCCAC CACGACGAGG TCGGCGGCTA TCCGGAGCTC CCCGTGAACA CCCACTCGCT CGAGGTCCCC GACAGCGGTA TCCGCGACTG GCTCGCCGGC TGGGCCCAGG CCGTCGAGGA GGGCAGTTCG CCGCCCGACG GCGGTAGCGG CGACGACGGG AGCAGCGGTC CGATCCCGAC GGGCACCTAC GAGATCGCCA ACGTCAACAG CGGGCAGCTG CTCGAGGTGG CCGACGCGTC CACCGCGGAC GGCGCCAACG TCCAGCAGTG GTCCGCGACC GATCACGCCA CGCAGCAGTG GTACGTCGAG GATACCGGGA ACGGCGAGTA CGTCCTCCAG AACGCGAACA GCGGGCTGTT GCTCGAGGTC GCCGACGGCT CCACCGAGGA CGGCGCGAAC GTCCAGCAGC ACGCGGACAC GGGTTGCGAC TGCCAGCGGT GGTCCATCAA CGACGTGGGC AACGGAGAGT ACATCCTCGA GGCGGTCCAC AGCGGAAAGG TAGCCGACGT CGAGGGAGCG TCGACCAGCG ACGGGGCGAA CGTACTCCAG TGGCCCGACA CCGGCGGCGC GAACCAGCGC TGGACGTTCG ACTCGGTGTA G
|
Protein sequence | MKQTRRTYLK GTAASALIGI GALSGLSGSA AAESNFDLEA GFADTSWLDD DVDVHTITEP TRSAVESAFS ASGARVVVFE TSGTIDLGGN DLAITEDYCW VAGQTAPSPG ITFINGQVRI SANNCVVQHI RSRIGPGSDG SIQSNDAFNT ADGTQNNVVD HVSASWGTDE CLSVGYDTQD TTVTNCLIYE GLYDPYGNEA DHNYGSLIGD GASNVTLAGN VWGKVRGRAP RLKSDTETVV VNNLLYFFDE SANADDSAVT SFVGNAAICA DDDDAILEGS PTAYHADNIA YDPPMVDEQP IAEPESTSSP PLWPSGLSEM PSGDVESHNL TNAGARPADR TQNDARIVQE IADRAGLDYL DSPYDYWVGH HDEVGGYPEL PVNTHSLEVP DSGIRDWLAG WAQAVEEGSS PPDGGSGDDG SSGPIPTGTY EIANVNSGQL LEVADASTAD GANVQQWSAT DHATQQWYVE DTGNGEYVLQ NANSGLLLEV ADGSTEDGAN VQQHADTGCD CQRWSINDVG NGEYILEAVH SGKVADVEGA STSDGANVLQ WPDTGGANQR WTFDSV
|
| |
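The coordinate, length, and GC fields in this record are related by simple arithmetic. The sketch below shows one way to cross-check them. It rests on assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, and that the gene sequence includes the terminal stop codon (the listed sequence ends in TAG). The function names are hypothetical helpers written for illustration, not part of any database API.

```python
def clean_sequence(grouped: str) -> str:
    """Join the 10-character groups used in the record into one string."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the record above.
length = gene_length(20789, 22489)
print(length)                  # 1701
print(protein_length(length))  # 566
```

To check the reported GC content, pass the full Gene sequence field through `clean_sequence` and then `gc_percent`, and compare the rounded result with the GC content row.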