Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1526 |
Symbol | |
ID | 8534684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1658939 |
End bp | 1659943 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646383916 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003263404 |
Protein GI | 261856121 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATC GTTCCGTACG CATTTCGTGC GTGGTACCTG CCTTCAATGA GGCTGCCAAC CTACCGAATC TGTTGACTCA ATTGAGCATG CAGTTGGGCG AGTTGTCCGA TCACTGGGAA ATCATCGTGG TCGACGACGG CAGTCGGGAT CGTACGCGCG CCGTATTGGC CGAATATGCC GATACACCGG GTTTGGTCGT GCTGCATTTC TCACGCAATT TCGGCAAGGA AGCGGCCTTG AGCGCGGGGC TTGATCGGGT GCGGGGCGAT GTCTGTTTCA TGCTGGATGC CGATTTGCAA CACCCGGTCG CCTTGATGCC GCAAATGCTG GCCGCCTGGC GCGAGGGTGC GCAGATGGTT TATTTCGTGC GGGAACATCG CAGGGATGAA CCGTTCTGGA AGCGCTGGGG TTCCGGGTTG TTGTATTGGT TGATCAATTT CGGTTCGGCT GTTCAGGTGC CGGAAGATGC TGGCGATTTT CGGCTGCTGG ATGCCAAAGT GGTCGCTGCC CTGCGTGCGC TGCCCGAACG TAATCGCTTC ATGAAGGGCT TGTATGCCTG GGTTGGCTTT CGCACGCAGG CCCTGCCGTA TACCCCCGAG CCCCGGCTGC ACGGTAAAAG CCATTTTTCG GGGCGGCGGT TGCTGAATCT GGCGTTGACC GGCTTGACGG CCTTCTCGAA TGTGCCTCTG CGGTTGTGGA GTGTGTTCGG CCTGATTCTC GCATTGCTGG CGCTGATCTA CGGCGGCTAC GTCACGCTGG CTTATTTCAT CGATCAACGC CCTGTCGCTG GTTGGACGAC CATTGTCGCC GGGCTGATGC TGTTCGGCGG CATCCAGTTG ATCTCCATCG GGATTCTCGG TGAGTATCTA GGGCGAGTGT ATGATGAGGT CAAACAACGG CCACGCTATA TCGTCGACGA ACAGATCGAT AACAGCCCGT TTGCCCAAGA ACAAATGGCC ACGCCTCTGC CGAGTTTGAT CGAATCCGAT AGCAAACATG GATGA
|
Protein sequence | MNDRSVRISC VVPAFNEAAN LPNLLTQLSM QLGELSDHWE IIVVDDGSRD RTRAVLAEYA DTPGLVVLHF SRNFGKEAAL SAGLDRVRGD VCFMLDADLQ HPVALMPQML AAWREGAQMV YFVREHRRDE PFWKRWGSGL LYWLINFGSA VQVPEDAGDF RLLDAKVVAA LRALPERNRF MKGLYAWVGF RTQALPYTPE PRLHGKSHFS GRRLLNLALT GLTAFSNVPL RLWSVFGLIL ALLALIYGGY VTLAYFIDQR PVAGWTTIVA GLMLFGGIQL ISIGILGEYL GRVYDEVKQR PRYIVDEQID NSPFAQEQMA TPLPSLIESD SKHG
|
| |