Gene Information |
Locus tag | Hneap_1041 |
Symbol | |
ID | 8534188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1125873 |
End bp | 1126979 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 646383425 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003262924 |
Protein GI | 261855641 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
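The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1125873, 1126979)
print(length)                  # 1107, matching "Gene Length"
print(protein_length(length))  # 368, matching "Protein Length"
```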
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCATG CAGAATCCCC AAAGTCTTCC ACTCCGAACA CTACAGAGCC AATGGGCTCT GCGCCATTGG ATTCGTCTGC GTCGAAAGCT CATCAATCAG CCAGTCTGTC CCTGGTGATC CCCGTATTCA ATGAGCAGGA AAATATCGCG CAGTTAATCA AGCGTGTGCA TGAGGCCTTG GCGAATTACA CCGCACCATG GGAATTGCTG CTGGTGGATG ATGGCTCGAG TGATCGCACT GTGAAAATGA TCGAGCAGGG TAGAGCAGAG TATGGTTCGC ATGTCCGATT GGTCCCGCTG GCGCGCAACT TCGGCCAGAC GGCGGCCATG CAGGCGGGTA TCGACTTTGC CCGTGGCGAG GTGATTGCTA CCTTGGATGG CGACTTGCAA AATGATCCCA TCGATATTCC GCGCATGGTG GATCGCCTGT TGCGTGAAGA CCTTGACTTG GTGGCTGGCT GGCGCAAGAA CCGGCAGGAT AACCTCTGGC TGCGCAAAGT CCCTTCCAAA ATTGCCAATC GATTGATTCG ACGAATTACG GGCGTCACGC TGCATGATTA CGGTTGCAGC CTGAAAGTGT TTCGTGCCGA AATCATCAAA GGTGTCCGTC TGTACGGTGA GATGCATCGT TTTATCCCTG CCTGGTTGGC AACGCAAACC TCGCCCAGTC GGATTCAGGA GGAGGTGGTT GCGCATCATC CGCGCACGGC CGGTACGTCA AAATATGGTT TGTCGCGCAC GTTCCGGGTC ATCATCGATC TGATTTCCGT GTATTTCTTT ATGCGTTTTT CTGCCCGCCC CGCGCATTTT TTCGGGATGC TGGGTATGGG GTTCGGCACG CTGGGGGGTC TGGTTTTAGC CTATCTTCTG GTTTTGAAAA TAATGGGCGA GCAAATTGGT GATCGGCCAT TATTCATGGT CGGCATCATG CTGGTGTTGA TTGCGGTTCA GATTTTGACC ACGGGTGTTC TTTCCGAAAT GTTGTCGCGC ACATACTACG AATCCAAGGA AGTGAAATCC TATCATGTGC GTCCGTCCGC GCTGACTGAA CTTGAGGATG CCAATTGGTG TCAATCGAAA TCACAACCTG ATGAGCCTGC ACTATGA
|
Protein sequence | MPHAESPKSS TPNTTEPMGS APLDSSASKA HQSASLSLVI PVFNEQENIA QLIKRVHEAL ANYTAPWELL LVDDGSSDRT VKMIEQGRAE YGSHVRLVPL ARNFGQTAAM QAGIDFARGE VIATLDGDLQ NDPIDIPRMV DRLLREDLDL VAGWRKNRQD NLWLRKVPSK IANRLIRRIT GVTLHDYGCS LKVFRAEIIK GVRLYGEMHR FIPAWLATQT SPSRIQEEVV AHHPRTAGTS KYGLSRTFRV IIDLISVYFF MRFSARPAHF FGMLGMGFGT LGGLVLAYLL VLKIMGEQIG DRPLFMVGIM LVLIAVQILT TGVLSEMLSR TYYESKEVKS YHVRPSALTE LEDANWCQSK SQPDEPAL
|
| |
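The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, assuming that layout, of stripping the display spacing and computing GC content. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first three blocks of the gene sequence are used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used for display and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence shown above (illustration only).
fragment = clean_sequence("ATGCCGCATG CAGAATCCCC AAAGTCTTCC")
print(len(fragment))                    # 30
print(round(gc_percent(fragment), 1))   # 53.3 for this 30-base fragment
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the listed GC content, and `len(clean_sequence(...))` can be compared with the listed gene and protein lengths.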