Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1910 |
Symbol | |
ID | 8535068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2043235 |
End bp | 2045103 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646384291 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003263779 |
Protein GI | 261856496 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00799183 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTAA ATACTGGGAG TTGTTCGCGG GCCGAGCACT ATTCGGATGC CTCCCCCAGA GCCCTGTTTG CAGTGAGTTC GCTGGGGCTG GGACATGCCA CCCGCAGCCT CGTTTTAATC CATGCGTTTC TGGCCGATGG CTACCGGGTT ACGGTTATTT CAACGGGCAA TGCGTTGGCC TTCATGCGCC TGGAGCTGGC CGATCACCCC GCTGTCGAAT GGCAGGATTG GCCTGATTAT CCACCGCTGG AACGCGGGAC TGGTTGGCGG TTCTACGCTT ATCTTGCGCT TGATCTGTTC ACCACTTGGC GACGAATCCG CGAAGAACAC CATCGGTTTG AATCCATTGC CTGTGATTAT GATTTTGTAT TCAGCGACGG GCGTTACGGC ATGTACAGTC GCTGGGTTCC CTCATTCATT CTGTCTCATC AAATTGCCTT CATTCCGCCC AAGGGGTTGC AGGAAGTCAG CTGGATGAGC GATCACCTCA ACGTGGCCGC GCTCAAGCAG TTCGACCGGA TTTTTATTCC GGATTTCCCC TGCCCGAGCC TGAATCTAGC CGGGAATCTT TCACATACGC CCGTTTTGCA GAACACCCGC CGTGAATATG TCGGGATTCT GTCGTCCTAT CCGCATCAGG AGCTTGAGCA GGATATCGAC TACCTGTTCG TGATCAGCGG CTATCTGCAC GAGCACAAGG GGGCGTTTGT CCGTGATCTG CTTGAGCAGG CGCAAGCTTT GCCCGGCAAG AAGGTTTTTG TCCTTGGGGA TGCGAACGCT GATCCGGCAC AGTATCGGGA CGCACTCTCT GACGATCTCG AAGTGCATCC GCTGGCGACG GGTGATCTAA GGCAGGAATT GTTCGGTCGT GCCAAATGCA TTATCTCTCG AGCCGGGTAC ACCACGGTGA TGGATCTGGT CGAGCATGAT AAGCGCGCGC TTCTTATTCC AACGCCGAAT CAAACGGAGC AGGAATACTT GGGCTACTAC CTCGGTAGTC TTAAATATTT CGTCAGCCGG AATCAGTCCG AAGCATTCGA TCTTGCCGCC GCTTTGGCGC AAACAGAAGA CACGCGGCTC TTTGAGGCAC CCTGGCGGAC GCAGGAAGCG GTGCGCCGCA TACAGTCATC AATGAGCGAT GCGCTCCACA AGAACTTTTT CTCGGTGATA GTGCCCGCAC ATAACGAGGC GGCGGTACTG GCGGAAACGT TGGACAGTCT CCTTGCACAA AGCTATCCAG CAAATCGAAT GGAAATCATC ATCGTGGAAA ATGGCTCAAC CGATGAAACC TGGGCGATTG CTGAACGGTA TGCGGCGCTA TCTGACGGCA ACCTGTCGAT TCGTGCTCTG CAAAGCGAGA AGGGCGTTTC CAAGGCCAAG AATGTGGGCT TGGCGGCCAT GAGTGTGCAT TCTGATTGGG TAATCTTTTG TGATGCCGAC ACGCAGTTGG CACCCAAAGC CTTGCGTCAG TTCAACCGCT GGATCAATCA GCATGGCTCC GAGAGTCTCG CAGTGGGAAC AACGCGTGTA CGACCCCTGA GCGTTAATCG TGTGTATGTG CGGTTGTGGT TCAGGGCCTA TGACTTGATT CATCGACTGA CGCGCAGCTC TTACTCCATA CAGTTGGCGC GTTCGCCAAT TGCGCGCGGG ATCGGTTTTC GTCCGGAATT GAGTTTTGCA GAAGATCTGA CATTCATCAG TGAGTGTCGT CGCTATGGCC GGTTTTTCTA TATTCCCAGC GATCAGGTGG CGACATCGAC CCGGCGATTT GAAGCCCAAG GCTATCTGAA GCAAAGCCTG AAATGGTTGT TCGAGGCGCT GATGCCCATG CGGATGAAGC GGAAACGAGG ATACGATGTC ATTCGCTGA
|
Protein sequence | MNVNTGSCSR AEHYSDASPR ALFAVSSLGL GHATRSLVLI HAFLADGYRV TVISTGNALA FMRLELADHP AVEWQDWPDY PPLERGTGWR FYAYLALDLF TTWRRIREEH HRFESIACDY DFVFSDGRYG MYSRWVPSFI LSHQIAFIPP KGLQEVSWMS DHLNVAALKQ FDRIFIPDFP CPSLNLAGNL SHTPVLQNTR REYVGILSSY PHQELEQDID YLFVISGYLH EHKGAFVRDL LEQAQALPGK KVFVLGDANA DPAQYRDALS DDLEVHPLAT GDLRQELFGR AKCIISRAGY TTVMDLVEHD KRALLIPTPN QTEQEYLGYY LGSLKYFVSR NQSEAFDLAA ALAQTEDTRL FEAPWRTQEA VRRIQSSMSD ALHKNFFSVI VPAHNEAAVL AETLDSLLAQ SYPANRMEII IVENGSTDET WAIAERYAAL SDGNLSIRAL QSEKGVSKAK NVGLAAMSVH SDWVIFCDAD TQLAPKALRQ FNRWINQHGS ESLAVGTTRV RPLSVNRVYV RLWFRAYDLI HRLTRSSYSI QLARSPIARG IGFRPELSFA EDLTFISECR RYGRFFYIPS DQVATSTRRF EAQGYLKQSL KWLFEALMPM RMKRKRGYDV IR
|
| |