Gene Information |
Locus tag | Hneap_1926 |
Symbol | |
ID | 8535084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2059504 |
End bp | 2060691 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646384307 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003263795 |
Protein GI | 261856512 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
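The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminating stop codon; neither is stated in the record, although the gene sequence does end in TGA. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2059504
end_bp = 2060691

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1188 395
```

Both results agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.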
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00887755 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATATTT TAATGATCTC CGATGTGTAC TTTCCGCGCA TCAACGGCGT ATCGACTTCA ATCCAGAGCT TTCGCAGCGA GTTAATCACC CTGGGACATC GGGTAACCCT GATCTGCCCG GATTATCCCG AATCGCTCAC ACTGGAACGC GCCAAAGATC AACACGACGA TGAAGATATT TTGCGCCTAC CGTCCCGCAC CGTGCTGCTC GACCCCGAGG ACCGGATGAT GAGTTACGGC GCCATCATCA ATCTGATACC GATTCTGCGG GGCAGAAGCA TTGACCTCGT TCATATCCAT ACGCCTTTCG TGGCCCATTA CGCCGGCGTA AAACTGGCCC GAAGGCTGGC CATTCCGGTC GTCGAGAGTT ATCACACCTT CTTTGAGGAA TACCTCTACA ACTACATTCG CTGGATACCC AGAAACTGGC TGAAGCGCGC AGCGCGTTTT TTCTCGAAAA GCCAGTGCAA CGCTGTCGAT GCGCTTGTTG TGCCCTCCTC ACCCATGCGC AATGCCCTGC AAACCTATGG CGTGAGCACC GAGATGCACA TCATACCGAC CGGACTCAAC CTCGATGCTT TCCGCACACC GCCAACATCG AACTTCCGGG CCAAACTATC GATCAGAGAC GACCAACCAC TGCTGCTTTA CGTGGGCCGG GTCGCACTGG AAAAGAACAT CGATTTCCTG TTGAACATGA TGCCCTTCGT TCTGAATCAA ACACCCGATG CCGTTCTGGT CATTGCCGGA GAAGGCCCAG CGGAATCACA CCTCAAACGC AGGGTTGCGG ATATGGGGTT ACAGGCTTCT GTCAAATTCG TGGGCTACAT GCGGCGGGAT GGCGCATTGC AGGATGCCTA TCGTGCCGCC GACCTGTTCG TGTTTGCCTC CCGAACCGAA ACTCAGGGAC TGGTTCTGCT GGAAGCACTG GCGCTCGGCA CGCCGGTTGT CGCACTGGGC ATCATGGGCA CACTGGATGT ACTGCATGCC GATGGCGGCT GCGTGATCGC GCCGGACGAT CCATCCGGGT TCGCCGATGC TGTCAATCAA GCCCTAAACC AACCCGATCG CTACCAGCAA TTGGTCGATC AGGCCCCGCG TTATGCAGAA ACCTGGACCG CCGCCCAAAA GTCACAACAG CTACTGGAAA TGTACCGGCA ACAACTCGCC AGTCATACAA CCTCTTGA
|
Protein sequence | MHILMISDVY FPRINGVSTS IQSFRSELIT LGHRVTLICP DYPESLTLER AKDQHDDEDI LRLPSRTVLL DPEDRMMSYG AIINLIPILR GRSIDLVHIH TPFVAHYAGV KLARRLAIPV VESYHTFFEE YLYNYIRWIP RNWLKRAARF FSKSQCNAVD ALVVPSSPMR NALQTYGVST EMHIIPTGLN LDAFRTPPTS NFRAKLSIRD DQPLLLYVGR VALEKNIDFL LNMMPFVLNQ TPDAVLVIAG EGPAESHLKR RVADMGLQAS VKFVGYMRRD GALQDAYRAA DLFVFASRTE TQGLVLLEAL ALGTPVVALG IMGTLDVLHA DGGCVIAPDD PSGFADAVNQ ALNQPDRYQQ LVDQAPRYAE TWTAAQKSQQ LLEMYRQQLA SHTTS
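The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard NCBI layout for genetic code table 11 (the Translation table value in this record), in which GTG is one of the permitted start codons and is read as methionine at the first position. The function name `translate_cds` and the constants are hypothetical, written for this example rather than taken from any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is read as M at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCATATTT TAATGATCTC CGATGTGTAC"))  # MHILMISDVY
```

The output matches the first ten residues of the protein sequence, including the leading M that results from the GTG start codon.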
|
| |