Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0494 |
Symbol | |
ID | 8533621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 526402 |
End bp | 527598 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 646382876 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003262396 |
Protein GI | 261855113 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCC TTTTGAATCT CCATCCACTT GCTCGGGCTG GGGCCGGAAT TGCTGTATAC ACACAGAGAC TCCTCATGGA GTTGGTTCAC TTCCGTGATT TGGATGAAGT CGCCGGGTTC CTTGGAACGA GGGTTCTTTC CGGTGACGAG TTACGTCTTT GGTTAAGTGG CTTTGATAAT CAAACAGAAA AACGAGTAAA GGAAGGCGTG AAATCTTCAC CGCTAATGAT CGACGCCGTA CGCCGCGCCG CACGATCAAT TCCAGGTATG TATGAACTCA GGTACCAATT GAGGAATATC GCCAGTCAAT TTGCTTTAAA CAAGTTCGCC CAAGCGGGTT TTATTTATCA TGAGCCGAAT TACATACCCA CTAGATATTC GGGTAAGCAG GTAATAGCCG TTCATGATTT GTCCCATATT CGATATCCAG ATTTCCATCC GGCGGAACGT GTGGCATTTC TCAATCGTCA TCTCAAACGG GCGATTGGTT TAGCAGATTT TGTCCTGACT GACTCGGTGT TTGTGAAGGA TGAGATTCTT GATGTATTTC CTGTGTCCGG TGAAAAAATT GTTGTCACCC ACCTTGGGGT TGATGAGGCT TTTCATCCTC GACCGGAAGT AGAAACTCTG AATACCTTGC GTCAATTTAA TTTGAGCTAT CGTGGTTTTG TCCTGTCGGT TGGGACTTTG GAGCCACGAA AAAACCTGGA AAGGTTGTTG AACGCCTACG GTGCGTTGCC CGAAGGCGTT CGCCGCGATT ACCCCTTGGT CTTGGCTGGC GGCGGTGGTT GGAATGATTC TGACTTGCAA CGTCAGATTC AACAGATGGA GCGACGCGGG GAGGTGATCC GGACGGGGTA TTTGCCCCGT TCCCAACTTC TGGATTTGTA TGCCTCGGCT GCTGTGTTTG CATACCCTTC CATATACGAA GGGTTTGGTC TGCCTGTTCT GGAAGGGTTC GCCAGTGGAA CGCCAGTCTT GACGTCGAAT GTGACTTCCA TGCCAGAGGT CTCCGGGGGG GCGGCTCTGG AGGTGAATCC GCTTTCAGTG GATGAAATTC GCCATGGGCT ATTCAGTCTT TTGGATGATT CCTCCTTGAG GTCTCGATAT ATGCAACTTG GTTTGGAGCG TGTCCAAGCG TTTACTTGGG CTAAATGCGC CGAGCAGACA ATGGCCGTTT ACAAACAGCT TGGCTAA
|
Protein sequence | MKILLNLHPL ARAGAGIAVY TQRLLMELVH FRDLDEVAGF LGTRVLSGDE LRLWLSGFDN QTEKRVKEGV KSSPLMIDAV RRAARSIPGM YELRYQLRNI ASQFALNKFA QAGFIYHEPN YIPTRYSGKQ VIAVHDLSHI RYPDFHPAER VAFLNRHLKR AIGLADFVLT DSVFVKDEIL DVFPVSGEKI VVTHLGVDEA FHPRPEVETL NTLRQFNLSY RGFVLSVGTL EPRKNLERLL NAYGALPEGV RRDYPLVLAG GGGWNDSDLQ RQIQQMERRG EVIRTGYLPR SQLLDLYASA AVFAYPSIYE GFGLPVLEGF ASGTPVLTSN VTSMPEVSGG AALEVNPLSV DEIRHGLFSL LDDSSLRSRY MQLGLERVQA FTWAKCAEQT MAVYKQLG
|
| |