Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0226 |
Symbol | |
ID | 8533341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 236581 |
End bp | 238059 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646382604 |
Product | neutral invertase |
Protein accession | YP_003262136 |
Protein GI | 261854853 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3408] Glycogen debranching enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.149494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCACAGA AAAACCCGGA TTCCTGTTTT CAAAATCCCA AGTACGCGGG CTTTTCGCCT GCGCTCGATG ATGCCTATCG CCTGATCGAC AGCGCCCTGA TCTATTACCA GGGCCAGATC GTCGGCACCG TCGCCAGCAC CGACCACACC GCACCGGCCG TCAATTACAG TGATTGCTTC GTGCGGGATT TCTTCTCGGC GGGGTTGATC ATGCTGCTTG AGGGGCGGGC TGACATTGTG CGCGCGTTCT TGCACGTAAT CATGCAGCTT CGTGGCCAGC AGGAGGCGCT GGAAGGTCAG CAGATCGCGC CGGGCGTCCT GCCGGCTTCA TTTCGCGTCC ATCGGGATGC CGATGGAGAA GAAACCATCA TCGCCGATTT TGGCGACCGG GCCATCGGTC GGGTCGCCCC CGTCGATTCG ATGATGTGGT GGGCCGCTTT GCTGCGCGCT TACGTGCGGT ACACCGGCGA CGAAGCCTTT GCCCACACGC CGGAAATCCA GCGCATGTTG CGGATGATTC TGAGCCTTTG CCTGCAAAGC CGTTTCGAGG TTTTCCCCAC CTTGCTCGTG CCCGATGGTT CCTTCATGAT CGATCGGCGG ATGGGGGTCA ACGGCCATCC GCTGGAAATT CAGGCGCTGT TCGACATGAC GCTGTGTTGT GCCGATTTGC TGGTGCCGGA AGAGGGCAGC CAGTGGCTGA TCGACCTGGC GCATCGTCGG CGCGTCGTGC TGCGACAATA CCTGCAACGG TATTACTGGC TCGATATGGA CGTGCTCAAT CGCATCTACC GGTTCTCGAC CGAAATGTTC GGCGAAGACG TCGAAAATCT GTTCAACATC TATCCCGAAT CGATCCCCGA GTGGTTGCCG GAATGGCTGC CGGATGGCGC TGGCTATTTT GTCGGCAATC TCGGGCCGGG GCGCGTGGAT TTTCGCTTTT TCTCTCAGGG TAATCTTTTG ATGCTGGTGT CCGATCTGGC CTTGCCCGAG CAGGTCAAAG GATTGATGAA TCTGATCGAC CTGCGCTGGA ACGATCTGAT CGGCCGGATG CCGATGAAGC TGGTCTATCC GGCCATCAAG ACCCACGAGT GGCGCTTGAT CACCGGTTCG GACCCGAAAA ATATTCCTTT GTCTTATCAC AATGGCGGCA ACTGGCCGGT GCTGATCTGG CCCTTCGTCG CTGCGGCCAT CAAGGCGGGC CGTTACGACA TGGCTTCTCG CGCCTGGGCC GAGGCCGAGG AACGCTTACT CAAAGACAAC TGGCCCGAAT ACTACGACGG CCGCACTGGG CGGCTGGTGG GTCGGCGTTC CAATGTCCGC CAGGTCTGGA GCGCGACGGG GTTGCTTCTG GCGAGGCATT TTCTTGACGA GCCGGATGTA TTGAATCGCC TGGGTTTTGC ACCTCAGCCG CCGGATGATC CCGAACTGAT GGGCGCTCAA TGGACGCCTC AAGGTACTCA ACATGGAGGA CCGCGATGA
|
Protein sequence | MAQKNPDSCF QNPKYAGFSP ALDDAYRLID SALIYYQGQI VGTVASTDHT APAVNYSDCF VRDFFSAGLI MLLEGRADIV RAFLHVIMQL RGQQEALEGQ QIAPGVLPAS FRVHRDADGE ETIIADFGDR AIGRVAPVDS MMWWAALLRA YVRYTGDEAF AHTPEIQRML RMILSLCLQS RFEVFPTLLV PDGSFMIDRR MGVNGHPLEI QALFDMTLCC ADLLVPEEGS QWLIDLAHRR RVVLRQYLQR YYWLDMDVLN RIYRFSTEMF GEDVENLFNI YPESIPEWLP EWLPDGAGYF VGNLGPGRVD FRFFSQGNLL MLVSDLALPE QVKGLMNLID LRWNDLIGRM PMKLVYPAIK THEWRLITGS DPKNIPLSYH NGGNWPVLIW PFVAAAIKAG RYDMASRAWA EAEERLLKDN WPEYYDGRTG RLVGRRSNVR QVWSATGLLL ARHFLDEPDV LNRLGFAPQP PDDPELMGAQ WTPQGTQHGG PR
|
| |