Gene Information |
Locus tag | Hneap_0227 |
Symbol | |
ID | 8533342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 238056 |
End bp | 240410 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646382605 |
Product | sucrose-phosphate synthase |
Protein accession | YP_003262137 |
Protein GI | 261854854 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain |
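
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions agree with the values in this record, but the record does not state them. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Hneap_0227.
gene_len, protein_len = cds_lengths(238056, 240410)
print(gene_len, protein_len)  # 2355 784
```

Both results match the Gene Length (2355 bp) and Protein Length (784 aa) fields.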
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0132674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA CGCCAGCGCA CGCGAAAGAG CGCCCCGCCG AAACGCGGCC AGAGCCAATG ATCGAGCCCA AGATTGAGTC CACGGTCGAG TCAAGTGCCG AGGGGCGCAA GCCCGGGCTG TACATCGTTC TTATTAGCGT GCACGGACTG ATTCGCGGTT CTGAACTGGA ACTCGGGCGT GATGCCGACA CGGGCGGCCA AACCCTGTAT GTGGTCGAAC TGGCAAGGGC GCTGGCGAAG CATCCGGTTG TTTCGCGCGT GGATCTGTTC ACGCGACTCG TGCGGGATGA TCGTGTCTCG GCGGACTATG CCCAGCCGGA AGAATCGCTG GCGGACGCAC CGAATGCACG GATTGTGCGG GTGCCTGCCG GACCGGACGA ATATCTGCCC AAGGAACAAC TTTGGGATCA TCTGGATAGC TTGAGCGATC ATGCGCTCGA TTATATTCGC CAGACTGGGC TCAAACCCGC ACTGGTGCAT AGTCATTATG CCGATGCCGG TTACGTGGGC ATGCGGCTGT CGTTGCAGCT GGGCGTGCCG CTGGCGCATA CCGGTCACTC GCTCGGGCGG GTCAAGCGTC AGCGTTTGCT GGCCAGTGGC GAGTCCGCCA AGGTCATTGA GCAGAAATAT GCGCTCTCCC GCCGGATCCG CGTCGAAGAG GAAGTGCTCG CGGCGAGCTC GCTGGTGGTG GTGAGTACGC AGGATGAAAT CGAAACCCAG TATGGTCTGT ACGATTGGGC CGATCCCTCG CGCATGGAGG TTATTCCGCC GGGCGTTGAT CTGACGCGGT TCGATCCAAA AATCACCGGG CCGATGCCTA TTGCCGATGA GTTGGCCCGG TTTCTCCGCG AGCCGGACAA GCCCGCCATC CTTGCCTTGT CCCGCCCCGA CGAGCGCAAG AACATCGCCA CGCTGGTTCA TGCCTACGGC CGCAACCCCG CGCTGCAAGA CGTCGCCAAT CTGGTCATCG TGGCAGGAAA CCGGGATGAC ATCCGCGACA TGGACCCGGG TTCCCGACAG GTCTTGACTG AAATTCTTCT GTTGATCGAT CGCTATGATC TGTACGGCAA GGTCGCCTAT CCACGCCATC ATCAATCGCA AGATGTACCC GATTTTTATC GTTGGACAGC GCAAACCCGT GGTGTATTTA TCAACCCCGC GTTGACCGAG CCCTTCGGCC TGACCCTCAT CGAGGCCGCT GCCTGTGGTT TGCCCATTCT GGCAACGGAA GATGGCGGGC CACGGGACAT CATCCGGGCT TGCAAAAATG GTGAGCTTAT CAATCCGCTG GATGCCGAGG GTATGGGCGA ACAGTTATTG GCCTTGCTGA CCGATACGGC GCGTTGGGAC AGCTATGCGC GCAATGGGAT CAAAGGGGTG CGTCATCATT ACACCTGGCC CGCGCACGCC GAACAGTACT TCGAGACGCT TGCCTCCATG CCGTTGCACC AGCAGACCAG CGCACCCGCC GGGGCTTCTG AAACCGCTGC ACATGCCGCA TCCACGCCGA TGACCGCCGA TCGCATGATT CTTATCGATG ATCGCATTCT GAATACGGAT ATTGATGTCG CCGCTCTGCG CGAATTGATT GGCCTGCTGC GACGTCACCG GCGACAGGTG GCATATGGAT TGGTTTCCGA TCGTCCCCGC CACGACATTC TTGCCCTGCT CAAAAAGCAG GGTTTGGTCG TGCCCGACGT GTTGATTACA CGGGGCGGTA CGCAAATTCA TTACGGTGCA CGCCTGTCTC GCGATAAGGG CTGGAGTCGG CACATCAGTT ACAGCTGGCA AGGGGACCGT 
GTGTATGAGT TGCTGGCCGA AACGCCCGGC GTGCGCCTAT CTGGTCGTAG CCACCAGGGC CTGTACGCCG TGCATGCCTA CATTGATGAT CCGGATGTAT TCGCGGGCCT GAATGAGTTG GCCGATGCGT TTCATCAAGC GGATATTTCC GCGCGACTGA CCGCATTGAA CGAACGGGAG TTTCTGGTAA CGCCGCAACG CGCATCCAAA GGCTTTGCGA TTCGTTATCT CGCGGCCCAG CATGACATTG CACTGATGAA TATGCTGGTC GTGGGGAGCG CCGAAGCAGA CAGCGATCTT CTGGGCGGCA ATGTACTGAG CGCGCAGCTC TGTGCCGAAC CGGATGAAGA GTGTGTGGTT CAGGGTGCGG ATAACTCGAT TTATTGCCCG TCGGCCAGTG GCGTGGCGGG TATTCGCGCG GCAATGGATT TTTATGATTT TCTGGGCGAG TGCCGCGCGC CGTCCACCGA TCCCGAGGCC GGTGGCGCCA ACGACGAGCC CAAACCCGCG ACTGCGGCGT CGGGACCGGC ATCATCCAAG GAGGACGAAG TATGA
|
Protein sequence | MNDTPAHAKE RPAETRPEPM IEPKIESTVE SSAEGRKPGL YIVLISVHGL IRGSELELGR DADTGGQTLY VVELARALAK HPVVSRVDLF TRLVRDDRVS ADYAQPEESL ADAPNARIVR VPAGPDEYLP KEQLWDHLDS LSDHALDYIR QTGLKPALVH SHYADAGYVG MRLSLQLGVP LAHTGHSLGR VKRQRLLASG ESAKVIEQKY ALSRRIRVEE EVLAASSLVV VSTQDEIETQ YGLYDWADPS RMEVIPPGVD LTRFDPKITG PMPIADELAR FLREPDKPAI LALSRPDERK NIATLVHAYG RNPALQDVAN LVIVAGNRDD IRDMDPGSRQ VLTEILLLID RYDLYGKVAY PRHHQSQDVP DFYRWTAQTR GVFINPALTE PFGLTLIEAA ACGLPILATE DGGPRDIIRA CKNGELINPL DAEGMGEQLL ALLTDTARWD SYARNGIKGV RHHYTWPAHA EQYFETLASM PLHQQTSAPA GASETAAHAA STPMTADRMI LIDDRILNTD IDVAALRELI GLLRRHRRQV AYGLVSDRPR HDILALLKKQ GLVVPDVLIT RGGTQIHYGA RLSRDKGWSR HISYSWQGDR VYELLAETPG VRLSGRSHQG LYAVHAYIDD PDVFAGLNEL ADAFHQADIS ARLTALNERE FLVTPQRASK GFAIRYLAAQ HDIALMNMLV VGSAEADSDL LGGNVLSAQL CAEPDEECVV QGADNSIYCP SASGVAGIRA AMDFYDFLGE CRAPSTDPEA GGANDEPKPA TAASGPASSK EDEV
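
As an illustration of how the gene sequence relates to the protein sequence and the GC content field, here is a minimal sketch in plain Python. It assumes the sequences are pasted as shown in the record, in space-separated blocks of 10 characters. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAACGACA CGCCAGCGCA CGCGAAAGAG"))  # MNDTPAHAKE
print(translate("GAGGACGAAG TATGA"))                  # EDEV
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports in rounded form.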