Gene Information |
Locus tag | Hneap_1504 |
Symbol | |
ID | 8534662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1629084 |
End bp | 1631045 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 646383894 |
Product | Capsule polysaccharide biosynthesis protein |
Protein accession | YP_003263382 |
Protein GI | 261856099 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGTTAG CTCAAAAGCA CCAACGCCCT TTTTACTTGC TTGAAGATGG ATTATTGCGG TCCTTCAACA CCGCCGTGGC AGGCGAACCC AGCCTTTCTT GGGTTTGGGA TGATCGGGGT ATTTTTTATG ATGCGCGTAC CCTTTCCCGC CTTGAATCTC TTATTGGCAG TTCGAACCTG ACCACCGCGC AACGAGAAAC AGCAGAGCGG GTCTTGGCGA GCATTCAAGC CAATGGTTTG AGCAAATACA ACCATGGACT TCCCGTTCCT GCAGGTTATT TCCCGAACAT ATCTACTCGA CGGGTTTTAC TGGTGGACCA AACCCGCGGT GATGCTTCGA TTGTCTTTGG TAATGCCTCC GCACAATCGT TTGAAGCGAT GCTGGCTTGC GCCTTGAACG AGGAGCCAGA TGCAGAGCAT TGGGTTAAAA TTCACCCGGA TGTGCTGGCT AACAAGCGTC AGGGCTGCAT CAATTTGGCT AATCACCCTC ATATTCGGAT CATTACCGAT GATTTTCATC CGCATGATCT TATTTCTCAT TTTGATCGGG TGTTTGTCGT GACCTCGCAA ATGGGTTTTG ATGCGCTGCT GCTCCATAAG CCGGTCACCT GTTTCGGGCA GCCGTTTTAT GCCGGATGGG GGCTAACGGA TGACAAAAAT GCCCCTCAAC GTCGGGGAGA ACAGCGCGAT GTATTGGATT TAGTTCACGC GGTTTACTTG GAGTACGCCT GTTATGTGCA TCCAATACAC GCCACTGCGA GTGATTTTTT TACGGTAGCC CAATACATTA CGCGCCAAAA ACAAGCCACG CGATTTTGGA TTGATCCGCT CGGCATGAAC ATTCACAAGG AAACCAACGA GCAGTCAATG ACTAATACGA TAAAGAAACC GCGAATTCTG TGCTTTGGTT TTCGTTTTTG GAAGCGTGCT CAGGTTCGTC CATTTTTTGG CGATCAGGTT CGTCTGGTTT TTTGCGATTC TCTGGATGAT GCGATAGCCA AGGGCATAGA AGCCACAGAT CAGTTCGCGG TTTGGTCGAG CAAGGCAGAC AGCGCGCTGC TTGATTATGC GCGTGAACAG GACATACCTC TGGTGCAGGT TGAAGATGGC TTTTTGCGCT CAGTCGGGTT GGGTTCTGAT TTTGTTGCGC CGCTATCCCT AGTATTTGAC CGAACGGGTA TCTATTCGAA CCCCAATCAC CCCAGCGATC TTGAAAACAT GCTGCAACAG CATGAATTTT CAGAAGAAGA ATGCGATGAA GCAGCGCGTC TGGTTGAGCG GATTGTTGCG AACAGAATCA CAAAATACAA TGTCGATAAC GACACACCGA TCATAATCAA ACCGGCAGGC CGGAGCGTGA TTCTGATTCC CGGGCAAGTA GCAGATGATG CGTCGATTCG GATGGGTGCT GTCCATGTGC GCACCAATGA AGAACTGATT CTTAATGTGC GCCAAACGAA CCCTGACGCG TTTATTATTT ACAAGCCACA CCCCGATGTG GCCTCGGGTA ATCGACAAGG TATCGTGTCC GAGTCAGTTT TACGAAAACA TTGTGATTTG GTCGCCGTTG ATCAGAGCGT CCTGTCCTGC CTCGATGTTG CCGATGAGGT ACATACCATC ACCTCTCTGA CCGGGTTTGA TGCCTTGATT CGCGGAATAA AAGTAGTCAC ATACGGAATG CCGTTTTATG CCGGCTGGGG GTTAACGCAT GATCACAATA GCTGTCCGCG ACGCACGCGC TCGTTAACGT TAAATCAATT AGTTGCCGGT ACTTTGTTAC GCTATCCCCG CTACTGCTTA 
CCCCAACATA AAGGCTTCGT TCATGCAGAC GACGTATTGA GCGAATTGAT CCACCAAAAA GCCAAGCTTA AAACGGTGCG CGTGCGAGGA ATCAGGCGAT CCATGCGGAA ATGGAGTGGA TTTTTTCGCA GTGCGTACCA TTCACTATCG CGTGAGAATT AA
|
Protein sequence | MALAQKHQRP FYLLEDGLLR SFNTAVAGEP SLSWVWDDRG IFYDARTLSR LESLIGSSNL TTAQRETAER VLASIQANGL SKYNHGLPVP AGYFPNISTR RVLLVDQTRG DASIVFGNAS AQSFEAMLAC ALNEEPDAEH WVKIHPDVLA NKRQGCINLA NHPHIRIITD DFHPHDLISH FDRVFVVTSQ MGFDALLLHK PVTCFGQPFY AGWGLTDDKN APQRRGEQRD VLDLVHAVYL EYACYVHPIH ATASDFFTVA QYITRQKQAT RFWIDPLGMN IHKETNEQSM TNTIKKPRIL CFGFRFWKRA QVRPFFGDQV RLVFCDSLDD AIAKGIEATD QFAVWSSKAD SALLDYAREQ DIPLVQVEDG FLRSVGLGSD FVAPLSLVFD RTGIYSNPNH PSDLENMLQQ HEFSEEECDE AARLVERIVA NRITKYNVDN DTPIIIKPAG RSVILIPGQV ADDASIRMGA VHVRTNEELI LNVRQTNPDA FIIYKPHPDV ASGNRQGIVS ESVLRKHCDL VAVDQSVLSC LDVADEVHTI TSLTGFDALI RGIKVVTYGM PFYAGWGLTH DHNSCPRRTR SLTLNQLVAG TLLRYPRYCL PQHKGFVHAD DVLSELIHQK AKLKTVRVRG IRRSMRKWSG FFRSAYHSLS REN
|
| |
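The coordinate, length, and sequence fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database: the function name `translate_cds` and the constant names are hypothetical, the codon table is the standard genetic code as used by translation table 11, and the start-codon set is the one listed for table 11 by NCBI. It assumes the displayed gene sequence is already given in coding orientation (the gene is on the minus strand, and the sequence starts with GTG and ends with TAA). Only the first 30 and last 12 bases of the gene sequence are used here.

```python
# Values copied from the record above.
START_BP = 1629084
END_BP = 1631045
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653

# Standard genetic code in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons for translation table 11 (assumption: NCBI's list).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, first_is_start=True):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored. If first_is_start is True and the first codon is
    a table 11 start codon, it is translated as M (as in GTG -> M here).
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Coordinates are inclusive, so length = end - start + 1.
computed_gene_length = END_BP - START_BP + 1
# One codon is the stop codon and does not appear in the protein.
computed_protein_length = computed_gene_length // 3 - 1

# First 30 bases and last 12 bases of the gene sequence in the record.
GENE_PREFIX = "GTGGCGTTAG CTCAAAAGCA CCAACGCCCT"
GENE_SUFFIX = "CGTGAGAATT AA"

prefix_protein = translate_cds(GENE_PREFIX)
suffix_protein = translate_cds(GENE_SUFFIX, first_is_start=False)

print(computed_gene_length)     # 1962
print(computed_protein_length)  # 653
print(prefix_protein)           # MALAQKHQRP
print(suffix_protein)           # REN
```

The computed values agree with the Gene Length and Protein Length fields, and the translated prefix and suffix match the first ten and last three residues of the listed protein sequence. A library such as Biopython could replace the hand-built codon table for a full translation.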