Gene Information |
Locus tag | Hneap_2254 |
Symbol | |
ID | 8535418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2425258 |
End bp | 2427249 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384634 |
Product | hypothetical protein |
Protein accession | YP_003264116 |
Protein GI | 261856833 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
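The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene sequence that ends in a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2425258, 2427249)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values in this record, the sketch gives 1992 bp and 663 aa, matching the Gene Length and Protein Length fields. The strand (-) does not affect these lengths; it only determines which strand of the replicon is read.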
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.017887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTGAACGCTT TTACAAGCCA TTGCCTTGGC TATCCGCCTT GGCGTTGACT GCGTTATTGG CCGGTTGTGG CGGCGGCGGG CAAGCGCCCA TTTTAGGCGC CGGTGGTGCT GTGCAAGCGC CAGCGCCTGC TGTACCACCT GTCGTACCGC CTGTTATACC GCCGGTTGTA CCGCCAGCGC CGCCCACCGT AACCGCTGTC GCCCCCCTCC ACAACGCCAC GGGTGTCCCG GTAAACAACA CACTGATTAC CGCTGCATTT AGTGAGCCCA TGGCAGCCAT CACCGGTGGC GCAAGTTTCA CTGTTACCTG TGCCGCGCCT TGTGTTAGCC CCGCCGGTAC AGTTACCCTC GATAGCACCA ACCGGGTGGC AACGTTCGCA TCACCCACCG ATCTTACCGC CGCAACCCAA TATACGGCCA CGGTAACCGG GGCTACGAGT TTGGCCTCGG GTTTAGCTTT AGCAGCGCCT TATGTGTGGC GATTTACCAC CGGGATCACG TCAGATACCA CGCGCCCCAG CGTCATTTAC ACCGTACCGG TTACCACCAT TCCGGGCCCA ACCGCGAATG TTGCAGCGAA CACGGCAATA ACAGCTACTT TTTCAGAGGC GATGGCCCCC GCGTCGATCA CAGCCAGCGG AACCTTTGCT TTAACCTGTG CCGCACCTTG TGTTTCCCCT GTATCGGGTG TTGTCACGTA CGATGTCGGG AGCAAAACGG CGATTTTCCG CCCCAATGCG GTGCTTGACA ATGGCACAAC CTACACGGGA ACCATCACCA CGGCAGCAAC CGATGTAGCC GGCAATGCTC TGGCGGGTAA TCAAGCGCCC CTGCCGGCAG CCAGCAATTA CGTTTGGCAA TTTACGACCA CCACACCCGA TACCACACCG CCCACAATCG TGCTGGAAAA CCCGGCTGAT CAGGCCACTA ATGTGGCTCT GAATACGCCG GTTAATGCCA CATTTAGTGA GGCAATGGAT CCGTTAACGC TGACGACGGC CAATTTCTCG CTCCAGGTGA GTGGCCCGCC GATCGGTACG CCGTTAACCG GCGCCGTGAG TTATAACCCG CAAACCCTGG TTGCCTCGTT CACCCCGGCC AGTGACTTGC TTGCCAACAC AACCTACACG GCCACCATTA CCGGTGCCAA AGATTTAGCC GGTAATGCAC TGGCACCCGG CAGCATGCCT AATCCATGGA CTTTCACCAC CGGCAGCGGG CTTGCACCGG GCGCGGTTTC ATTGGGTTCA GCCAGCACCT TTGGTAATTT GGGCGGCACG GCAGGCACCA CTAATCAAGG TATCAATACC GTGATCAATG GGGATCTTGG CACGACGGCC ACCACGACCA CCTCTGTGAC CGGGTTTCAT GATGCGCTCG ATATTTATAC CGAAACCGGC TCAAATATCG GCACGGTTAA CGGTAAGATT TATACCTGCA CCACATCGAC CACCGGCCCT ACGGCTGTGA CGGTTAACCC AACCTCATGC AACATTGCAA CTCAAGCGCT TAGCGATGCG CAAACAGCCT ACAACAAGCT AACCCCGGCC CAGCTGCCAG GTGGCCTTGA TTTGGGTACA GATCAACTGG GCGGCTTAAC TTTGGCACCG GGTATTTACC AGTCAGCACT GGGCTCCTAC CAAATCACCG GCTCTAATCT CGTATTGGAT GGCCAAGGCA ATGCGAACGC GGTGTGGGTA TTCCAGATGG CCAGCACATT GACCGTTGGC GGCCCCGGCT TACCCATGAG CGTTCAGCTG ATTAACGGCG CGCAAGCTAA AAATGTGTTT 
TGGCAAGTGG GTAGCTCAGC CACCATTAAT GCGGCAGGCG GCGGCACCAT GGTGGGCAAT ATCTTGGCCT ATGCCGGTGT TGCATTCTCC ACGGCCGGGA ATGCGGCCTT AGTGACGCTC AATGGCCGCG CGGTGGGATT AAATGCATCG CTCACCATGA CCAATACCGT GATTAACGTC CCTGCGCCTT AA
|
Protein sequence | MKKFERFYKP LPWLSALALT ALLAGCGGGG QAPILGAGGA VQAPAPAVPP VVPPVIPPVV PPAPPTVTAV APLHNATGVP VNNTLITAAF SEPMAAITGG ASFTVTCAAP CVSPAGTVTL DSTNRVATFA SPTDLTAATQ YTATVTGATS LASGLALAAP YVWRFTTGIT SDTTRPSVIY TVPVTTIPGP TANVAANTAI TATFSEAMAP ASITASGTFA LTCAAPCVSP VSGVVTYDVG SKTAIFRPNA VLDNGTTYTG TITTAATDVA GNALAGNQAP LPAASNYVWQ FTTTTPDTTP PTIVLENPAD QATNVALNTP VNATFSEAMD PLTLTTANFS LQVSGPPIGT PLTGAVSYNP QTLVASFTPA SDLLANTTYT ATITGAKDLA GNALAPGSMP NPWTFTTGSG LAPGAVSLGS ASTFGNLGGT AGTTNQGINT VINGDLGTTA TTTTSVTGFH DALDIYTETG SNIGTVNGKI YTCTTSTTGP TAVTVNPTSC NIATQALSDA QTAYNKLTPA QLPGGLDLGT DQLGGLTLAP GIYQSALGSY QITGSNLVLD GQGNANAVWV FQMASTLTVG GPGLPMSVQL INGAQAKNVF WQVGSSATIN AAGGGTMVGN ILAYAGVAFS TAGNAALVTL NGRAVGLNAS LTMTNTVINV PAP
|
| |