Gene Information |
Locus tag | Hneap_0053 |
Symbol | |
ID | 8533166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 60466 |
End bp | 62061 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646382432 |
Product | protein of unknown function DUF195 |
Protein accession | YP_003261966 |
Protein GI | 261854683 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
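The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(60466, 62061)
print(length, protein_length(length))  # 1596 531
```

Both results agree with the Gene Length (1596 bp) and Protein Length (531 aa) fields of this record.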
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.783995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAT TCTTCGCGCA ATTGCCCAAT TTGCCGCCAG TTTGGGCCTG GGCGGGGCTG GCTTCGACGG CATTGCTTTT GGTCGCATTG CTTTTTGTTT GGCTGGTCTT GGCGCAGCGG GCCAGGCAGA ATCGCCAATC GGCCGCGGAG GAAATCGAAC GCCTGAACGT GGCTTTGGCC GAGTCCCGGC ACGAGGTCAC CGAACAGGAA CTGGCGGCTC GGCAAGCACA GCGTGACCTG ACGGCGGCCT CGACCGAACT GGCACGTACT CAGGCAACAT TGAGTGCGCT CAGTGATCAA TTGTCCCGCA TGCAGGCCGA GCGGATGAGC GAACGCCAGC AATCCGAGCA ACGAATCGAT GTGCTGTCGC GACAGGTGCA AACCCAGGCT GCCGAGCAGG CCGAGTTGCA GGAGCGTTTG GCTCAGGAGC GCCGTGCCGC CGCCGAAAAA CTGGCTTTGA TCGATCAGGC TCAAGTGCAA TTGCAGCAGG CTTTTCAGGC GCTTTCCGCC GATGCGCTGC GCGCCAATAA TGAATCCTTC CTGAAACTGG CCGAGGAGAA TCTCGCTCGT TTTCAGGCAG GTGCCGCGCA AGATCTGAAC AAACGACAGG AAGCCATCGT TCAGATGACC CAGCCCATCC GTGAGCGTCT GGAGCAGTTT GACGTAAAGC TCAATTCGCT GGAACAGGCA CGTACCAATG CCTATGGCGC GATGAACCAG CAGATCAACG ATTTACTGCA AATCCACCTG CCCAAGTTGC ATCGCGAAAC GGCCGATCTC GTCCGAGCCT TGCGTCAACC GCAGACACGC GGGCGCTGGG GCGAGGTGCA ACTCAAGCGG GTGGTCGAGT TGGCCGGCAT GCTGGAACAC TGCGATTTCG AGGAACAGGT CAGTCAGTCC GATACGGGCG GCCGTTTACG GCCAGACATG ATCGTGCACC TGCCGGGCGG ACGTCAGGTC GTTGTGGATG CAAAGGCTCC GCTTAATGCC TATTTGCAGG CGATGGAAGC CCCTTCGGAC GAGGCGCGTG CCGCAGCGTT ACAGGACCAC GCCCGTCAGG TACGCACCCA CATCAGTCAG TTGAGCAAAA AGGAATACTT CGATCAGTTC AGCCCTACGC CGGAGTTTGT GGTGCTGTTT GTGCCGGGCG AGGTGTTTTT CTCCGCCGCC CTGATGCAGG ACCCGACGCT GATAGAGTTC GGCGCGGAGA AACGGGTGAT TCCCGCGAGT CCGACCACGC TGATTGCTTT GCTCAAGGCC GTTTCCTACG GTTGGCGGCA GGAAGCCTTG GCGAAAAACG CACAGGAAAT GGCTGAACTG GGGCGCGATC TATACGAGCG CATGGGCACA CTGGCCCAGC ATTGGCAGAA AGTGGGGAAA CATCTGGATC AGGCCGTGGG CGCGTTCAAT CAGTCGGTGG GTTCGCTCGA GGGGCGGGTG TTGCCCACGG CGCGCAAATT CCGAGAACTC GGTGCCGTAC GTAGCGATAA GGAAGATCTG CCGAGTCTGA CCCCGTTGAC GACGGAAACC CGGCCGCTTA CGGCCCAGGA GTTGACCGGA CCGGACTCAG CGGCTGCCAT CACGCAAAAG GATTGA
|
Protein sequence | MSSFFAQLPN LPPVWAWAGL ASTALLLVAL LFVWLVLAQR ARQNRQSAAE EIERLNVALA ESRHEVTEQE LAARQAQRDL TAASTELART QATLSALSDQ LSRMQAERMS ERQQSEQRID VLSRQVQTQA AEQAELQERL AQERRAAAEK LALIDQAQVQ LQQAFQALSA DALRANNESF LKLAEENLAR FQAGAAQDLN KRQEAIVQMT QPIRERLEQF DVKLNSLEQA RTNAYGAMNQ QINDLLQIHL PKLHRETADL VRALRQPQTR GRWGEVQLKR VVELAGMLEH CDFEEQVSQS DTGGRLRPDM IVHLPGGRQV VVDAKAPLNA YLQAMEAPSD EARAAALQDH ARQVRTHISQ LSKKEYFDQF SPTPEFVVLF VPGEVFFSAA LMQDPTLIEF GAEKRVIPAS PTTLIALLKA VSYGWRQEAL AKNAQEMAEL GRDLYERMGT LAQHWQKVGK HLDQAVGAFN QSVGSLEGRV LPTARKFREL GAVRSDKEDL PSLTPLTTET RPLTAQELTG PDSAAAITQK D
|
| |
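The relationship between the gene and protein sequences above can be checked with a small translation routine. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ only in permitted start codons), it strips the spaces that separate the 10-base blocks, and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G ordering.
# Table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGTTCAT TCTTCGCGCA ATTGCCCAAT"))  # MSSFFAQLPN
```

The printed residues match the first block of the protein sequence listed above; passing the full gene sequence to `translate` would allow the same comparison for the whole record.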