Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0907 |
Symbol | |
ID | 8534048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 973927 |
End bp | 977067 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 646383292 |
Product | Protein of unknown function DUF2309 |
Protein accession | YP_003262797 |
Protein GI | 261855514 |
COG category | [S] Function unknown |
COG ID | [COG3002] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAC TCCCTTTGGG TAAACGCCTG AAAATTCGTT CCATGGTGCA CATGGCTGCC GAGCCAATCC CCAATTTCTG GCCTATGCGG ACGTTCATTC ACCATAACCC GCTCCATGGT CTGGAGCATC TGCCCTTCGA ACAGGCGGTT CGCCAGGGTG AAAAACTCTT TCACGCACGG GGATTTTTGC CGCGTGAGGA TTACCAGCGC TATCACAAGG AAGGCCGAGT TGATCAAAAC AGCATAAAGC GTGACATAGC CGATTTTATT TCAAAACAAG AAACGCTCAA CGGTTTGGAT TTGGCATCGT TACTTAGCGA CTTGATGTGT TCGGTTAAGA ACAAAGTAAC TAGAACGCGC GCGCTCGCCG ATCATGATGA TGTGTTCCAA GCCTTGCACG GGAAACAACT GGAAAATGCA GAGGCGCTCG ATCTTAAAGC GCTAACTCAA CGCTTATGTG CGCAGTTTGC ACCAGAGCGC CCCCTGTACG AAGCCATCGA TTTGCTGTTC GGCACGCAAA TGGGCACCAC ACTCGATGAA TTGGTAATCA AAAGCTGTCT CGACTTTTTC GATGAAGGCC AATCAACCAT CCAAATGCCC GGCCGCCACC AAGGATTGTT CGCAGCTTGG ACGGCCCTGG CAAAACGTAA TTTACGCCTG TTTTTACGTG GCATGCATAT CAAACAGATC CTCGATCAGG ACGATACGCC AGAGGGCATC ATCGCCTACA TTCTCGACGA ACTGGGCATT GAGGAGGCTC ACTGGGATGG GCTGATTACC CGCGAACTGA CTCGTTTGCA TGGCTGGGCA GGTTTTATTC GCTGGCGCTC CTCTTCCAAG CACTACTATT GGGCCGAGCA GTACCCGGGG GATCTCATCG ATTTCCTGGC CATCCGGCTC GTTTTAGGCT TGGCCTTGAT CCGTGAACAT AGCCGTCAAA AGCGCACACC GATGACAGTC AAAGTGCTGC AAGAATATAT CGAAGGGCAC ACCGCCGAAT GCTATCTGCG TCAAGCCTAT TACGGTGGCT GCATATTGCC CGCGTTTGCT CATGATGTTG ATGATGCGCT GTCGCATAAA AAGCCTCAAA GGATCAACAA CATTCTTCCG GGCTACCTGC GCCAACAACG CCAATTCGAG GCAACACGAC AAGCTGATGC GCTTCGTGAT TTAGCCAGCA AAGCGGGGCA AACCGATGCC CTTATGGCGC TGAATGCGCC TCAAATCAAG CAACTCATGA CACTTATCGA GGCGTTTGAA AACGAAGAAG GCATGATCTG GCTTCGTGCG ATGGAATCGG TCTATCGACG GGAAATCATC AACCAGATTC AACTGTATGC ACCGCATAAA AAAGAAAAAC GGCCCTTTGC CCAGGCATTG TTCTGTATCG ATGTGCGCTC CGAGCCGATA CGCCGTAATC TGGAAACGGT AGGCGAGTAT CAAACCTATG GTATCGCCGG GTTTTTTGGT GTTCCGGTAA GCTATATTGG CCTTGGCAAG GGCAGTGAAG TTAATCTTTG CCCGGTGGTC ATTACCCCTA AAAATCTGGT GCTTGAAGTG CCCGTGGGTG CCACAAGCAT TGAAACAGAC TTTTATTCTT CCGCCGACCA TGTGCTACAT GAGATGAAAA GCTCGATCCT TTCACCCTAC TTCACGGTTG AAGCGGCCGG TTTGCTGTTT GGTTTCGACA TGATCGGCAA AACCATTGCC CCGCGACGCT ACACCCAAAT ACGCAATCAT ATCGAACCAA AAGCACAGGC AACTCGTTTG CTGGTGGATA AACTCACCCG CGAACAAGCC GACTCAATCG TCCGTTCGCT GCAACGCGCC ATGATTGTGC GCGCCATTCA TCAGGAATTT GGCATCGAAC GCGAAGCAGT CACCGATGCC ATGATCCGCG AACTGCGCGA AGCGGCCATG GATAACTATC ACGAACAGAC CGAATTCGCG CGCCGTTTCG CCTTGAGCCC AACGGCCGAA ACTCAGTTTA TCGCAGGGCT GAAAAAGGAC TATAAAATCA ATCGCTCATT TGTTTCCATG CAAATGGAAC GCTTGGCCAG GATTGGTTTT AGCCTCGATG AACAGGTGTT TTATGTTGAC AAGGCACTCA CATCCATCGG ATTGACCGAA AACTTTTCAC GCTTTGTTTT ATTGGCCGGT CACGGCAGCA CTTCCGACAA TAATCCCTAC GAATCCGCGC TTGACTGCGG TGCATGTGGT GGTAGTCATG GGCTGGTTTC TGCCCGGGTG CTTGCCCACA TGGCCAATAA GCCTGAAGTA CGTCGCAGAC TGGCCAAGCA AGGCATCCAG ATACCTGAAG ATACTTGGTT TGTGTCCGTC ATGCACAACA CCACAACCGA TCAATTGTCA CTGCAAGACC TTGATTTGCT TCCAAACAGC CATCTTGTTT ACCTCGAACG CTTGCGTAAC GGCTTACGTG CGGCCACCCG TTTGTCAGCG GCAGAACGCT TGCCTGCTCT GCTTGATCAT CCTTCGCCCA ACATCGACAC ATTATCGGCA CAAAAACAAA TTGAGCGGAA TGCCAGCGAC TGGACCCAAG TTCGGCCAGA GTGGGGCTTG GCGAGAAATG CGAGCGTCGT CGCCGGCGGC CGACATTTGA CCGAGGGTGC GAACTTAAGC GGTCGAACGT TTTTGCAGTC TTACGATTAT CGACTCGATC CCAAAGGCCG CCACCTTGAA AACATTCTCA GCAACCCGCT AATTATCGGC CAGTGGATCA ATCTTGAGCA TTATTTCTCA GCGGTAGATA ACGAACACTT TGGCAGTGGC AGCAAGGCCT ATCACAACGT CGTAGGTCGT TTTGGTGTGG TTACGGGTAA TTTAAGTGAC TTGCGAACAG GGTTACCAGC ACAGTCGGTG CTTAAAGATG GACGCCCATA CCACGAGCCC ATCCGTCTCT TGGCGATTAT CGAAGCACCC GCAGCATTCA CCCTCGAAGT AGCGGGTCGA TTGCCCAAGG TGATGTCCCT GATTACCAAC GGCTGGATCA CTGTTGTTGT CGTTGATCCG GAAACGGGCG ATCGTCTTTT TTATGATCGC GGCGAATGGT ACAATCTCAA CAATGATCCG CAGTACACGC CCTCGGTCAA ACCCTTGCTT GAAGAAGAAC TCAGCGCATG A
|
Protein sequence | MSKLPLGKRL KIRSMVHMAA EPIPNFWPMR TFIHHNPLHG LEHLPFEQAV RQGEKLFHAR GFLPREDYQR YHKEGRVDQN SIKRDIADFI SKQETLNGLD LASLLSDLMC SVKNKVTRTR ALADHDDVFQ ALHGKQLENA EALDLKALTQ RLCAQFAPER PLYEAIDLLF GTQMGTTLDE LVIKSCLDFF DEGQSTIQMP GRHQGLFAAW TALAKRNLRL FLRGMHIKQI LDQDDTPEGI IAYILDELGI EEAHWDGLIT RELTRLHGWA GFIRWRSSSK HYYWAEQYPG DLIDFLAIRL VLGLALIREH SRQKRTPMTV KVLQEYIEGH TAECYLRQAY YGGCILPAFA HDVDDALSHK KPQRINNILP GYLRQQRQFE ATRQADALRD LASKAGQTDA LMALNAPQIK QLMTLIEAFE NEEGMIWLRA MESVYRREII NQIQLYAPHK KEKRPFAQAL FCIDVRSEPI RRNLETVGEY QTYGIAGFFG VPVSYIGLGK GSEVNLCPVV ITPKNLVLEV PVGATSIETD FYSSADHVLH EMKSSILSPY FTVEAAGLLF GFDMIGKTIA PRRYTQIRNH IEPKAQATRL LVDKLTREQA DSIVRSLQRA MIVRAIHQEF GIEREAVTDA MIRELREAAM DNYHEQTEFA RRFALSPTAE TQFIAGLKKD YKINRSFVSM QMERLARIGF SLDEQVFYVD KALTSIGLTE NFSRFVLLAG HGSTSDNNPY ESALDCGACG GSHGLVSARV LAHMANKPEV RRRLAKQGIQ IPEDTWFVSV MHNTTTDQLS LQDLDLLPNS HLVYLERLRN GLRAATRLSA AERLPALLDH PSPNIDTLSA QKQIERNASD WTQVRPEWGL ARNASVVAGG RHLTEGANLS GRTFLQSYDY RLDPKGRHLE NILSNPLIIG QWINLEHYFS AVDNEHFGSG SKAYHNVVGR FGVVTGNLSD LRTGLPAQSV LKDGRPYHEP IRLLAIIEAP AAFTLEVAGR LPKVMSLITN GWITVVVVDP ETGDRLFYDR GEWYNLNNDP QYTPSVKPLL EEELSA
|
| |