Gene Information |
Locus tag | Hneap_1916 |
Symbol | |
ID | 8535074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2048489 |
End bp | 2050291 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 646384297 |
Product | 5'-Nucleotidase domain protein |
Protein accession | YP_003263785 |
Protein GI | 261856502 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
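
The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2048489, 2050291)
print(length_bp)                  # 1803
print(protein_length(length_bp))  # 600
```

Both values agree with the Gene Length (1803 bp) and Protein Length (600 aa) fields.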
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATTTAT CTCGTCGGGA ATTCCTGCAA ATGCTCGCGG CAGCATCTGC AGCGGGTATC AGTCTTGAAA CCGCCCAAGC GGGCACTGGC AATCAAACTG TTGCCAATAC CGTCAAGGTG AACCGTGCCT CCGGCAACAT GTACGAAGTG CCCAAATTCG GTAACGTGCA CATTCTGCAT TACACCGATA CGCACGCGCA GCTCGACCCG ATCTATTTCC GTGAGCCGAG CATCAATCTC GGTGTTGGCT CAATGGAGGG GAATGCACCT CATCTCGTCG GCGAGGCCTT CCTGAAGCAT TTTGGGTTGA AGCCAAATAC GCCAGAGGCG CATGCCTTCA CCTGCCTGAA CTACGTTGAG GCCGCTAAAA AATTCGGTAA GGTCGGCGGT TACGCGCACC TTTTGACCCT CGTTAAGCAC ATGAAGGCCC AGCGTCCAGG CGCTTTGGTC CTTGATGGTG GTGATAACTG GCAAGGTACC GGTATGGCGT TGTGGACAAA CGCCCAAAGT CAGATCGATA CGCAAAAACT GTTTGGTCTG GATGCGTTTA CATCGCATTG GGAAGCGACC TACGGCAAAG ATCGCATGAT GGACGGCATC AAGCAGCTTG AAGCCGCTGG CAACATGACA TTCGTTGCTC AGAACATTCG CAACGAAGAC TTTGAAGATC GCGTATTCAA GCCGTACATC ATTCGTGAAC AGAATGGCGT CAAGGTTGCC ATTGTGGGCC AGGCATTCCC GTACACACCT ATCGCCAACC CGCGTTGGAT GACCGAAGGC TGGACATTCG GCATCAACCC GGAAGACATG CAAGACGTTG TCAACAAAGC CCGTAAAGAA GGCGCTGAGT GCGTTGTTGT CCTGTCGCAC AACGGCATGG ACGTTGACTT GAAGATGGCT GCACAAGTAA CGGGCATCGA CGCCATTATG GGTGGCCATA CCCACGATGC GATTCCGCAT CCGACCATCG TCAAAAATGC CGGTGGCAAA ACAATCGTCA CCAACGCAGG CTCGAATACC AAGTACCTCG GCGTGTTGGA CATGGATTTC AAAAACGGCA AATTACAGGA TTTCAAATAT CACCTGTTGC CGGTCTTTGC TGACTTCATC GAGCCAGATA AAGAAATGCA GGCGCTTATC GACAAGCTGC TTAACCAAGA CGTTACCTTC CAAGGTAAAA CCTTCAATGT TAAGAAACGT TACGACGAAG TGGTGGCCAC CAACGACAGC CTGTTGTACC GCCGTGGTAA CTTCACCGGT TCATGGGATC AACTGATCTG TCAGGCCATC ATTGACCAGA ACGACTGCGA AATCTCATTC TCTCCAGGGG TTCGTTGGGG TACTTCTCTT ATTCCGGGCG AGCCGATCAA ATACGGCGAT CTGATGACCG AAGTCGGTTT GACCTATCCA AACGTCACCG TGAATGAGTT CACCGGCGAG CAAATCAAGG GTATTCTCGA AGACGTCTGC GATAACATCT TCAACAAAGA CCCGTACTAC CAGCAAGGTG GCGACATGGT TCGCGTTGGT GGCATGACCT ATGCCTGCGC GCCGAACGAA GTCAAAGGCA AGCGTATTTC CGAAATGGCG CTCGGCGGCA AACCGCTGGA TCCGAACAAG AAGTACAAGG TCGCTGGCTG GGCTTCCGTT GCCAAACCGG GTGAAATTCC GGGCGACACC GGCAAGATGA TCTGGGATCA GGTTGAAACT TGGTTGCGGG ACAAGAAGCA CATCAAGCCC GTCAAGATCA ACGAGCCGCG TTTGATCGGT GTTAAAGGTA ATCCAGGTTT GACTAACTGC TGA
Protein sequence | MNLSRREFLQ MLAAASAAGI SLETAQAGTG NQTVANTVKV NRASGNMYEV PKFGNVHILH YTDTHAQLDP IYFREPSINL GVGSMEGNAP HLVGEAFLKH FGLKPNTPEA HAFTCLNYVE AAKKFGKVGG YAHLLTLVKH MKAQRPGALV LDGGDNWQGT GMALWTNAQS QIDTQKLFGL DAFTSHWEAT YGKDRMMDGI KQLEAAGNMT FVAQNIRNED FEDRVFKPYI IREQNGVKVA IVGQAFPYTP IANPRWMTEG WTFGINPEDM QDVVNKARKE GAECVVVLSH NGMDVDLKMA AQVTGIDAIM GGHTHDAIPH PTIVKNAGGK TIVTNAGSNT KYLGVLDMDF KNGKLQDFKY HLLPVFADFI EPDKEMQALI DKLLNQDVTF QGKTFNVKKR YDEVVATNDS LLYRRGNFTG SWDQLICQAI IDQNDCEISF SPGVRWGTSL IPGEPIKYGD LMTEVGLTYP NVTVNEFTGE QIKGILEDVC DNIFNKDPYY QQGGDMVRVG GMTYACAPNE VKGKRISEMA LGGKPLDPNK KYKVAGWASV AKPGEIPGDT GKMIWDQVET WLRDKKHIKP VKINEPRLIG VKGNPGLTNC
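
As a spot check, the start of the protein sequence can be reproduced from the gene sequence. The minimal sketch below assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in its permitted start codons. The `translate` helper is hypothetical and ignores alternative start codons.

```python
# Codon table in the conventional TCAG ordering; amino acid assignments are
# those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAATTTAT CTCGTCGGGA ATTCCTGCAA"))  # MNLSRREFLQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the complete 600-residue protein.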