Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0027 |
Symbol | |
ID | 8533140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 32408 |
End bp | 33346 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646382406 |
Product | protein of unknown function DUF519 |
Protein accession | YP_003261940 |
Protein GI | 261854657 |
COG category | [R] General function prediction only |
COG ID | [COG2961] Protein involved in catabolism of external DNA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0764421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATA TCATCGTCAA TCGTTATGCC TGTTTGAACC TCTGCCTGGT AGAAACCATG AATTACCAAC ATCACTTTCA CGCCGGTAAT CATGCCGATG TCTTAAAGCA CTTGGTGCTG CTTCAACTCA TCGAGCTAAT GCAGCAAAAA CCGACCGGAT TTTTGCTGCT CGAAACCCAT GCTGGCGCGG GTTTGTACGA TCTGCAAGCT ACCGAAGCCC GGCGCAGCGA TGAAGCATCT GGCGGTATTG CGCGGCTTTT ACAAGCGACA CAAGCCGCCG ATACCGTGCC GGTTCTGATT CAGACCTATC TTAAGCAAAT CGAACAATTT GGAAGCGTCC CTAATTTAGG CTATTACCCC GGCTCACCGC TGTTGGCCGT CTGCGCCCTG CGCCCGCAAG ATCGTTATAT CGGGGTTGAA CTGGTGCCCA AGGTCGCACG GGAGTTAAGT CGCAATCTCG CTCAGCGCCC CATGCTGGAG CCTTGCATTC CCGACCGCAG GGTCATCGCA CGGGATGGCG AGGGCCTGGC CGCGCTTAAA GCCGATTTGC CGCCCCTGGA GCGGCGCGGT TTGTTCCTGA TCGACCCGCC CTATGAACAG CCCCAAGAGC GCGACGATAT CGCCGCCGCC TTGCAAGCCG GATTGCAACG GTTTGAAACC GGAGTTTATG CCCTGTGGTA TCCGATCAAG CAACGCCCCT ACCTCGATCG GTGGCTCAAC CGGATTGCCA AGAGCACGCC TCGTCCGGTA CTGACCATCG AAAATAGTAT TTTCCCCGAT GAATCGGGCA ATCGGCTCAC CGGCTCCGGC CTGTTGATTA TCAACCCGCC GTGGCAGTTC GATACACTGA TGCAACCCGT GCTCGATTTC GTTAACGACG CGCTCAAGCA GGATACCGCC GCGCCCCGTG CGATTCGTTG GCTGAATCCG GCCCAGTAA
|
Protein sequence | MMNIIVNRYA CLNLCLVETM NYQHHFHAGN HADVLKHLVL LQLIELMQQK PTGFLLLETH AGAGLYDLQA TEARRSDEAS GGIARLLQAT QAADTVPVLI QTYLKQIEQF GSVPNLGYYP GSPLLAVCAL RPQDRYIGVE LVPKVARELS RNLAQRPMLE PCIPDRRVIA RDGEGLAALK ADLPPLERRG LFLIDPPYEQ PQERDDIAAA LQAGLQRFET GVYALWYPIK QRPYLDRWLN RIAKSTPRPV LTIENSIFPD ESGNRLTGSG LLIINPPWQF DTLMQPVLDF VNDALKQDTA APRAIRWLNP AQ
|
| |