Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2071 |
Symbol | |
ID | 8535230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2217243 |
End bp | 2218322 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646384449 |
Product | protein of unknown function UCP012641 |
Protein accession | YP_003263936 |
Protein GI | 261856653 |
COG category | [S] Function unknown |
COG ID | [COG4307] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACAT TTAAATGCAG CAACTGCGGT CAGATCACTT TTTTTGAAAA TGTGGTTTGT GAACACTGCG GCATGCCGCT CGGTTATATC CCAGAGATAC GGGCCATGGT CGCATTCCAG CCCGATTCCG ATTTGTTCTG GTCACCTGTG AATAGCGCTC TGGCCGAAGC TTATCGCCCT TGCCGGAACT ATGTCTCCCT GCATATATGC AACTGGATGG TGGCCGAATC GGATCAAAGT GTCTGGTGCC GGAGTTGTCG TTTCACCGAG ATGATTCCTG CGCTATCCGT GCCTGAGCAT TTGCAGCGCT GGTTTTTGCT GGAGGCGGCC AAGCGGCGAT TGATTTATTC ATTAGTGCAG ATCGGATTGC CGATACCCGA TCGAGATAGG GACCCCGCAC ACGGGCTGCG TTTCAAGTTT CTGGCGGACA CCCCTCAGGA GCGGGTGCTG ACTGGGCATC TGGACGGGGT AATCACATTG AATATCGATG AGGCGGATGA CGCTACACGT GAGCTCAAGC GTACCCGCAT GCATGAGTCG TACCGCACGT TACTGGGGCA TCTGCGCCAT GAAGTCGGCC ATTTTTACTG GTTGCACCTG ATCGCTGAAA GCCCTTGGCT TGAAGGGTTC AGGGCATTGT TCGGTGACGA GCGACAGGAT TATAAGCAAT CGTTGGATCA GTACTACGCT ACTCAGTCAT CTTCTGACTG GACAGCTGAA TTCATCAGCG AGTATGCCAG CGCGCACCCC TGGGAGGATT GGGCCGAAAC TTGGGCGCAT TATCTGCATA TCGTCGATGC GCTTGATACC GCAGCCAACT GGCACGCGCG GACTAACCGT GCTCAGCCAA GTTCACCGTT TCCCGAGGCA TCAGCGCCGT TGACCATCGA GGCGTTTAAA GCCGCATTGA CGCAAGATTG GCTGCCGTTA GCCTTGTTTC TCAATAGCAT GAACCGCAGC CTCGGCCAGA AGGATAGTTA CCCTTTCGTG ATTCCTGACG CCGTGATCCG CAAGCTCTGT TTTATCCACG AAGTCGTTCT GGCTGCCCGT GGAGCATCGC CAGACGCCCG CGGGTATTGA
|
Protein sequence | MKTFKCSNCG QITFFENVVC EHCGMPLGYI PEIRAMVAFQ PDSDLFWSPV NSALAEAYRP CRNYVSLHIC NWMVAESDQS VWCRSCRFTE MIPALSVPEH LQRWFLLEAA KRRLIYSLVQ IGLPIPDRDR DPAHGLRFKF LADTPQERVL TGHLDGVITL NIDEADDATR ELKRTRMHES YRTLLGHLRH EVGHFYWLHL IAESPWLEGF RALFGDERQD YKQSLDQYYA TQSSSDWTAE FISEYASAHP WEDWAETWAH YLHIVDALDT AANWHARTNR AQPSSPFPEA SAPLTIEAFK AALTQDWLPL ALFLNSMNRS LGQKDSYPFV IPDAVIRKLC FIHEVVLAAR GASPDARGY
|
| |