Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0535 |
Symbol | |
ID | 8533662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 568470 |
End bp | 569507 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646382916 |
Product | protein TolA |
Protein accession | YP_003262436 |
Protein GI | 261855153 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain [TIGR02794] TolA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000935471 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCCGCT GGTTTACACG TATGCCGTTG TCCATCCTGT TGGCGATTGC GCTTCATTTG TTGATCATCT TGTCTGTGCT GATTGCCTGG AAATTCAATG CTTTTTCAGC GGCCTCGCAG TCGTCAAGCA CTTCGAATGA GCCGGTCATT CAGGCGCAAT CCGTCTCGCA AGCCGCAATT GATCAGCAGG TCAATCGGCT TAAGCAGGCT GATCAACAGC GGGAAAAGTC GCTCAAGCAG CAGCAAACAG AGGCCCAGCA GGCCGCTGCC GCTCGTCGAG CGGAGCAGGC TCGTCTGCAA CAGTTAGAAG CGATGCGCGA AGCCAAGCAG AAGGCGGCTG CCGCACAAGA GCAACAGTTG AAAGCCCTTC AGGATGAACA GAAAAAGGCT CAGGAATCCG TGCAGGCGCA AAAGCAGGCG CTAGCAAAGA TGCAGGAAGA AGCCGCCAAG GCAGCCGCTG AAAAACAAGC GGCTCAAGCG GCTGCTGCCA AAGAACGGGC CACTGCGGCT GCGGCCAAGC AGGCAGCGGC TGAAGCAGCG GCGCAGGCGG AACAAGCTAA GAAACAAGCA GCGGCTGAAA AAGCCGCAGC CGAAAAGGCG GCCAAGGAGC AGGCGGCCAA AGAGAAAGCC GCGAAGGCCG CTGCCGACAA GGCTGCAAAA GAAAAAGCGG CCAATTTGGC TGCCCAGAAG GCGGCTCAAG CGCGCAAAGC GGCCTTGCAA CAGCAGCTCC AGGATGAGTT GAGTGCAAGC CAGGCTCAGG GAATTCTGGC AGCCTACGCC GCCGCCATTC AGCAAAAGGT GACCGCTCAG TGGTTCAAGC CGCCGGGCTG GCAGCCTGAC TGGACATGCG ATGTTCGAAT TTCGCAGGCC AAAGATGGTA CGGTGCTCAA TGTCAAAATA TTGCAGTGCG ACGGTGACCA ATTGTTCCAG CGATCTGTTC AGCAGGCCGT TGAGCGCGCG TCGCCTTTGC CTCTGCCTTC GGATATGTCG TTGTTTCAAT CGACGATCAA TTTCAAGTTC AGGGCGAACA CACAATAG
|
Protein sequence | MFRWFTRMPL SILLAIALHL LIILSVLIAW KFNAFSAASQ SSSTSNEPVI QAQSVSQAAI DQQVNRLKQA DQQREKSLKQ QQTEAQQAAA ARRAEQARLQ QLEAMREAKQ KAAAAQEQQL KALQDEQKKA QESVQAQKQA LAKMQEEAAK AAAEKQAAQA AAAKERATAA AAKQAAAEAA AQAEQAKKQA AAEKAAAEKA AKEQAAKEKA AKAAADKAAK EKAANLAAQK AAQARKAALQ QQLQDELSAS QAQGILAAYA AAIQQKVTAQ WFKPPGWQPD WTCDVRISQA KDGTVLNVKI LQCDGDQLFQ RSVQQAVERA SPLPLPSDMS LFQSTINFKF RANTQ
|
| |