Gene Apar_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1081 
Symbol 
ID8413954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1224200 
End bp1225240 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content41% 
IMG OID645022670 
Producthistidine kinase 
Protein accessionYP_003180100 
Protein GI257784883 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.837913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTC TTAAGTATTT AAAAAATCGA ACTATTTTCT TTCTTGCGCT CATAGTAAGC 
AGCACGTTAA TTGGCTTTAT GCTTAAAGGC ATCGGACTTA ACGTGTTTGC TATTGCTTAC
TGTATTTTGG TTTTATGGGT AATAGGAGCG GCTGCACTTC TTGCTGATTA TTTTCATCGT
CGTTGGTTTT ACCAGCAGCT TGCTGATAAT TTAGCAGCAA CTGACAAAGA GAGTTACCTG
GTTCCTACTA TGCTGGACGA ACCTTCAACG CTTGAGGAGA GACTTTTTAG AGATGCGCTT
TCAAGCATGT CTAAGTCTAT GATGGACAAG CTTGCTGATG AGCATATGCG ATCAAAGGAA
TATCGCGAGT ATATTGAGAT GTGGGTACAT GAAATCAAGA CACCCATCGC TGCAGCTCAT
TTAGTTGCTC AGAATAATCC TTCTTTGCCC ATGGACGATG TACAGACGCA GCTAACTAAA
ATTGAATCGC TGATAGAGCA GGCTCTCTTT TATGCGCGCA GTGCATCTGT GGATCGAGAT
TTTTCTATTA GAGATGTTAA CTTGCAGCTC ATTGTTAAAG ATGCTCTTAA AAACAATACA
CGTGTTTTTA TTGATGCAGG TGTTTCTCCG CATTTTGATG ATTTGGATCA TGTGGTAAAA
GCTGATCCAA AGTGGCTTGA GTTTATTTTG CGACAGCTTC TGGTTAATGC AGCAAAGTAT
TCAAATCCGG AGCTAGCACC TGGCGAGAAG AACGTTTGGA TTTCTACGTC CGTCTCAGAA
CCACGTGAGG GAACCACTTC GACAACTCTT TTTATTAAGG ATAATGGTAT CGGTATCCCT
GAAGAGGATC TTTCTCGTAT TTTTGATAAG GGATTTACCG GTCAAAATGG ACGCAAGTAC
GCCAAATCTA CCGGCATGGG TCTCTATCTT TGTTATGAGC TCTGTAAAAA GATGGGCCTT
AAACTTGCTG TGAGCAGTAC TGTTGGTAAA GGCTCAATCT TTTCTATTAC GTTTAATCAG
TCCTTTACCG ATTTGCATTA A
 
Protein sequence
MSLLKYLKNR TIFFLALIVS STLIGFMLKG IGLNVFAIAY CILVLWVIGA AALLADYFHR 
RWFYQQLADN LAATDKESYL VPTMLDEPST LEERLFRDAL SSMSKSMMDK LADEHMRSKE
YREYIEMWVH EIKTPIAAAH LVAQNNPSLP MDDVQTQLTK IESLIEQALF YARSASVDRD
FSIRDVNLQL IVKDALKNNT RVFIDAGVSP HFDDLDHVVK ADPKWLEFIL RQLLVNAAKY
SNPELAPGEK NVWISTSVSE PREGTTSTTL FIKDNGIGIP EEDLSRIFDK GFTGQNGRKY
AKSTGMGLYL CYELCKKMGL KLAVSSTVGK GSIFSITFNQ SFTDLH