Gene Apar_1345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1345 
Symbol 
ID8414233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1515012 
End bp1516937 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content51% 
IMG OID645022945 
Productserine/threonine protein kinase with PASTA sensor(s) 
Protein accessionYP_003180360 
Protein GI257785143 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.138962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGTA GAATGCTCGG TGGGCGTTAC CAGGTTCAAG ACAAAATTGG TACCGGTGGA 
ATGGCAACCG TTTACCGCGG ACAGGACCAA GTTCTTGGTC GTACTGTTGC CATCAAAATG
ATGCTGCCAC AGTACGCAAA TGACCCGTCC TTTGCTGCTC GTTTTAAGCA GGAGGCACAG
GCAGCGGCGG CGCTCTCAAG TCCATACATT GTCTCGGTCT ATGACTGGGG CAAAGACGGC
GAGTCTTACT ACATTGTTAT GGAGTATCTG CGCGGTACAG ACCTTAAGAG CGGCATCCGT
AAGCATGGAG CTCTTGATTC CCGTAAGGTC GCTCAGATTG GCTCTCAGAT TGCCCAGGCG
CTCTCTGTAG CACACCGTCA CGACATCATT CACCGTGACA TCAAACCACA AAACATCATG
GTCCAGCCAG ATGGCAACAT TAAGGTCATG GACTTTGGCA TTGCTCGTGC AAAGAACAGC
AGTCTAACCA CCGATAACTC TGTTCTCGGT ACCGCTCACT ATGTTTCTCC AGAGCAAAGT
ACCGGCAAAC CTTTGGGACC AACCACCGAT ATCTATTCCC TGGGTATTGT CATGTACGAA
GCTGCTACTG GTCGCGTTCC CTTTGTTGGC GACGATGCAA TTAGCGTTGC CATGAAGCAA
GTCAATGAGG CTCCTCAGCC ACCATCGCTC ATTAATCCCA ACATTGACCC TGCTCTCGAG
GCAATCATCC TTCGCTGCAT GGAGAAAAAT CCAGGGGATC GTTACCAGAG CGCTGACGAG
CTTGCTCGCG CTCTTCGCGA TTTCATTGCT GGTCGTGCCA CTATTCCAAG TAACACTACT
GTTAGCCCTC GCATTGTGAC GCCGCCTCAG CCAACTTCTA GGCTTGATCG CCGCGGCATC
GAAGGCTCCA ACACCTACAT GACTCGAGGT GGCGATACTG GTCGCTTGAA CCGCGTTCAT
TCTCGTGAAG AAGCTGATGA GCTGGACAAG CGCGAGAATA ATCGTCACAA GCGCAATATT
ATCCTTGGTA TTCTAGCTGC TTTGGTTGTC GTAGCTCTTG CTGGTTTTGC TATTGCACAC
ATCCTCGGCA ACACCTCCCA GCAGTACCTG GTTCCGCAGT TTGTTGGACA AACAAAAGAC
CAAGCAACTC AGGCGCTAAA TGCATCTGGC AGCCACTTTA AACTAGGAAC TGTCACAGAA
AGCTATAGCG ATTCTGCTCC TGAAGGCCAA GTTATCGACC AGACACCAAG CGCTAACAGA
CAGGCTCCTG AGGGAACCAC TGTTAACCTT GTTATCTCTA AAGGCGCTAA ACCTGCTGCG
GCAGTTAAGG TACCCGATCT CACTAACAAG AGTCCATCAG AGGCCGAATC GGCCCTTGCT
GCTGTTGGTC TCAAAGCAAG AAATGGTGAT TCGGTTGAGT CGGACAACGT AGCTGTTGGT
TACGTTGCAA CACAGGAGCC AGCTGCAGGT AGCGACGCTA AGGCTGGCGA CACCATCACC
TATCATTTGT CTTCTGGTAA GGGTAAGGTA GACGTCCCAG ACGTCACGGA GATGACTTCT
GAGCGCGCGT CTGACGTTCT GAAAGACGCT GGATTTAAGG TAGAAACTCA ACAGCAGCCT
TCATCAAGCG TTCCTGAAGG CCGTGTTATC TCCCAATCCC CAGCTAACGG CAAGGCAGAT
AAAGGTTCTA CTGTAACTAT CGTTGTTTCT ACTGGTGCTC AGAGCGGATC TGTTCCAAAT
GTTGTTGGCA AAGACTTTGA GACCGCTCAG ACCACGCTCG AAAACGCTGG CTTCCAGGTC
AATACTGTTT GGGTCTATGA CGACAACGTT GCCACCGGTA ATGTTGTTGG CCAGACGCCA
TCAAGCGCTG TACCTGGCGC TACTATCACC ATCCGCGTAT CCAGTGGACC GCGACCATCA
GTGTAA
 
Protein sequence
MPGRMLGGRY QVQDKIGTGG MATVYRGQDQ VLGRTVAIKM MLPQYANDPS FAARFKQEAQ 
AAAALSSPYI VSVYDWGKDG ESYYIVMEYL RGTDLKSGIR KHGALDSRKV AQIGSQIAQA
LSVAHRHDII HRDIKPQNIM VQPDGNIKVM DFGIARAKNS SLTTDNSVLG TAHYVSPEQS
TGKPLGPTTD IYSLGIVMYE AATGRVPFVG DDAISVAMKQ VNEAPQPPSL INPNIDPALE
AIILRCMEKN PGDRYQSADE LARALRDFIA GRATIPSNTT VSPRIVTPPQ PTSRLDRRGI
EGSNTYMTRG GDTGRLNRVH SREEADELDK RENNRHKRNI ILGILAALVV VALAGFAIAH
ILGNTSQQYL VPQFVGQTKD QATQALNASG SHFKLGTVTE SYSDSAPEGQ VIDQTPSANR
QAPEGTTVNL VISKGAKPAA AVKVPDLTNK SPSEAESALA AVGLKARNGD SVESDNVAVG
YVATQEPAAG SDAKAGDTIT YHLSSGKGKV DVPDVTEMTS ERASDVLKDA GFKVETQQQP
SSSVPEGRVI SQSPANGKAD KGSTVTIVVS TGAQSGSVPN VVGKDFETAQ TTLENAGFQV
NTVWVYDDNV ATGNVVGQTP SSAVPGATIT IRVSSGPRPS V