Gene Apar_1340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1340 
Symbol 
ID8414228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1507460 
End bp1509253 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content48% 
IMG OID645022940 
Productserine/threonine protein kinase 
Protein accessionYP_003180355 
Protein GI257785138 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00115725 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGTGACT CGCGGTCACA GACCACGCAG ACCACATATA TACATGCACC TTCTGCTCAG 
CAGACAACTT TTGCGGCTGA GCCGGAAGTT TTGCTGGATC GATATCGCGT TCTTGCCCGT
AGAGGTAATG GAGGCTTTGG TACTGTTTGT ACCTGCTGGG ATACTCGCTT GCAGAGGCGT
GTAGCTATCA AACGCATGCC ACTTTTAGGT GCTTCCGAAA CTCCTGGTGT GCTGGCCTCT
ACGGTTGATG AAGCGCTCGC AGAGGCGCGA ACTGCCTGCT TATTAGCACA CCCCAACATT
GTGACCGTTC ATGACTTTGA AATTGAAGGT AATTACGCCT ACCTAGTTAT GGAGTTTGTA
GACGGTCTCA ACCTATCTGA GCTCCTAGCT CGTGTTGAAG GCGGCTATCT TACCTACGCT
GAAGCTGCTC ACATGGTCTC TTCACTTGCC AAGGCGCTCC AGTACGCCCA CGATAACGGC
GTTCTTCACC TGGATATCAA GCCAACTAAC ATTATGATTG ACCGTCAAGG TACCGTAAAA
CTTGCCGATT TTGGTATGGC AACGCTTGCT TCGGCTGCTG GTTATGGTGG TGCTCGAGGC
GGCACGGTAG GCTACATGCC TCCTGAGCAA GTTGAAGGCA TGCTTGTTGA CGAACGTGCT
GATATTTTCT CGCTTGCTGT TGTTTTGAGA CAGGCCCTTA CAGGATCTAA TGTATTCTCA
GGCAGAACTG CAAAGGAATC GCTTGATCGT ATATACAAAG GTCCAAAGAT TCCTCTGTTA
AAAGAAGATC CTGAAGTTCC GTTTGCTGTA GATGCGGCCT TGACGCAAGC ACTTTCTCCT
GAGCCATCTC TGCGACAGGG CAGTATTTCT GAATTCGCGC AAGAAATTGT GACTCCTCTT
GGCAACGAAA AACAGGGCGA GAAAAGCCTC AAAGCCTTAG TAGAACAATC AGAAGAAGAG
ACCGAGACCT GGGATGTCAA GCATCTTCCT CTTTCAATTC GTTTCCCCTG GCTGCCCTCA
GTTGCTGTGC GTGGTACATC TGCACTGGTC ACCGGCGTTC TTCTCGCCCA ATTGTTTCAG
TTTATTGAAC CAGATTCACT CATATTTATT GTCGTGGGCT CCCTTGTGGG AGCAGCAATC
GCCGCTCTTT GGACACCCTT AGGATCCGCG CTTGTCATTG CGTGTACAGC GTACGCTCTA
GCGAGCATTA GTCCTACCAG CACTTCATTC CCATTTGCAA CGCTTGTAAC CTTAGTGAGC
GTTATCTGGT GGGCATTTGC GGGAAGAGCC TCAAAACTCA GCAGTATTAA CTGCTTGCTA
GGTGCGTTAT TACCTACTCC TGTTTCGGCA CCAGCTCTTG CCTCTGCTAC TATGCGACCT
TTGCCTGCGG TTTTGACAGG AGCCTTCAGC TATCTCTTTG GAACTCTGCT TATTAAAGGC
ATGTCTTTTG GATTTGCTGC AACACCTCTT GCCTATGACT ACACCTCACT AGCCGCAGGT
CTTCCTTTCT GGATCCGTTT TGGAACATGC GCACTATCTG CTCTTTTAGG CTCACTGATA
AGTCAAAAAA GACGTCGCGG ATGGATGATT TTTGGACAAA TTGTCTGCGC TACAGTACTT
TCAGGTGGCT TTATATGGGC AGCTTGGATG GAGAATCCCA ATTTTTGGGT AGTTGAAAGT
ATAGTTTCAG TACTAATTAC GGTATTCTTA TGTGTGTTTG TATGTATTGC AATTGTCCTG
ATAGGTCCAC TTCAAGCGGA TCAGGAAGGC GAGGAATTAA ATGAGCTTTC TTAG
 
Protein sequence
MSDSRSQTTQ TTYIHAPSAQ QTTFAAEPEV LLDRYRVLAR RGNGGFGTVC TCWDTRLQRR 
VAIKRMPLLG ASETPGVLAS TVDEALAEAR TACLLAHPNI VTVHDFEIEG NYAYLVMEFV
DGLNLSELLA RVEGGYLTYA EAAHMVSSLA KALQYAHDNG VLHLDIKPTN IMIDRQGTVK
LADFGMATLA SAAGYGGARG GTVGYMPPEQ VEGMLVDERA DIFSLAVVLR QALTGSNVFS
GRTAKESLDR IYKGPKIPLL KEDPEVPFAV DAALTQALSP EPSLRQGSIS EFAQEIVTPL
GNEKQGEKSL KALVEQSEEE TETWDVKHLP LSIRFPWLPS VAVRGTSALV TGVLLAQLFQ
FIEPDSLIFI VVGSLVGAAI AALWTPLGSA LVIACTAYAL ASISPTSTSF PFATLVTLVS
VIWWAFAGRA SKLSSINCLL GALLPTPVSA PALASATMRP LPAVLTGAFS YLFGTLLIKG
MSFGFAATPL AYDYTSLAAG LPFWIRFGTC ALSALLGSLI SQKRRRGWMI FGQIVCATVL
SGGFIWAAWM ENPNFWVVES IVSVLITVFL CVFVCIAIVL IGPLQADQEG EELNELS