Gene Apar_0660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0660 
Symbol 
ID8413520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp736032 
End bp737342 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content43% 
IMG OID645022237 
Productshikimate kinase 
Protein accessionYP_003179680 
Protein GI257784463 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase
[COG0703] Shikimate kinase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.737166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTA AAGCTGAGCA GCAAATCTCA CATAACTCTA CAACTCCCCT TGCCCCTTTT 
GGACTTATTG GCGAGAAACT CTCTCACAGC TGGTCAGCAG AAATTCATAA AAAATTAGGA
TCATTTCCTT ACGAACTCCA TGAGCTCTCC ACTTCAGAGC TGAAGGCTTT CCTCAAAGAC
CAACCCTGGA GAGGTTTAAA CGTTACCATC CCTTACAAAA AGGAAGCCTG TGCTCTTGCA
GACTCAGCTT CTGAGGATGC CCGAGCTATA GGTGCTGCAA ATACACTGGT AAAAGACGAG
AGTGACTACA TAACTGCAGA CAACACTGAT GTGTATGGCT TTGAATACCT TATTAAAAGC
CTCAAGATCA GTTTGAACCA AAAGAAGACG CTTGTTTTAG GGGCCTTTGG CGGTGCTGGT
CAGGCAATTT GCTATGCACT CAAAAAACAT GGTGCGTATG TTGTGGGCGT CTCTAGAAAC
TCTCAAGTAA GCTCATCATT TGTTGACTGT ACAATCACAT ATGACCTGCT TAAATTTCAC
TATGATGCGG TTCTTCTCGT CAACGCAACA CCAGTAGGAA TGTCTCCTCA TGCTGGCATC
TCCCCTCTTT CAAAAGAGGA ACTAACTTCT TTTGCTTCTT TACAGTGCGT GATTGATCTT
ATCTATAATC CGCTACGCTC ACAGCTTCTT TTAGATGCAG AAAGCCTTGG ACTACTCAAT
GTAAACGGTC TAAAAATGCT GGTAGCGCAA GCAGCCAAGG CTAGCTCGCT ATTTTTAGGT
CAAGAAGTAT CAGACCTCCA GATAGAAAAA ATCTCGCAAG AGATTCATTC TTCAAAAGAG
AACATTGTGC TCATTGGTAT GCCCGGAGTA GGAAAGACTT CTACAGGAGA AGCACTTGCA
AAGCTCTTAA ATCGGCCTTG GATAGACACT GATTTTCTCA TTGAACAAAA GGTCCACTGC
AACGCAGCAA CTTATCTACA AACATATGGT GAAGCAGCAT TTAGGAGTCT AGAGCATGAA
ATAATTCAAG AAATTTCTAG TATGACCGGC GTGGTAATTT CTTGCGGAGG TGGCGCCGTT
GTTACACCTT CAAACTACCA ACTGCTTCGT CAAAATGGAA AACTTATCTA TCTAACTCGT
CCGCTTGAAA ATCTAGCTAT TGCAGGAAGA CCTCTCTCAC AAAGCATTGG TATACAAGAG
CTTGCTAAAG AAAGGCTACC TATCTATGAA GCATGGGCAA ATCTAACTTT TCCCTGTCTT
GGTAGCCCCG ATGCAGATGC CAAAGGGCTT CTCGCCACAG TACTTAAATA A
 
Protein sequence
MSFKAEQQIS HNSTTPLAPF GLIGEKLSHS WSAEIHKKLG SFPYELHELS TSELKAFLKD 
QPWRGLNVTI PYKKEACALA DSASEDARAI GAANTLVKDE SDYITADNTD VYGFEYLIKS
LKISLNQKKT LVLGAFGGAG QAICYALKKH GAYVVGVSRN SQVSSSFVDC TITYDLLKFH
YDAVLLVNAT PVGMSPHAGI SPLSKEELTS FASLQCVIDL IYNPLRSQLL LDAESLGLLN
VNGLKMLVAQ AAKASSLFLG QEVSDLQIEK ISQEIHSSKE NIVLIGMPGV GKTSTGEALA
KLLNRPWIDT DFLIEQKVHC NAATYLQTYG EAAFRSLEHE IIQEISSMTG VVISCGGGAV
VTPSNYQLLR QNGKLIYLTR PLENLAIAGR PLSQSIGIQE LAKERLPIYE AWANLTFPCL
GSPDADAKGL LATVLK