Gene Apar_1040 details


Gene Information

Locus tag: Apar_1040
Symbol:
ID: 8413913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1174502
End bp: 1175944
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 48%
IMG OID: 645022629
Product: pyruvate kinase
Protein accession: YP_003180059
Protein GI: 257784842
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0469] Pyruvate kinase
TIGRFAM ID: [TIGR01064] pyruvate kinase
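
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1174502, 1175944)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Both values match the Gene Length and Protein Length fields of the record.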


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCGCA AAAAGACCAA AATTGTTTGT ACTATGGGTC CTGCAACAGA AAGTGATGAG 
GTTCTTCGTG AACTCATCCT TGCTGGCATG AACGTCGCAC GTTTCAACTT CTCGCACGGT
AGCCATGAGT ATCACCGCAC TATGATTGGT CGCGTTCGTT CTATCTCTGA TGAACTTGGT
ATTCCTATTG CTATCATGCT CGATACAAAA GGTCCTGAGG TTCGCACCGG TCTTCTCGAG
GATGGCAAAA AGGTCACTCT TACCACCGGT GAGTCCGTCA TTGTTACCAC TGATGATGAC
GTCATCGGCA ACGCTCAGCG TTTCTCACTT GACTACAAGA ATCTTCCAAA AGAGGTTAAG
AAGGGCTCTA TCATCCTTAT TGATGATGGC CTTATTGGTC TTGAAGTTGA TCACGTTGAG
GGCACTGATA TGCACTGCAA GATTATCAAC GGCGGTGAGC TTGGAGAGAA GAAGGGTGTA
AACGTTCCTA ACGTTAATAT CGGTCTTCCT TCCGTTACTG AGCAAGACCG CGCAGACATC
ATGTTTGGCT GCGAGCTTGG TATTGACGCA ATTGCTGCTT CCTTCATCCG CGATGGCGCT
GCAGTTGAAG AGATTCGTAA CATCTGCCGT GAAATGGGCA CTCCTAACGT ACAGATCTTC
CCTAAGATTG AGTCTGCTCT TGGCGTAAAG AACTTTGACG AGATTCTTGC AGCTTCTAAC
GGCATCATGG TTGCCCGCGG TGACCTTGGT GTTGAGGTTC CTGCAGCTGA GGTTCCTCAC
ATCCAGAAAA CCATCATCAA GAAGTGCAAC GATGCTTACA AGCCTGTCAT CACCGCAACT
CAGATGCTTG ACTCCATGAT TCGCAATCCT CGTCCAACCC GTGCAGAGGT TACAGACGTT
GCTAACGCAA TTTATGATGG TACTGACTGC GTCATGCTTT CTGGTGAGTC TGCAGCTGGT
AAATATCCTG TTGAAGCAGT TAAGACTATG GCTTCCATCT GCAAGGAAAC CGAGAAATAC
CTGCCAGTCA AGGATGTCTA TCACCAGCGT GAGGGCCTTA AGAACGTCAA TGGTGCTACT
GGTTTTGCTG CCGTTGATAT GGCAATTCGC GTTGATGCTA AGTGCATCAT TTGCCCAACA
CACTCTGGCC GCACTGCTCG CCTTGTTTCT AACTTCCGTC CAAAGCGTCC TCTGTATGCA
ATGTCTCCAT CCGATGAGGC AGTTCGTCGT ACCTGCTTCT ATTGGGGTGT CTATGCATTC
AAGACCTCTG AGCAGGGCAC ACTTTCTAAC ACCATGTACA ACGCTTTGAA CGTTGCAAAG
GAAGTTGGCG TTGTAGAGAC TGGCGATATT GTTGTTCTGA CCGCTGGCGA TCCTCATACC
AGCCCACGTC TTGGTGACTA CACCACTTCA ACTAATGTTG CTATGATTTG CCAGGTTCAG
TAA
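
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. The sketch below is one possible way to do that and to compute a GC fraction and a basic coding-sequence sanity check. The helper names are hypothetical, and the start-codon test is deliberately simplified to ATG only; translation table 11 also permits alternative start codons, which this check would reject.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def is_complete_cds(seq: str) -> bool:
    """Simplified check: length is a multiple of 3, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First line of the gene sequence in this record.
first_line = "ATGGCTCGCA AAAAGACCAA AATTGTTTGT ACTATGGGTC CTGCAACAGA AAGTGATGAG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.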
 
Protein sequence
MARKKTKIVC TMGPATESDE VLRELILAGM NVARFNFSHG SHEYHRTMIG RVRSISDELG 
IPIAIMLDTK GPEVRTGLLE DGKKVTLTTG ESVIVTTDDD VIGNAQRFSL DYKNLPKEVK
KGSIILIDDG LIGLEVDHVE GTDMHCKIIN GGELGEKKGV NVPNVNIGLP SVTEQDRADI
MFGCELGIDA IAASFIRDGA AVEEIRNICR EMGTPNVQIF PKIESALGVK NFDEILAASN
GIMVARGDLG VEVPAAEVPH IQKTIIKKCN DAYKPVITAT QMLDSMIRNP RPTRAEVTDV
ANAIYDGTDC VMLSGESAAG KYPVEAVKTM ASICKETEKY LPVKDVYHQR EGLKNVNGAT
GFAAVDMAIR VDAKCIICPT HSGRTARLVS NFRPKRPLYA MSPSDEAVRR TCFYWGVYAF
KTSEQGTLSN TMYNALNVAK EVGVVETGDI VVLTAGDPHT SPRLGDYTTS TNVAMICQVQ
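
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard amino-acid string in TCAG order, which translation table 11 shares with the standard code for elongation codons. It does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCTCGCA AAAAGACCAA AATTGTTTGT"))  # MARKKTKIVC
```

The output matches the first ten residues of the protein sequence listed above.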