Gene Apar_1191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1191 
Symbol 
ID8414069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1333903 
End bp1335537 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content52% 
IMG OID645022785 
Productmalate dehydrogenase 
Protein accessionYP_003180210 
Protein GI257784993 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.16614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0761619 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGG GTTTTGAACT ACTGAACGAT CCCTTCCTCA ACAAAGGAAC CGCTTTTTCG 
CAGGAGGAAC GACAGAAGTA TGGTCTGGTA GGTCTTTTAC CTCCAAACAT TCAGACCATC
GAGGAGCAAG CAGAGCAGGC ATATGTGCTT TTCCAGCAGT ATCCTGACCT CGAGACTAAG
CGTCACTACC TGATGAGGCT TTTCTCCGAA AACCGCACGC TTTTCTATAA CCTGTTCTCA
AAGCACGTCG AAGAGTTCAT GCCTATCGTC TACGATCCAA CCATTGCACC TGACATTGAA
CAGTACTCCC TGCGCTACGT TGACTCGCAG TACGCATGCT TCCTATCCGC TGATCACCCT
GAGGACCTTG AAACTTCGCT GAAGGACGCA GCCGCAGGTA GAGACATTGA CCTGATTGTC
GTTACCGACG CAGAGGCCAT TCTGGGCATT GGTGACTGGG GCACCAACGG TGTCGAAATT
TCCGTCGGCA AGCTCATGGT TTACACCGCT GCAGCTGGCG TTGATCCAAA CCGCATCATG
CCTGTTGTCA TTGACGCAGG CACCAACCGC CAGGAGTTGC TCGATAACCC TCTCTACCTG
GGCGAGCGTC ACAAGCGCGT TGATGAGGAC CGCTACAACG CCTTCATTGA TAACTTTGTA
ACCACCGTGG AGCAGCTCTT CCCTAACCTC TACCTGCACT TTGAGGACTT CGGACGCTCA
CACGCTGCAG CAATCCTGGA CCGCTACAAA AACACCTACC CCGTCTTTAA CGACGACGTC
GAGGGCACTG GCATTGTTAC CCTCGCAGGC ATCCTCGGCG GCCTCAACAT TTCTGGCGAG
AAACTCGTTG ACCAGGTATA TCTCTGCTTT GGCGCAGGAA CCGCCGGTTG CGGCATTGCT
GAGCGCGTAC TACAAGAGTT TGTGGACCAG GGAATGGATC GCGAGGAGGC TCGCAAGCGC
TTCTACCTGG TAGATCGCCA GGGCCTGCTC TTCGACGACA TGGACAACCT TACTCCACAG
CAAAAGCCCT TTGCTCGCAA GCGCTCAGAA TTTGCAAACG CTGATGAGCT CACTAACCTT
GCAGCTGTTG TAAAGACGGT CCATCCAACA ATCATGGTTG GCACCTCTAC CGTTCACGGT
GCCTTCACCG AGGAGATTAT TAGCGAGATG GCAGCTCATT GTAAGCGCCC AATGGTCTTC
CCTCTATCCA ACCCAACTAA ACTTGCAGAG GCAGCCGCTC AGGACCTGCT GACCTGGACT
GATGGTCGCG CGCTTGTAGC ATGCGGTGTC CCATCCGATG ATGTTGAGCT CAACGGTATA
ACCTACCAGA TTGGCCAGGC CAACAACGCT TTAATCTATC CGGGTCTTGG ACTTGGCGTT
CTTGCATCAA AAGCTCGCCT ACTCACAGAC CAAATGATTT CGCTGGCAGC TCACTCACTT
GGCGGAATTG TTGACACCAC AAAGCCTGGT GCTGCAATTC TTCCTCCAGT CTCCAAGATT
ACTGAGTTCT CTGAGCGTAT TGCCATTGGT GTTGCAGAGG AAGCAATCAA GCAAGGTCTA
AACCGCAAAC CAATCGCTAA TGCAAAAGAG GCAGTTGATG CCCTCAAGTG GTTCCCTGTC
TACAAAGAAC TCTAA
 
Protein sequence
MKTGFELLND PFLNKGTAFS QEERQKYGLV GLLPPNIQTI EEQAEQAYVL FQQYPDLETK 
RHYLMRLFSE NRTLFYNLFS KHVEEFMPIV YDPTIAPDIE QYSLRYVDSQ YACFLSADHP
EDLETSLKDA AAGRDIDLIV VTDAEAILGI GDWGTNGVEI SVGKLMVYTA AAGVDPNRIM
PVVIDAGTNR QELLDNPLYL GERHKRVDED RYNAFIDNFV TTVEQLFPNL YLHFEDFGRS
HAAAILDRYK NTYPVFNDDV EGTGIVTLAG ILGGLNISGE KLVDQVYLCF GAGTAGCGIA
ERVLQEFVDQ GMDREEARKR FYLVDRQGLL FDDMDNLTPQ QKPFARKRSE FANADELTNL
AAVVKTVHPT IMVGTSTVHG AFTEEIISEM AAHCKRPMVF PLSNPTKLAE AAAQDLLTWT
DGRALVACGV PSDDVELNGI TYQIGQANNA LIYPGLGLGV LASKARLLTD QMISLAAHSL
GGIVDTTKPG AAILPPVSKI TEFSERIAIG VAEEAIKQGL NRKPIANAKE AVDALKWFPV
YKEL