Gene Phep_3833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3833 
Symbol 
ID8254967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4603118 
End bp4605433 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content44% 
IMG OID644937497 
ProductMalate dehydrogenase (oxaloacetate-decarboxylating) (NADP(+))., Phosphate acetyltransferase 
Protein accessionYP_003094086 
Protein GI255533714 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID[TIGR00651] phosphate acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.701616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.763898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA TTAATAGAAA GCAGGATGCA TTGGATTACC ACTCGCAAGG ACGGCCGGGT 
AAAATCCAGG TTATACCCAC TAAACCAACC AATTCCCAAA GGGACCTGGC GCTTGCTTAT
TCACCTGGCG TAGCAGAACC CTGCTTGAAA ATAGCCGAAA ATACAGAAGA TGTTTATAAA
TATACAGCAA AGGGCAACCT TGTAGCCGTA ATCAGTAATG GTACCGCTGT TTTGGGCCTG
GGCAATATTG GTCCCGAAGC TGGTAAACCG GTGATGGAAG GTAAGGGCCT TCTTTTTAAG
ATATTTGCCG ATATTGATGT TTTTGATCTG GAACTTGATA CCACGAATGT TGATGATTTT
GTAAAAATAG TAAAAGCACT GGAGCCTACA TTTGGGGGAG TGAACCTGGA AGATATCAAA
GCTCCTGAAT GCTTTGAAAT TGAGCGGCGT TTGAAAGCTG AAATGAATAT TCCGGTGATG
CACGATGATC AGCACGGTAC GGCTATCATT TCTGCAGCAG CTTTGTTAAA TGCCTGTGAA
CTGCAGAAAA AGAAGATGGA TAAGATCAGG ATCGTTGTAA ATGGTGCTGG TGCCGCTGCT
ATTTCCTGTT CACGTTTATA TGTTTCCTTG GGGGCTAAAA AAGAAAATAT TGTGATGTGC
GACAGGTCGG GTGTGATCAG GGATAACCGG GAAAACCTGG ACGAGATCAA AGCTGAGTTT
GCAACTTCCA GAAAACTGGA TACACTGGCA GAAGCGATGA AAGATTCCGA TGTTTTTATC
GGTTTATCGT CGGCCGACTG CGTTACAGAA GATATGCTGA AATCGATGGC TAAAAACCCT
ATTGTGTTTG CCATGGCGAA CCCCAATCCG GAGATAGCTT ATGAACTGGC CATTAAATCC
CGTAAAGATA TCATCATGGC TACAGGCCGT TCTGATTACC CCAACCAGGT GAACAATGTA
CTGGGCTTCC CCTATATTTT CAGGGGAGCA CTGGATGTGA GGGCAACTGC AATCAATGAA
GAGATGAAAA TTGCTGCAGT AAAGGCCATT GCGTCGCTGG CAAAAAAATC TGTTCCAGAG
GCTGTAAACA TGGCCTATAA TGAGAAAAAC ATAAAATTTG GTAAGGAATA CATTATTCCA
AAGCCGATGG ACTTGAGGTT AATGACTAAC GTTTCTGCAG CGGTGGCCAG GGCAGCAATT
GAATCGGGGG TTGCCCGGAA AACCATTACC GACTGGGACG CTTACGAGGA AGAACTGAAG
CACCGGTTGG GCATGGACGA TGCCATCATG CGTGCAATCA CGAATAAGGC CAAATCCGAT
CCTAAGCGTG TGGTATTTGC AGAGGCAGAT CATTATAAGA TACTAAAAGC TGCCCAGATT
GTTAAGGATG AAAACATTGC CATCCCGATT TTACTGGGCA ATAAAGAGGT AATCGAAAGG
ATTATTGAAG AAAGTGCACT GGAGCTGGAA GGCGTAACAA TCATAGATAC TTTTAAGGAG
CCTGAACTGA TGCAAAAATA TGGCCAGGCC CTGTATGAAA AAAGACAACG GAGGGGGCTT
ACCTTGTTTG ACGCCACCAA GCTGATGCGT GACAGGAATT ATTTTGGTGC ATCTATGGTA
GAGTTTGGTG AGGCGGATGC CATGATATCG GGACTGACAA GAAATTATGT TTCGACCATT
AAACCTGCCC TGCATGTAAT TGGCACTGCA CCCGGTGTAA ACCGTGTTGC AGGAATGTAC
ATGATGATGA CCAAGAAAGG CCCTGTATTT TTTGGGGATA CTACAGTGAA TGTAGATCCA
ACCGCTGAAG AATTGGTAGA CCTGACCTTG CTTTTGGAAC GTTCTGTGAG TAAATTTAAC
ATCCATCCGC GTATTGCGTT GCTGTCTTAT TCCAATTTTG GTTCTAATGA GGGTGTGGTG
CCTGAAAAAG TTAGAAAGGC TGTAAAAATA CTGCATGATC AGCATCCGCA CATTATGGTA
GATGGCGAAA TGCAGGGAAA TTTTGCCATT AATAATGCCC TGTTAAAAGA TAATTTTCCT
TTCAGCAGAT TGATAGACGG GCCGGCAAAT ACGTTGATCT TCCCTAATCT GGAATCGGGC
AATATTGCTT ATAAGCTTTT GCAGGAACTG GGTGAGGCTG AGGCTATTGG GCCTATTTTA
TTGGGCTTGA ACAAGCCTGT TCATATTGTT CAACTTGGAA GTTCGGTTAG GGAGATTGTA
AATATGGTTA CCTTAGCTGT TCTGGATGTT CAGGGAAAAG AACAAGAGGT AAATCTTAAA
AAAGGCGGAT TGTTAAAAAG AATAGCTAAG AAATAA
 
Protein sequence
MSKINRKQDA LDYHSQGRPG KIQVIPTKPT NSQRDLALAY SPGVAEPCLK IAENTEDVYK 
YTAKGNLVAV ISNGTAVLGL GNIGPEAGKP VMEGKGLLFK IFADIDVFDL ELDTTNVDDF
VKIVKALEPT FGGVNLEDIK APECFEIERR LKAEMNIPVM HDDQHGTAII SAAALLNACE
LQKKKMDKIR IVVNGAGAAA ISCSRLYVSL GAKKENIVMC DRSGVIRDNR ENLDEIKAEF
ATSRKLDTLA EAMKDSDVFI GLSSADCVTE DMLKSMAKNP IVFAMANPNP EIAYELAIKS
RKDIIMATGR SDYPNQVNNV LGFPYIFRGA LDVRATAINE EMKIAAVKAI ASLAKKSVPE
AVNMAYNEKN IKFGKEYIIP KPMDLRLMTN VSAAVARAAI ESGVARKTIT DWDAYEEELK
HRLGMDDAIM RAITNKAKSD PKRVVFAEAD HYKILKAAQI VKDENIAIPI LLGNKEVIER
IIEESALELE GVTIIDTFKE PELMQKYGQA LYEKRQRRGL TLFDATKLMR DRNYFGASMV
EFGEADAMIS GLTRNYVSTI KPALHVIGTA PGVNRVAGMY MMMTKKGPVF FGDTTVNVDP
TAEELVDLTL LLERSVSKFN IHPRIALLSY SNFGSNEGVV PEKVRKAVKI LHDQHPHIMV
DGEMQGNFAI NNALLKDNFP FSRLIDGPAN TLIFPNLESG NIAYKLLQEL GEAEAIGPIL
LGLNKPVHIV QLGSSVREIV NMVTLAVLDV QGKEQEVNLK KGGLLKRIAK K