Gene Daro_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4084 
Symbol 
ID3566848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4377105 
End bp4378847 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content62% 
IMG OID637682556 
Productphosphoenolpyruvate--protein phosphotransferase 
Protein accessionYP_287280 
Protein GI71909693 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.814023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCA CACTGCACGG CCTGGGAGTT TCCGGGGGCA TCGCCATCGG GCGGGCCATG 
CTGATGTCGC ACGCCACGCT GGAGGTCTCG CACCTGACCC TGGCGCCGCG CATGGTCGAC
AAGGAAATCG AGCGCTTCGA CATGGCAGTC AATGCCGTCA AAGAAGAGCT GATCCTGATG
AAGGAGAACA CCGAGCACGC CCCGGCCGAG CTCAATGCCT TCATCGACAT TCACACGATG
TTCCTCGAGG ACCCGGAACT GGCCGTCAAG CCGCGCGACA TCATCCGCGA GCGCCGCTGC
AACGCCGAAT GGGCGCTGGT CCAGCAGATG GAGCATCTGG TCGGGCAATT CGAGCAGTTC
GATGACCCCT ACCTGCGCGA GCGCAAATTC GACGTGGTGC AGGTGGTCGA ACGCGTCGTC
AAGGAACTGC TCGGCCACCG CAGCCGAAAT GCCCTGAAAA CGGCCAAACG TTCCAAGGAA
GAGGCACTGA TCGTCGTCGC CCACGACCTG TCGCCGGCCG ACACCATCGC CTTCAAGGAA
CACCGTTTCG CCGCCTTCAT TACCGACGTG GGCGGTGCCA CGTCGCACAC GTCCATCCTC
GCCCGCAGCA TGGCGATCCC GGCCGTGCTC GGCCTGGAAA ATGCGCGCGG GCTGATCCGC
GATGGCGAGC AATTGATCGT CGATGGCATG CGCGGCGTTG TCATCGTCAA TCCGGATCAG
CGGGTGCTTG ACGAGTACCA GCTGCGCAAA GAACAGATCG AGCTCGAAAA GACCAAACTA
AAGCGCCTGA AGACTGCAAA ATCGGAGACG ATCGATGGTG TTTCGGTGCA TTTGTTCGCC
AATATCGAAC TACCAAATGA CGTTCCGGTA GCCCTGGATT GCGGCGCCGA GGGCATCGGC
CTGTTCCGTA CCGAGTTCCT TTTCCTTGAC CGCGGCGACA TGCCGGACGA ACAGGAGCAG
TACGAGGCCT ACAAAAAAGT GGTCAAGGGC ATGGCCGGGC GGCCAGTCAC CATCCGCACC
TTCGACCTTG GCGCCGACAA GGATCTGAAC CCGCAAGGTA ATGCCGGCGA CCGGGTCAAG
ACCAATCCGG CCCTCGGCCG GCGGGCCATC CGCCTGTCGC TGGCCGAGCC CCGGATGTTC
CAGACCCAGT TACGGGCCAT CCTCCGCGCC TCGAAACACG GCCCGATCAA GCTGTTGATC
CCGATGTTGG CGCATGCCCA CGAAATCGAC CAGACGCTGG CGGCACTGGA GCAAGCCAAA
TCCAGCCTGC GCGGCGAAAA AGCCACCTTC GACGAAAACA TCGAAGTCGG CGGCATGATC
GAAATCCCCG CTGCCGCACT GGCCGTCGGT CTTTTCCTGC GCCGGCTCGA TTTCCTGTCG
ATCGGCACCA ATGACCTGAT CCAGTACACG CTGGCCATCG ACCGCTCAGA CGAGCAGGTG
GCCGGCCTCT ACGACCCGTT GCACCCGGCC GTGCTGATGC TGATCGCCCA TACGCTGTCG
ATGGCGGAAA AAGTCGGCGT TCCGGTTTCT GTCTGCGGTG AAATGGCCGG CGACCCGGAC
CTCACCCGCC TGCTGCTCGG CATGGGCCTG CGCATCTTCT CTATGCACCC GTCGCAAATC
CTCAAGGTCA AGAATCGCGT GCTGAAGGCC GAAGTCAATG AACTCGCCCC CAACGTCCGC
CGCATCCTGC GCCTCGACGA ACCGATGAAG CTGCGCGAGG CACTGGACAA GTTGAATGCC
TGA
 
Protein sequence
MSFTLHGLGV SGGIAIGRAM LMSHATLEVS HLTLAPRMVD KEIERFDMAV NAVKEELILM 
KENTEHAPAE LNAFIDIHTM FLEDPELAVK PRDIIRERRC NAEWALVQQM EHLVGQFEQF
DDPYLRERKF DVVQVVERVV KELLGHRSRN ALKTAKRSKE EALIVVAHDL SPADTIAFKE
HRFAAFITDV GGATSHTSIL ARSMAIPAVL GLENARGLIR DGEQLIVDGM RGVVIVNPDQ
RVLDEYQLRK EQIELEKTKL KRLKTAKSET IDGVSVHLFA NIELPNDVPV ALDCGAEGIG
LFRTEFLFLD RGDMPDEQEQ YEAYKKVVKG MAGRPVTIRT FDLGADKDLN PQGNAGDRVK
TNPALGRRAI RLSLAEPRMF QTQLRAILRA SKHGPIKLLI PMLAHAHEID QTLAALEQAK
SSLRGEKATF DENIEVGGMI EIPAAALAVG LFLRRLDFLS IGTNDLIQYT LAIDRSDEQV
AGLYDPLHPA VLMLIAHTLS MAEKVGVPVS VCGEMAGDPD LTRLLLGMGL RIFSMHPSQI
LKVKNRVLKA EVNELAPNVR RILRLDEPMK LREALDKLNA