Gene PP_2021 details


Gene Information

Locus tag: PP_2021
Symbol:
ID: 1042907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas putida KT2440
Kingdom: Bacteria
Replicon accession: NC_002947
Strand:
Start bp: 2294322
End bp: 2295452
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 64%
IMG OID: 637145435
Product: hypothetical protein
Protein accession: NP_744171
Protein GI: 26988746
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2706] 3-carboxymuconate cyclase
TIGRFAM ID:
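
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2294322, 2295452)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1131 376
```

Under these assumptions the 1131 bp span gives 377 codons, one of which is the stop codon, matching the listed 376 aa.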


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.145054
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGGA CCTGGACGAG CCTGCTGACC GCTAGCCTGA TGAGCCTGAC AATCTCTGCC 
CACGCTGCCA CCTTGCTGGT GGGCAGCTAC ACCGATGGCG CCAGCCAGGG TATCTACCGC
TACCATTTCG ACGACAAGGC CGGCCAAATC GGCCCCACAC CTCTGCAGGT GGTGAAAAGC
GTCAGCCCTT CGTGGCTGGT GCTGTCGGCC GACCAGCGTC AGCTGTTCGC GGTGAATGAG
ACCCCGCAGG GCCATGCCAG CAGTTTCAGC ATCAGCAGCA AAGGCGAAAT CAAGCCGCTC
AACCAAGTGG TCACCCAGGG CGACGAGCCC ACCCACGCCA GCCTCAGCCG TGACCAGCGC
TACCTGTTCG TGGCCAACTA CGCGGTCAAC CCCGACCCCG GTGGCAGCCT GGTGGTGATC
CCGGTGGCCA AGGACGGCAC GCTCAAGCCC GTGGTGCAAC AGGCCCGGCA TAAGGCGAGT
GGGGTCAACC CTGAGCGCCA GGCCGGTGCC CACGTGCATT CGCTGGTGCT GTCGCCGGAT
GGCCAGCACC TGTATGCCAG CGACCTGGGT GCCGACAAGG TGTTCATCTA CCGCTACGAC
GGTGCCAGTG CGGACCACCC GCTGACAGCG GCGATACCTG CGTCCGTGGC CTTGCCGCCG
GGCAGCGGTC CGCGTCACTT GCTGTTCGAC GCCAAGGGCC GGCACGCCTA CCTCACCCTG
GAAATGAACG CCGAGGTGGT GATGTTCGAT GTGCAGGACG GCAACCTGGT TGAACGCCAG
CGCTTACCCC TGACCGAGCG CCAGGAGGCC GCAGCGAAGG CAGCAGGTGG CTTGCACCTG
TCGGCGGACG GGCGCTTCCT GTACGTGAGC AACCGTGGCA CGGCCAATGA AATTGTGGCG
TTCAGCGTGG GCAAGCAGGA CGGCCAGTTG ACGTTCCTGC AGCGTCGCCC GGCAGAAGGT
GATCACCCTC GGGAGTTTGC CCTGGACCCG AGTGACAACT TCCTGCTGGT GGCCAACCAG
AAGAGCAACC AGATCGTGGT GATACGTCGC GATCCGCGCA GTGGCAAGCT GCTGGAGACG
GTGCAGACGC TGAAGCAGGA TGCACCCTCG GACCTCAAGT TCATCGAGTG A
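
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, not part of the source database, and only the last line of the sequence is used as sample input.

```python
def join_blocks(lines):
    """Concatenate block-formatted sequence lines, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the gene sequence above, used here as a small example.
tail = join_blocks(["GTGCAGACGC TGAAGCAGGA TGCACCCTCG GACCTCAAGT TCATCGAGTG A"])
print(len(tail))             # 51
print(tail.endswith("TGA"))  # True: the CDS ends with a stop codon
```

Passing all lines of the gene sequence to `join_blocks` and then to `gc_fraction` would allow the listed GC content to be checked against the sequence itself.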
 
Protein sequence
MNRTWTSLLT ASLMSLTISA HAATLLVGSY TDGASQGIYR YHFDDKAGQI GPTPLQVVKS 
VSPSWLVLSA DQRQLFAVNE TPQGHASSFS ISSKGEIKPL NQVVTQGDEP THASLSRDQR
YLFVANYAVN PDPGGSLVVI PVAKDGTLKP VVQQARHKAS GVNPERQAGA HVHSLVLSPD
GQHLYASDLG ADKVFIYRYD GASADHPLTA AIPASVALPP GSGPRHLLFD AKGRHAYLTL
EMNAEVVMFD VQDGNLVERQ RLPLTERQEA AAKAAGGLHL SADGRFLYVS NRGTANEIVA
FSVGKQDGQL TFLQRRPAEG DHPREFALDP SDNFLLVANQ KSNQIVVIRR DPRSGKLLET
VQTLKQDAPS DLKFIE