Gene Pden_4181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_4181 
Symbol 
ID4582731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp1337701 
End bp1339008 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID639771488 
Productextracellular solute-binding protein 
Protein accessionYP_917941 
Protein GI119386886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000770865 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.668736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGA TGCTGCGCGG CCTGATGGGG GCCTGCGCCA TGACGGCGCT GGCCGCGCCC 
GCATGGGCCG AGACGCTGAC CATCGCCACC GTGAACAACG GCGACATGAT CCGCATGCAG
AAGATGACCA AGCCGTTCAC CGACGCGAAC CCGGACATCC AGCTGGAATG GGTCACGCTG
GAAGAGAACG TGCTGCGCCA GCGCGTCACC ACCGACATCG CCACCAAGGG CGGGCAATAT
GACATCGTGA CCGTCGGCAA TTACGAGGTG CCGATCTGGG CCAAGCAAGG CTGGCTCCTG
GCGCTTGAGG ACATGGGCGC GGATTACGAT GCCGCCGACA TCCTGCCGCC CATCGCCGAG
GGGCTGTCGC TGGACGGCAA GCTTTATGCC GCGCCCTTCT ATGGCGAATC CGCGATGATC
ATGTATCGCA CCGACCTGAT GGAAAAGGCC GGGCTGGAGA TGCCCGAGCG CCCGACCTGG
GACTTCATCT ATGACGCGGC GCGCAAGATG ACCGACAAGT CGGCCGAGGT CTATGGTATC
TGCCTGCGCG GCAAGGCCGG CTGGGGCGAG AACATGGCCT TCCTGACCTC GATGGGCGCA
AGCTACGGTG CGCCCTGGTT CGACATGGAA TGGAAGCCGC AATTCACCGG CGAGGCGTGG
AAGAAGGCGC TGACCGACTA CGTCGCGATC ATGAACGAGG CCGGCCCGCC CGGCGCCTCG
TCCAACGGCT TCAACGAGAA CCTGGCGCTG TTCCAGACCG GCAAATGCGG CATGTGGATC
GACGCCACCG TGGCGGCAAG CTTCGTGTCG AACCCCAAGG ATTCGACCGT GGCCGACAAG
GTCGGCTATG CGCTGGCCCC CGAGGGCGAG AAGCCGCAGA TGTGGCTTTG GGCCTGGACG
CTGGCGATCC CGTCCTCGAC CGACGCGCCG GATGCGGCCA AGAAGTTCGT CGCCTGGGCG
ACCTCGAAAG CCTATACCGA GCAGGTCGCG GCGGAAGAGG GCTGGGCGAA CGTGCCGCCG
GGCACCCGGA CCTCGCTCTA CGAGAACCCC GCCTATCAGG AGGCCGCGCC CTTCGCCAAG
CCGACGCTCG ACAGCATCAT GTCGGCCGAC CTGAAGAACC CGACCACCGC CGAGGTGCCC
TATATCGGCA CGCAATGGGT GGGCATCCCC GAGTTCCAAG CGCTTGGCAC CGCGGTCGGC
CAGCAATTCT CGGCCGCCCT GGCCGGGCAG GCCAGCGTCG ACCAGGCGCT GCAGATGGCC
CAGCAGATCG CCGAGCGCGA GATGGCCAAG GCCGGCTATC CGAAGTGA
 
Protein sequence
MSMMLRGLMG ACAMTALAAP AWAETLTIAT VNNGDMIRMQ KMTKPFTDAN PDIQLEWVTL 
EENVLRQRVT TDIATKGGQY DIVTVGNYEV PIWAKQGWLL ALEDMGADYD AADILPPIAE
GLSLDGKLYA APFYGESAMI MYRTDLMEKA GLEMPERPTW DFIYDAARKM TDKSAEVYGI
CLRGKAGWGE NMAFLTSMGA SYGAPWFDME WKPQFTGEAW KKALTDYVAI MNEAGPPGAS
SNGFNENLAL FQTGKCGMWI DATVAASFVS NPKDSTVADK VGYALAPEGE KPQMWLWAWT
LAIPSSTDAP DAAKKFVAWA TSKAYTEQVA AEEGWANVPP GTRTSLYENP AYQEAAPFAK
PTLDSIMSAD LKNPTTAEVP YIGTQWVGIP EFQALGTAVG QQFSAALAGQ ASVDQALQMA
QQIAEREMAK AGYPK