Gene Pden_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_1090 
Symbol 
ID4578786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp1063371 
End bp1064624 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content66% 
IMG OID639768409 
Productextracellular solute-binding protein 
Protein accessionYP_914894 
Protein GI119383838 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC TATTTCCCGC CCTTCTCGCC AGCTCGGCAT CGCTGATCGC CGCGATGCCC 
GCTCAGGCCG AGGTACAGCT CGAGGTCATG CATCACTGGG TGGCCGAAAG CGAAGCCGCC
GCGCTCAACG TCGTCCGGGA AGCACTGGCT GCCAAAGGCT ACGGCTGGCA GGACAGCGCC
GTGGGCGGAC AGTCCGGCGG CAACATGCAG CAGGCGTTGC GCTCGCGCCT GGCCTCTGGC
AACCCGCCCG GCGCCATGCA GTTCATCGGC TGGGAGGGGA TCGACTGGTC GGCCGAGGGC
GTGATGCGCG AGCTCAACGC GCTTGACGAG GCGAATGGCT GGGAGGCCGC CATTGCGCCG
CAGGTGCTGC CCTTCGTCAA GAACGGCGAC GACCTGATCG CGGCGCCCAT CAACATGCAT
CGCCAGAACT GGGTCTGGGC GAACAAGGCC GCCTTCGACA AGGCCGGCAT CGAGCAGCCC
GGCACCTGGG CCGAACTGAT CGAGGACGGG GCGAAGCTGA GGGAGGTCGG CGTCATCCCG
CTGGCCATGG GCGACGAGCC CTGGCAGATC CAGGTGATAT TCGACGCGCT CGTCGCGGAT
ATCGGCGGGC CGGAATTCTA CAACAAGGCG CTGGTCGAAC TGGACCCCGA GGCGCTGTCC
TCGGACACCA TGAAACAGGT CTTCGACACG CTGCGCCAAG TGCGGGGGCT GGTCGACGAC
GGTTTCACCG GCCGCGACTG GGCCGTGGCC TCGGGCATGG TCATCAACGG CCAGGCCGGG
ATGCAGATCA TGGGCGACTG GGCCAAGGGC GAATTCCTGG CCAAGGGCCT GAAGCCGGGC
GAGGATTTCC TGTGCTTCGC GACGCCATCG GAAACCCCGT CCTTCCAATG GCTGATCGAC
AGTTTCGGCA TGTTCAAGGT GACCGATCCC GAGGTGACCA AGGCGCAGGA CGCCCTGGCC
GAAGTGGTCA TGGGCCCCAA GGTCCAGCAT GACTTCAACC TGATCAAGGG CTCGATCCCG
GCCCGCACCG ACCTGCCGGT GGACGATTTC GACGATTGCG CGAAGAAGGG CTTCGAGGAT
CGCGCCATCG CGATCGAGAA CGGCGCCATG CTCGGGGCCT CGACGCATGG CTTCGCCAGC
CAGCCCCAGT TCGCCACCGT GTTCGGGGAT GTCGTGGCGC AGTTCTTCGT CAACGACATG
GCCTCGGAAG ACGCCGTGCA GATGCTGGTC AGCGGAATCG ACAACGCCCG CTGA
 
Protein sequence
MKNLFPALLA SSASLIAAMP AQAEVQLEVM HHWVAESEAA ALNVVREALA AKGYGWQDSA 
VGGQSGGNMQ QALRSRLASG NPPGAMQFIG WEGIDWSAEG VMRELNALDE ANGWEAAIAP
QVLPFVKNGD DLIAAPINMH RQNWVWANKA AFDKAGIEQP GTWAELIEDG AKLREVGVIP
LAMGDEPWQI QVIFDALVAD IGGPEFYNKA LVELDPEALS SDTMKQVFDT LRQVRGLVDD
GFTGRDWAVA SGMVINGQAG MQIMGDWAKG EFLAKGLKPG EDFLCFATPS ETPSFQWLID
SFGMFKVTDP EVTKAQDALA EVVMGPKVQH DFNLIKGSIP ARTDLPVDDF DDCAKKGFED
RAIAIENGAM LGASTHGFAS QPQFATVFGD VVAQFFVNDM ASEDAVQMLV SGIDNAR