Gene Pden_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_3044 
Symbol 
ID4581608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp221466 
End bp222683 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID639770368 
Productmajor facilitator transporter 
Protein accessionYP_916821 
Protein GI119385766 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCCG AGGAGCCCGC CGCGCCGGAT CGGAGTCCCC TGCCCCCGAT CTTGGCGGTT 
GTCGCGGTGG AAAGTGCTGG TCTGGGCATG ATCCTGCCCC TGCTGCCGTT CTACGCAAGC
GAGTTCGGAG CCACGCCCCT GACCATCGGC CTGCTGCTGG CCAGCTTTTC GCTGTGCGAG
TTGCTGGCCG CGCCGATCCT CGGGAAAGCC TCCGACCTGT TCGGCCGCAA GCGGCTGCTG
GTGGCCAGCC AGATCGGAAC CTGTGCGAGC TTCCTCCTGC TCGCCCTGGC CCCCAATCTT
GCCGTCGTGT TCGCCGCTCG AATCCTCGGC GGCCTGTCGG CAGGCAACAT CTCGATCGCG
ACCGCCTATG TGTCGGACCG GACTGCGCCC CACAAGCGGC GACAGGCCAT CGGCTTCGTC
AGTGCCGCGA TGGGTGTCGG CACGATGGTC GGACCGGCGC TGGCAGGTGT TCTGGCACCG
ATGGGACTGG CCGTGCCGTT CGGAGTCGCC GCAGTCGTTT CCCTGGCCAG CATCCTGGCA
ACCTGCCTCC TGCTTCCCGC CGAAGGACAG CAGCCGATCC ACAAGCCAGC GCCAGCACCT
TTCCTGGCGC AGATCGGCGG TCTTCTCGCC TTGCCGCAGG TGAAGCACCT TCTTGCGGCG
CTCGCCCTGC TGTTCATGGC GTTCAGCCTG TTCGGATCGC AATTCGCCCT GGTGCTGCAC
GCCCGCTTCC AATGGCAAGG CCGGCCCTTC GATCCGACAG AGATCGGGTT CGTGTTCGTT
GCGGCCGGAG CGATCAATAT TCTGGTGCAG GTCGTCGTGA TGCCGCTCCT TGGCAAGGCC
CTGCGCGAGA GGACGCTGGC CATGGCCTCC TTCAGCCTCC TCGCCATCGG GTTCATGATC
ATCGGCCTTG TCGGCACGGT CCCGGCGTTG GCGGCCGGAA TAGTGATGGC CGCAACAGGC
ATCGCCTTGG CCCGGCCCAC GCTTGTCGCC GCACTTTCAC TCATCGTGCC CGCCACCTCG
CAGGGTGCTG CGATGGGGAT GGCCCAGTCC ATGGCCGCGC TGATCAACAT CGTCGCACCG
ATTGCCGGGG GGCTGCTGAT CGAGGGCCAG CATTTCACCG GGTGGGCCAT GGCCCTTGTC
GGGCTGGCCT TGGCCGGTAT TGCCGTCGTG GCACTATCTC CGGCGGGATC GGATTCATCA
AAGAGAGCTG ACCCGTGA
 
Protein sequence
MMPEEPAAPD RSPLPPILAV VAVESAGLGM ILPLLPFYAS EFGATPLTIG LLLASFSLCE 
LLAAPILGKA SDLFGRKRLL VASQIGTCAS FLLLALAPNL AVVFAARILG GLSAGNISIA
TAYVSDRTAP HKRRQAIGFV SAAMGVGTMV GPALAGVLAP MGLAVPFGVA AVVSLASILA
TCLLLPAEGQ QPIHKPAPAP FLAQIGGLLA LPQVKHLLAA LALLFMAFSL FGSQFALVLH
ARFQWQGRPF DPTEIGFVFV AAGAINILVQ VVVMPLLGKA LRERTLAMAS FSLLAIGFMI
IGLVGTVPAL AAGIVMAATG IALARPTLVA ALSLIVPATS QGAAMGMAQS MAALINIVAP
IAGGLLIEGQ HFTGWAMALV GLALAGIAVV ALSPAGSDSS KRADP