Gene Pden_3643 details


Gene Information

Locus tag: Pden_3643
Symbol:
ID: 4582195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Paracoccus denitrificans PD1222
Kingdom: Bacteria
Replicon accession: NC_008687
Strand:
Start bp: 793859
End bp: 795055
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 71%
IMG OID: 639770953
Product: major facilitator transporter
Protein accession: YP_917406
Protein GI: 119386351
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
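
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(793859, 795055)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length (1197 bp) and Protein Length (398 aa) values listed in the record.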


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0314134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.384152
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCCGC AGCCCAATCT CAAGCTTGCC ATCCTTGCCC TTGGCCTTGG CGCCTTTGCC 
ATCGGCACTT CGGAATTCGC CGCCATGGGG CTGCTGCCCT GGTATGCCAG CGACCTCGGC
ATCACCGAGC CGCAGGCGGG CCATGTGGTT TCCGCCTATG CGCTTGGCGT GGTCGTGGGG
GCGCCCGTCA CCTCGATCCT GGGCGCGCGG CTGCCGCGGC GGCGCTATCT TGCGGCGCTG
ATCGCGGTTT ATGGCGCGAT GAACCTGCTG GCGGCGGTGC TGCCGGGTTA CGGCACGCTG
GTCGGCATGC GCTTTCTGGC CGGCTTGCCG CATGGCGGCT TTCTGGGCGT GGCCATGCTG
TTTGCCGCCG ATGCCCTGCC GCGCGAACAG CGTGCCAAGG GCGTGACGCA GGTGCTGCTG
GGGCTGACCA TCGCCAATAT CGCCGGGGTG CCCTTGGCCG GTATCCTGGG GCAGGGCTTC
GGCTGGCGCT GGGGTTTCGC GCTGCCCGGC GTGCTGGCGC TGTTGGCGGG CTGGCTGATC
CTGCGGCTGG CGCCCCGGGT CGGCGCGCCC AAGGACGCGC GGCCGCTGGC CGAACTGAGG
GCGCTGCGCA ATCCGGCGGT CATGCTGATC CTGCTGGTCG GCGCCATCGG CTTCGGCGGG
CTTTTCGCGG TCTATTCCTA TCTTTCCGCC GCGATGCTGG CGACGGCCCA GCCGCCGGGC
TGGGCCATAC CGGCTGCGCT TTCGGCCTTT GGCATCGGCG GCACGCTAGG CAGCATTCTG
GCCGCCCGCC TGACCATCCG GCACGGCACC TGGGGCGCGG CCTTGCGGCT GATGCTGTTC
ATGGCCGTGA CCCAGGGCTT TGCGGCCTGG GCGGTGGGCA ATTGGGGGCT GATGCTGGTC
TCGTCCTTCC TGCTGGGTCT GGGCTCGGGC ATGGTGGTGC CCTTGCAGAC CCGGCTGATG
GATGTCGCGG GCGAGGCGCA AAGCATGGCC GCGGCGATGA ACCATGCGGC CTTCAACGCC
GCCAATGCGC TGGGGCCGTG GCTGGCCGGG CTGGCGCTGG CGGCGGGCTG GGGCTGGCGT
TCCTCGGGCC TGGTGGCGGT GGCGCTGTCA GGGGCCGGTC TGCTCGCCCT CGGCCTTGCC
TGGCGGCAGG CCCGTCTCTC CGGGTACCAA CTGCATGATC GCCCGGCGCG AGTGTGA
 
Protein sequence
MSPQPNLKLA ILALGLGAFA IGTSEFAAMG LLPWYASDLG ITEPQAGHVV SAYALGVVVG 
APVTSILGAR LPRRRYLAAL IAVYGAMNLL AAVLPGYGTL VGMRFLAGLP HGGFLGVAML
FAADALPREQ RAKGVTQVLL GLTIANIAGV PLAGILGQGF GWRWGFALPG VLALLAGWLI
LRLAPRVGAP KDARPLAELR ALRNPAVMLI LLVGAIGFGG LFAVYSYLSA AMLATAQPPG
WAIPAALSAF GIGGTLGSIL AARLTIRHGT WGAALRLMLF MAVTQGFAAW AVGNWGLMLV
SSFLLGLGSG MVVPLQTRLM DVAGEAQSMA AAMNHAAFNA ANALGPWLAG LALAAGWGWR
SSGLVAVALS GAGLLALGLA WRQARLSGYQ LHDRPARV
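
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python, not the method the database uses. It assumes that whitespace in the sequence blocks is only formatting, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGTCGCCGC AGCCCAATCT CAAGCTTGCC"
print(translate(prefix))  # MSPQPNLKLA
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.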