Gene Pden_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_0803 
Symbol 
ID4580627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp777739 
End bp779373 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content68% 
IMG OID639768122 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_914611 
Protein GI119383555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.288414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCCC CTGCCCGCAA GAGCCCCTTT GCCCCGTTCC GCCATCGCGA CTTCCGCCTG 
CTCTGGTCGG CGACGCTGAT CTCGAATTTC GGCGGGCTGG TCCAAGCGGT GGGGGCGGCC
TGGATGATGA CGCAGCTGAC CGATTCGGCG ACGCTGATCG CGCTGGTGCA AGCATCCAAC
ACGCTGCCGA TCATGCTGTT TGCGCTGCTG TCGGGCGCGC TGGCCGACAT CTTCGACCGC
CGCACGCTTC TGCTCGGGGC ACAGGTCTTC ATGGCCGCGG TCTCGGTGCT GCTGGCGGTG
CTGACCTGGC AAGGCTGGAT GACGCCCTTG CTGCTTTTGT CGCTGACCTT TCTGATCGGG
GTCGGACAGG CGATCTACAA CCCGCCCTGG CAGGCCAGCA TGCAGGACCT GGTGCCGCGC
GACGACCTGC CGGCGGCGGT CTCGCTGAAC TCGGTCGGCT TCAACCTGAT GCGCTCGGTC
GGTCCGGCGG TGGGCGGGAT CATCACCGCC GCCTTCGGAG CCGCCGCCGC CTTTGCGGTC
AATGCCGCAA GCTACATCCC GCTGCTGGGC GCGCTCACGC GCTGGCATCC GGTGACGCCG
CCCCGCGTCA CCACGCCCGA GCCCTTCGTC GCCGCCGTGG GCGCCGGCCT TCGCTATGTG
GCGCTGTCGC CGAACCTGGT GCGGGTGCTG TCGCGCGGGG CGCTGTTCGG CTTTTCGGCC
ATCGTCGTCA TGGCGCTGTT GCCGCTGGTG GCCAAGCAGA ACCCCACGGG CGGCTCGCTG
CTGTTCGGCC TGCTGCTGGG CTGCTTCGGC CTGGGCGCGA TCTGCGGCGC GCTGATCAAC
CCGCTGGTGC GCGAAAGGCT TGACAACGAG AACGTGGTGC GCGTCGCCTT TGCCGCCTTC
GGCGCCTCGG CACTGATGCT GGCCCTGACC GAAAGCACCT GGCTGCATGC GCTGGCCATG
CTGCCGGCGG GCGCAAGCTG GGTGCTGGCG CTGTCGCTCT TCAACGTCAC GGTGCAGCTT
TCGACGCCGC GCTGGGTGGT GGCGCGGGCG CTGGCGCTTT ACCAGACCGC GGTCTTCGGC
GGCATGGCGG CGGGCAGCTG GGCCTGGGGT TCGGTCGCCA ACAATTACGA CGTGAACACG
GCGCTGATCA CGGCCTCGGT GCCGCTGTTC CTGGGCGCGA TGCTGGGGCA CTGGCTGCGC
ATCCCCGAAT TCGGCACGCT GGACCTCGAC CCGCTCAACC GCTTTCGCGA GCCGGAACTG
GCGCTGGACC TGCGCGGCCG TTCGGGCCCG ATCATGGTGA TGGTCGATTA CGAGATCGAC
CAGAAGGACG TGCCAGAATT CCTGCGCCTG ATGGCGCTGC GCCGCAACGT GCGCCGCCGC
GACGGGGCGC GGAACTGGGC GCTCTTGCGC GACCTGGAGC ATCCCGAGCG CTGGACCGAA
AGCTATCACA TCGCCACCTG GGACGAATAC GTGCGCCACA ACCTGCGCCG CACCAAGGCC
GATTTCGAGA CCTACCAGGA CCTGAACAAG CTGCATCGCG GCACCGAGCC GCCCATCGTC
CACCGCATGA TCGAGCGCCA CACCGTCAGC CTGGACGACG ATGTGCCGCT GATCGGCAAG
CTGGAAGTGC CCTGA
 
Protein sequence
MPAPARKSPF APFRHRDFRL LWSATLISNF GGLVQAVGAA WMMTQLTDSA TLIALVQASN 
TLPIMLFALL SGALADIFDR RTLLLGAQVF MAAVSVLLAV LTWQGWMTPL LLLSLTFLIG
VGQAIYNPPW QASMQDLVPR DDLPAAVSLN SVGFNLMRSV GPAVGGIITA AFGAAAAFAV
NAASYIPLLG ALTRWHPVTP PRVTTPEPFV AAVGAGLRYV ALSPNLVRVL SRGALFGFSA
IVVMALLPLV AKQNPTGGSL LFGLLLGCFG LGAICGALIN PLVRERLDNE NVVRVAFAAF
GASALMLALT ESTWLHALAM LPAGASWVLA LSLFNVTVQL STPRWVVARA LALYQTAVFG
GMAAGSWAWG SVANNYDVNT ALITASVPLF LGAMLGHWLR IPEFGTLDLD PLNRFREPEL
ALDLRGRSGP IMVMVDYEID QKDVPEFLRL MALRRNVRRR DGARNWALLR DLEHPERWTE
SYHIATWDEY VRHNLRRTKA DFETYQDLNK LHRGTEPPIV HRMIERHTVS LDDDVPLIGK
LEVP