Gene Pden_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_1000 
Symbol 
ID4578776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp976763 
End bp979738 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content63% 
IMG OID639768322 
Productphage tape measure protein 
Protein accessionYP_914807 
Protein GI119383751 
COG category[R] General function prediction only 
COG ID[COG3941] Mu-like prophage protein 
TIGRFAM ID[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.734571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG ATCTCGAAAA GCTGGTTGTC CAGCTCTCTG CTGACATCAA GGGCTATGAG 
CGCGAAATGC GCAAGGCCGT TGGCGTGACG AACCGGCAAG CGCGCGACAT CGAAAAGCGG
TTCCTCGCGA TGCAGCGCAA TCTGGATGGT ATCGGATCGC GGGCCGCCAA GTCGCTGATC
GCGCCGTTTA CCGGCATTGC TGCGGCGCTT GGTGGCCGGG AACTGATCCG CATGACTGGG
CAGTGGACCG ACCTGACCAG CCGGGTGAAC CTGGCGGCAG GCAGCATGGA GAAAGGCAAT
GAGGTCATGC GGCGCGTCAG CGAGATGGCG CGTCGCACCT ATTCCGACCT CAGCCAGACC
GCCGAGGGAT ACCTGGCATT CTCGACCACG CTGACCGAAC TTGGCGTTTC GACGGATCGC
CAACTGGACT TCGTGGAAAG CCTGAACAAC GCCCTGGTTG TGTCCGGCGC CAAGGGCCAG
ACCGCCGAAC GTGTCATGAG TGCGCTCTCC AAGGCGATGG CGCTCGGATC GCTCCAGGGT
GACAACCTGA ATACGGTGAT CGATTCCGGC GGACGTGTCG CGCAGGCCCT CGCCGATTCC
ATGGGTGTCA CCACCATGGA ATTGCGCAAG CTGGGATCGG AAGGGAAAAT CGGGCGCCGC
GAACTGCTCG GCATCTCGAA GGAGATGGAG AAGCTCCGGC GCGAGGCGGG CGAGATGCCC
ACCACGATCC AGGACGGCTT CATGCTTCTG AACAACGCCC TGCTGGAATA TGTCGGGCGT
GGGGACGATG CCGTCGGGAT GTCGGGCCGG ATCGCCGAGG CGCTTACGGT CATCGCCGAC
AATTTCGATA CCGTCGCCGA TTCCGGCCTG AAGCTTGCCG CGGTTCTCGC GGCCGGCATG
CTCGGCCGGT CTATCGGGGG CATGATCACC AAGCTGGGCG CCGCAACCGG GGTGTTGATC
AAGTTTGGCG CCGCATTGCG CGCGGCTACC TCCATGGCCA GCGTCGGCAC CGCAATCAGT
GGGTTGAGCG CGGCGGCGGG GCCGCTGGTC ATGGTTATCG GAGGTCTGCT CGCTGGCGGT
GTCCTGCTTT ACTCGGATCG TGCCCGAGAG GCAGAACAGC GCAGCAAGGA TCTGCGGGAT
GAGCTGCAAC GCCTGGGGCT TTACGCACCG GATGCAGCCG TGGCGCTGGA AGAGGTCGCA
GATGCTGCCG ATGGTATCGG CACCGAGGAT CAGTTGGCCC GCATCGAGCG GTTCCGCAAG
GGCCTCGAAG ATATCAAGGG CACTGGCGGG CTTTGGTCCT GGATCAGTGG TGGCGACGGC
GAACTGGACG GGTTGATCCA GCAGGTAGAC CGCTTTTACC ATGCCTTTGA TCAGGCTGAT
CTTCCGCTGG TCCGGCAGTT GCGCGATCTG GCAAAGGCTT ATCAGGACAA AGAGATTTCC
GCCGAAAGCT TTGCTCGCAG CGTTGCGGCC GCCGATTTGG AGGGCGCAGG GCGTGCAGCG
CGTGAATATG CAGCGGCATT GCGCGAAATT GCGGAGCGCT CGAACGCACT TACCACCGGG
CTCCTGTTCG ATGGCGTTGC AGTCGAAATC GATGCGGCAA CTGCATCTGT GGAGGGGCTT
CTGGAAGAGC TGCACTTCGT CGCCAACCTT CAAGGTCTCG GCGACGTGGT TGTGCAGGAA
ATCAGGGATG TCATCGCGGA AATGAACAAG GGCAAGAAGT CCGCCCAAGA GGCTGCCGAT
GAGATCGAGG CTCTCGGAAA CACCCGCGCA GACTTTAATG GCCTGCACGG TGATATCGCA
AATGCCATCA AGGCCCTTGG TGAACTGCGC GCGGCGGCGA TCCGAACCGC ATCGACCATC
GCGGCCACGG TGGCGATGCG CCCGAGTTCT GGTGGCGACG ACACCTCGCT TGATCGCCAG
AATGAGGCAT CGGCGGCTTA CATTGCCGAG CATAAGCGTC GCCTGGACCT GACCCGTGAG
CAGGTGTCGC TGGAAAAGGA GATGGAGCGG ATCACGAAAG ATGCTGCCCA GAACCAGGCT
GTGTTGACGC AAGAGCAAAT CCGCCAGATG GCGGTTGCGA ATCTTGCCGC CGAGGCCCGT
CGCGCCGAGG AAGGCCGTTC CGGGCGCTCG GGCGGCGGCC GGAAGTCCGG CGGCGGAAAG
GGCAAGAAGG GAACGACGGT CACCGACATA TTCGAGGACG CTGGCCGCGA TCTGGAAAAC
CTGGAGCGGC AGATAGAGCT GGTCGGCAAG TCGGCGCAGG AGACTGCCAA GCTGCGTGCG
CAGTGGGAAT TGCTGGATGC CGCGAAGAAA GCCGGCCTGC CCATCGACGA CAAGCTGCGC
CAGAACATCC TGGAGCATGC CGAACATGTC GGATACCTCA CCGATCAGCT GGAAAAGGCC
GAGATTGCGC AGCAGCAATT CGACCAGGCG ATCGATGGCG TCGCAGATGC CTTTGCGGGT
GCTTTGATGG CCGGTGAAAG CTTGCGGGAT GGACTGGCGC AGGTGCTCAA GCAGATCGCA
GCAGATATCA TCAACAGCGG CATTCGGAAC GCCCTCATCG GCCAGTTCGG CGGGGGAGGT
GGTGGGATCC TCGGCGGTCT CTGGCAGTCT ATGATGGTCG GCGGTGACAG GCTGACCGGG
GCGTTGCGTC TGGCTGGTCT TCCGGCTCGG GCGAATGGTG GGCCGGTGCA GGCTGGTCAG
ATGTATATGA CCGGCGAGAA GGGGCCGGAG CCGTTCGTCC CTGCGGTGAA CGGTCGTATC
CTCAGCGTCG CACAGGCACA GGCGGCGTTG CGCGGGCGCG CCGGCGGGCG TTCGGGCGGG
ATCAACGCCA CCTTCGCCCC GAACATCAGC ATCGCGCCCG GTGTCACCCA AGCCGAACTG
GCCATGACCA TGGCGGCCGC CCAACGGGAG TATGAGCAGA AGTTCCTGCC CATGCTGCAA
AAGCACATGC CCAGCTATAA TGAGCGGTAT ACCTGA
 
Protein sequence
MATDLEKLVV QLSADIKGYE REMRKAVGVT NRQARDIEKR FLAMQRNLDG IGSRAAKSLI 
APFTGIAAAL GGRELIRMTG QWTDLTSRVN LAAGSMEKGN EVMRRVSEMA RRTYSDLSQT
AEGYLAFSTT LTELGVSTDR QLDFVESLNN ALVVSGAKGQ TAERVMSALS KAMALGSLQG
DNLNTVIDSG GRVAQALADS MGVTTMELRK LGSEGKIGRR ELLGISKEME KLRREAGEMP
TTIQDGFMLL NNALLEYVGR GDDAVGMSGR IAEALTVIAD NFDTVADSGL KLAAVLAAGM
LGRSIGGMIT KLGAATGVLI KFGAALRAAT SMASVGTAIS GLSAAAGPLV MVIGGLLAGG
VLLYSDRARE AEQRSKDLRD ELQRLGLYAP DAAVALEEVA DAADGIGTED QLARIERFRK
GLEDIKGTGG LWSWISGGDG ELDGLIQQVD RFYHAFDQAD LPLVRQLRDL AKAYQDKEIS
AESFARSVAA ADLEGAGRAA REYAAALREI AERSNALTTG LLFDGVAVEI DAATASVEGL
LEELHFVANL QGLGDVVVQE IRDVIAEMNK GKKSAQEAAD EIEALGNTRA DFNGLHGDIA
NAIKALGELR AAAIRTASTI AATVAMRPSS GGDDTSLDRQ NEASAAYIAE HKRRLDLTRE
QVSLEKEMER ITKDAAQNQA VLTQEQIRQM AVANLAAEAR RAEEGRSGRS GGGRKSGGGK
GKKGTTVTDI FEDAGRDLEN LERQIELVGK SAQETAKLRA QWELLDAAKK AGLPIDDKLR
QNILEHAEHV GYLTDQLEKA EIAQQQFDQA IDGVADAFAG ALMAGESLRD GLAQVLKQIA
ADIINSGIRN ALIGQFGGGG GGILGGLWQS MMVGGDRLTG ALRLAGLPAR ANGGPVQAGQ
MYMTGEKGPE PFVPAVNGRI LSVAQAQAAL RGRAGGRSGG INATFAPNIS IAPGVTQAEL
AMTMAAAQRE YEQKFLPMLQ KHMPSYNERY T