Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_1000 |
Symbol | |
ID | 4578776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008686 |
Strand | + |
Start bp | 976763 |
End bp | 979738 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639768322 |
Product | phage tape measure protein |
Protein accession | YP_914807 |
Protein GI | 119383751 |
COG category | [R] General function prediction only |
COG ID | [COG3941] Mu-like prophage protein |
TIGRFAM ID | [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.734571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACCG ATCTCGAAAA GCTGGTTGTC CAGCTCTCTG CTGACATCAA GGGCTATGAG CGCGAAATGC GCAAGGCCGT TGGCGTGACG AACCGGCAAG CGCGCGACAT CGAAAAGCGG TTCCTCGCGA TGCAGCGCAA TCTGGATGGT ATCGGATCGC GGGCCGCCAA GTCGCTGATC GCGCCGTTTA CCGGCATTGC TGCGGCGCTT GGTGGCCGGG AACTGATCCG CATGACTGGG CAGTGGACCG ACCTGACCAG CCGGGTGAAC CTGGCGGCAG GCAGCATGGA GAAAGGCAAT GAGGTCATGC GGCGCGTCAG CGAGATGGCG CGTCGCACCT ATTCCGACCT CAGCCAGACC GCCGAGGGAT ACCTGGCATT CTCGACCACG CTGACCGAAC TTGGCGTTTC GACGGATCGC CAACTGGACT TCGTGGAAAG CCTGAACAAC GCCCTGGTTG TGTCCGGCGC CAAGGGCCAG ACCGCCGAAC GTGTCATGAG TGCGCTCTCC AAGGCGATGG CGCTCGGATC GCTCCAGGGT GACAACCTGA ATACGGTGAT CGATTCCGGC GGACGTGTCG CGCAGGCCCT CGCCGATTCC ATGGGTGTCA CCACCATGGA ATTGCGCAAG CTGGGATCGG AAGGGAAAAT CGGGCGCCGC GAACTGCTCG GCATCTCGAA GGAGATGGAG AAGCTCCGGC GCGAGGCGGG CGAGATGCCC ACCACGATCC AGGACGGCTT CATGCTTCTG AACAACGCCC TGCTGGAATA TGTCGGGCGT GGGGACGATG CCGTCGGGAT GTCGGGCCGG ATCGCCGAGG CGCTTACGGT CATCGCCGAC AATTTCGATA CCGTCGCCGA TTCCGGCCTG AAGCTTGCCG CGGTTCTCGC GGCCGGCATG CTCGGCCGGT CTATCGGGGG CATGATCACC AAGCTGGGCG CCGCAACCGG GGTGTTGATC AAGTTTGGCG CCGCATTGCG CGCGGCTACC TCCATGGCCA GCGTCGGCAC CGCAATCAGT GGGTTGAGCG CGGCGGCGGG GCCGCTGGTC ATGGTTATCG GAGGTCTGCT CGCTGGCGGT GTCCTGCTTT ACTCGGATCG TGCCCGAGAG GCAGAACAGC GCAGCAAGGA TCTGCGGGAT GAGCTGCAAC GCCTGGGGCT TTACGCACCG GATGCAGCCG TGGCGCTGGA AGAGGTCGCA GATGCTGCCG ATGGTATCGG CACCGAGGAT CAGTTGGCCC GCATCGAGCG GTTCCGCAAG GGCCTCGAAG ATATCAAGGG CACTGGCGGG CTTTGGTCCT GGATCAGTGG TGGCGACGGC GAACTGGACG GGTTGATCCA GCAGGTAGAC CGCTTTTACC ATGCCTTTGA TCAGGCTGAT CTTCCGCTGG TCCGGCAGTT GCGCGATCTG GCAAAGGCTT ATCAGGACAA AGAGATTTCC GCCGAAAGCT TTGCTCGCAG CGTTGCGGCC GCCGATTTGG AGGGCGCAGG GCGTGCAGCG CGTGAATATG CAGCGGCATT GCGCGAAATT GCGGAGCGCT CGAACGCACT TACCACCGGG CTCCTGTTCG ATGGCGTTGC AGTCGAAATC GATGCGGCAA CTGCATCTGT GGAGGGGCTT CTGGAAGAGC TGCACTTCGT CGCCAACCTT CAAGGTCTCG GCGACGTGGT TGTGCAGGAA ATCAGGGATG TCATCGCGGA AATGAACAAG GGCAAGAAGT CCGCCCAAGA GGCTGCCGAT GAGATCGAGG CTCTCGGAAA CACCCGCGCA GACTTTAATG GCCTGCACGG TGATATCGCA AATGCCATCA AGGCCCTTGG TGAACTGCGC GCGGCGGCGA TCCGAACCGC ATCGACCATC GCGGCCACGG TGGCGATGCG CCCGAGTTCT GGTGGCGACG ACACCTCGCT TGATCGCCAG AATGAGGCAT CGGCGGCTTA CATTGCCGAG CATAAGCGTC GCCTGGACCT GACCCGTGAG CAGGTGTCGC TGGAAAAGGA GATGGAGCGG ATCACGAAAG ATGCTGCCCA GAACCAGGCT GTGTTGACGC AAGAGCAAAT CCGCCAGATG GCGGTTGCGA ATCTTGCCGC CGAGGCCCGT CGCGCCGAGG AAGGCCGTTC CGGGCGCTCG GGCGGCGGCC GGAAGTCCGG CGGCGGAAAG GGCAAGAAGG GAACGACGGT CACCGACATA TTCGAGGACG CTGGCCGCGA TCTGGAAAAC CTGGAGCGGC AGATAGAGCT GGTCGGCAAG TCGGCGCAGG AGACTGCCAA GCTGCGTGCG CAGTGGGAAT TGCTGGATGC CGCGAAGAAA GCCGGCCTGC CCATCGACGA CAAGCTGCGC CAGAACATCC TGGAGCATGC CGAACATGTC GGATACCTCA CCGATCAGCT GGAAAAGGCC GAGATTGCGC AGCAGCAATT CGACCAGGCG ATCGATGGCG TCGCAGATGC CTTTGCGGGT GCTTTGATGG CCGGTGAAAG CTTGCGGGAT GGACTGGCGC AGGTGCTCAA GCAGATCGCA GCAGATATCA TCAACAGCGG CATTCGGAAC GCCCTCATCG GCCAGTTCGG CGGGGGAGGT GGTGGGATCC TCGGCGGTCT CTGGCAGTCT ATGATGGTCG GCGGTGACAG GCTGACCGGG GCGTTGCGTC TGGCTGGTCT TCCGGCTCGG GCGAATGGTG GGCCGGTGCA GGCTGGTCAG ATGTATATGA CCGGCGAGAA GGGGCCGGAG CCGTTCGTCC CTGCGGTGAA CGGTCGTATC CTCAGCGTCG CACAGGCACA GGCGGCGTTG CGCGGGCGCG CCGGCGGGCG TTCGGGCGGG ATCAACGCCA CCTTCGCCCC GAACATCAGC ATCGCGCCCG GTGTCACCCA AGCCGAACTG GCCATGACCA TGGCGGCCGC CCAACGGGAG TATGAGCAGA AGTTCCTGCC CATGCTGCAA AAGCACATGC CCAGCTATAA TGAGCGGTAT ACCTGA
|
Protein sequence | MATDLEKLVV QLSADIKGYE REMRKAVGVT NRQARDIEKR FLAMQRNLDG IGSRAAKSLI APFTGIAAAL GGRELIRMTG QWTDLTSRVN LAAGSMEKGN EVMRRVSEMA RRTYSDLSQT AEGYLAFSTT LTELGVSTDR QLDFVESLNN ALVVSGAKGQ TAERVMSALS KAMALGSLQG DNLNTVIDSG GRVAQALADS MGVTTMELRK LGSEGKIGRR ELLGISKEME KLRREAGEMP TTIQDGFMLL NNALLEYVGR GDDAVGMSGR IAEALTVIAD NFDTVADSGL KLAAVLAAGM LGRSIGGMIT KLGAATGVLI KFGAALRAAT SMASVGTAIS GLSAAAGPLV MVIGGLLAGG VLLYSDRARE AEQRSKDLRD ELQRLGLYAP DAAVALEEVA DAADGIGTED QLARIERFRK GLEDIKGTGG LWSWISGGDG ELDGLIQQVD RFYHAFDQAD LPLVRQLRDL AKAYQDKEIS AESFARSVAA ADLEGAGRAA REYAAALREI AERSNALTTG LLFDGVAVEI DAATASVEGL LEELHFVANL QGLGDVVVQE IRDVIAEMNK GKKSAQEAAD EIEALGNTRA DFNGLHGDIA NAIKALGELR AAAIRTASTI AATVAMRPSS GGDDTSLDRQ NEASAAYIAE HKRRLDLTRE QVSLEKEMER ITKDAAQNQA VLTQEQIRQM AVANLAAEAR RAEEGRSGRS GGGRKSGGGK GKKGTTVTDI FEDAGRDLEN LERQIELVGK SAQETAKLRA QWELLDAAKK AGLPIDDKLR QNILEHAEHV GYLTDQLEKA EIAQQQFDQA IDGVADAFAG ALMAGESLRD GLAQVLKQIA ADIINSGIRN ALIGQFGGGG GGILGGLWQS MMVGGDRLTG ALRLAGLPAR ANGGPVQAGQ MYMTGEKGPE PFVPAVNGRI LSVAQAQAAL RGRAGGRSGG INATFAPNIS IAPGVTQAEL AMTMAAAQRE YEQKFLPMLQ KHMPSYNERY T
|
| |