Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_2461 |
Symbol | |
ID | 4580794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008686 |
Strand | + |
Start bp | 2472753 |
End bp | 2477666 |
Gene Length | 4914 bp |
Protein Length | 1637 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639769790 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_916244 |
Protein GI | 119385188 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.489367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.518001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGGAC CGCTGGCGGC GCAGACCCTG CCCAGCGGGC CGAACGTGGT CGGCGGCAGC GCCAGCGTCG GCACGCCGCG CACCGGCGCC ATGCGCGTGG ACCAGCGGAG CGACCGCGCC ATCATCGACT GGAGCAGCTT CAGCATCGGG GCCGGCGGCT CCGTCCGGGT GCATCAGCCG GGACGTGACG CAGCGCTGCT GAACCGGGTG ACCGGAGACC GCCCGACCCG CATCGACGGT TTTCTGGGCG CGAACGGCCA GGTCTTCGTC GTCAACCGCA ACGGCATCCT GGTCGGGCGC GAGGGACGTA TCGCCACGGC GGGTTTCGTC GGCTCGACGC TGGACATCTC GAACGAGGAT TTCACCGCCG GCCGGCTGCG CTTTTCCGGC GACCGGCCCG GCACGGTCGA GAATCGCGGC CACATCGACA TCATCCCCGG CGGCTATGCC GCGCTCTTGG GCGGACGGGT CGCCAACAGC GGCACGATCC GCGTGCCGCT GGGCACGGTG GGGCTGGGCG CGGGGCGGCG GGCGGTGCTG AACCTTTCGG GCGACAACTT CCTGTCGGTG GCCCTGCCGC CCGCCGAGGA CGGCGAGGCC ATGGCGCTGG TCCGGCAGCA CGGCCGGATC TCGGCCGATG GCGGCGTGAT CGAGATCGAG GCGGCGACGG CACGCCATGC CGCGCGCCAT GCGATCAACC TCACCGGCGT GACCGAGGCG CGCACCGTCT CGGGCCGCTC GGGCCGGGTG GTGTTGGGCG GCGGCGACGG CGGCCGGGTG CGGGTCGCGG GCCGGGTCGA TGTCAGCGGC GGCCGACGCG CGGCGGCACC AATCACCGCC GAACGGCCTG CGGTGCGGCC CCGGCGCGGC GGCAGCATCA CCATCACCGG CGACCGCATC GCGCTGGAGG GCGCCGTTTT GGATGCCAGC GGCACCGGCG GCGGCGGGCT GATCCGCGTC GGCGGCGATT ATCGCGGCAC CGGCGACCTG CCCCGGGCGC GGCGCACCAC GCTGGATGCC AGGGCGCAAC TGCTGGCCGA CGGGATCGGC CGTGCCGATG GCGGGCGGAT CATCGTCTGG TCGGATGCCC ATACCCGCTT TGCCGGCGGC GTCTCGGCCC GGGGCGGCAC TCAGGGCGGC AACGGCGGCT TTGCCGAGAT ATCCAGCCGC GGCGAGTTGG CGATCCGCAG TTCCGACGTG CGGGTCTCGG CCCCCCGCGG CCGGCCGGGC ACCGTGCTTT TCGACCCGCA GAACCTGCGC ATCGTCGGCG AGGACAGCTA TGACCCCGAC GATCCGACGC ATGTGCTGGT GACGGATATC TACAGCATGT TGCTGGATGC GGGGCATTAC ATCCTGTCGA CCGAGGGAGA GGGCGACGAT GCCGGCGACC TGGTGGTCGA CACGCCGATG ACCCTGACCT TCGCCGCCGG CGCGGCCAGC CATCTGGACC TGCGCGCCGA CAACGACCTG CTGGTCAACG CCGCGATGAG CTGGAGCGGA CCGGGCCAGT TGTCGCTGAC CGCGGGCGCC GGCATCACCA GCATCGGCGC GCTGAGCTGG AGCGGCGCCA CGGCGCTGAA CATGACCGCA GGCGAAAGCA TCGCGCTGAA CGCGTCGGTG CAGGGACCGG CCGGGGCGCT GAACCTGCAA GCGCCGCTGA TCACGGCGAT GGCGGGCGTC GCAGTGGACA GCTTCCGCCT GAACGGCGCC TCGCAATGGC GGCAGGTCGC CGCCACGCTG CCCGGCTTTT CGGCGCGCGA CTTCGCCATA TCCGACACCG CGGGCTTCCT GCGTGCGACG GGCGGCGCCG GGACCGCCGC CAGCCGCTAT GTCATCGCCG ATGTCTACGG GCTCCAGGGC ATCGGCTCGG CCGGCCATGC CGGAGCGCAT TACGCCCTGG GCGCCGATAT CGCGGCGGCG GGCACGGCCG CCTGGAACCT TGGCGCGGGG TTCCGGCCCA TCGGCAGCCG GGGGACGCCG TTCTCGGGCA GCCTCGACGG CGCCGGGGCC GGCGGCGGCC ATGCGATCAC CGGGCTTGCG CAGGTCGTCT CCTCGGGGCC GGGCGGGCTG TTCGGCACCA TCGCCGGCGC CACGATCCGC AACCTGCGCG TGCTGGACAT CGACCTGAGC GCGCAGCAAG GCTTTGCCGC CGTCGCAGGC GGCCTGGTCG GAGAGACGCA GGCCGGCGAG GCGCCGAACG TGATCGCGAA TGTGCTGGTC ACCGGCCGGA TCGAAGCCGA GATGGGCGAG GATTCGGTCA GCAGCGCCAG CTTTGGCGGG CTAGCCGGGG TTTTCGGCAA CGGCCGCATC ACCGGCGCGG AATCGCGCGT CACCCTGGCG CTTTCGGGCG AGGCGGACGG CTGGGGCGAC GTGGTTCTTG CCGGCGGCCT TGTCGGGCAG TCGCTGGGCG GGACGACGAT CACCGGCAGC CGCTATGCCG GCAGCATCGA ATCCGGCTTC GACGGGTCCG AGGACAACAG CGAGGGCTTC GCCCCCGCCT CGCGCATCGG TGGGCTGGTC GGGATGACCG ATGACGGCGA CAGCATCGCG GATTCAACCG CCGCCGCCCA GATCACCCAG ACCGGCAACG GAAACTGGGT CATCGGCGGG CTGGTGGGCG AGAATGCGGG GGCGCTGACC GGGGTCTCGG CAACCGGCGG CATCGCCCTG ACGCAGGGAG GGACAGAGAC CATCCGCCTG CTGGCCCTTG GCGGCCTGGC CGGGACGAAC AGCGGCACCG TCTCGGATGC CTGGAGCGAT GTGGCCATGG ACCTCGACAC CGCGGGCTAT GTCCGGGCCG GCGGACTGAT CGGCACAAAC GAGGGCACGG TCGGCACCGC CTATGCCCTG GGCAGTCTGG GCCTTGCCCT GTCCGGCCTG CCCGAAAGCG GCAGCATCGC CGAGATCGGC GGGCTGATCG GCGCGAATGA AGGCACGATC GCCGATGTCT TTGCCGGCAA TGCCGTCGAC ATCTCGGGTG ATGCGGCCGT CGCGGCGGGC GGGCTGGTCG GCTGGAATCC GGGCGAAATC GCCCGAGCCC GCGCCTCGGG CCGCCTGGGG GTGGCGCTGG ACGGCACGGG CGGCCCGGAT GCGCCGCGCT CGGCCCTGGG CGGGCTGGCG GGCTGGAACG ACGGCGGCAT CGCCGACGCC TATTCGCTGG CGCCCGTGAC CTATTCCGGC AACCTGCCGG CGACCATCGG CGGGTTGGTC GGCTCGAACG CCGGCAGCAT CGAGCGCAGC TATGCCGCGG GCCGGATCGC GGCCACCGCA ACGCCGGCCC TGCTGCGCAC CGGCGGGCTG GTCGGGGCCG ATGACGAAGG CGAAGGCACC GGCAGCGTCG CCGTGTCGTT CTGGGACCGC CAGACCAGCG GGCAGCAGCA ATCCGCCGGC GGCACCGGGC TGACCACGGC GCAGTTGCGC GATACTGCGG GCTTCATGGG CCGCGCCACG GGCTGGAGCT TTACCACCAC CTGGGCGCCG GGCGGCAATG GCGCCTATCC GCAGATCTAC AGCATCGACC CCGTGCTCTG GACCCAGCCC ACGCCGCTGA CCGCGACCTA TGGCGACGCC CTGCCCACGC CCGGCGGCAC GGTGCATGGG CTGGGGCGCT ATCTGTTCGG CGACCAGCCC TTGGTCGCGG GGATCTTCAG CCTGCCGGCG GGCACGCGCA ATGCCGGGAC TTATGCCATC CAGACGGCGG GCACGGTCAC CTCGCCCGGC GGCACCAGCT ATGACATCAT CGCCTCGGCG GCGAGCCTGA CCATCAACCG CGCGCTGCTG ACCGTCTCGG CCGACGACCT GAGCAAGATC TATGGCGAGC GGCTGCTGTT CTCGACCGGC GATGCCACCG CGGCCGGGCT GCGCCATGAC GACCGGCTGA CCGCGGTCGA TCTGGCCAGC GCGGGCGCCG AGGCCGGGGC CTGGGTCGCC GGCACGCCCT ATGCCATCAC CGCAGGCGGA GCCCGGATCG GCGGCGCGGA CGGCGATGCG ACCGGCAATT ACATCATCAG CTATGCCGAG GGCGCGCTGG AGGTGGTGCC GCGCAGCATC ATCATCGGCG CGCGCGACAG CCTGGCCCGG TTCGGCACCG CGCCGCTATT GGACTGGGCG CTGACCGGCG GAAGCCTGGC CGAGGGCGAC ACGATCACCG GCGTCCTGCT GGCCAGCGAT GCCACGCCGG CCAGCCCGCC CGGCCTTTAT GGCATCACCG CCTCGGATGC GCGGATGCTG GACGGGGCCG AGGCGAACTA TGCCATCGCC TATGCGCCGG GAACCTTGCA GATCGTCAAC CCGGCCGCGA CCCGGGGGAT CCCGATCCCG CAGAACACCT TGCCGGGAAC GGCGCTGCCG AACCCGACGG ACGGCCCGGT GCCCTTCCCG ATCACGGTGG CCGGCACGGC TTCGGTGCTG CCGGGCGGGC CGCCGCCGGT CACCTCCGGC CCGCAGGACG ACGGGCTGAC CCGGCTGGCC GCATTGTCGG ACGAGGTCTC GCAGATGATC GACGCCTGCA GCCAGCACGA GGGCCAGGCC GAGGACATGC TGGCCTGCCT GTCGCGGGCG CTGGACCGCT ATTCCAGCGC GCTGGATGAG CTTTCGGCCG AACTGCCGCC CTCGATGCAG ACGGTCTCGG CCATCCTGCG CCAAGCCAGC GCGGATATCG GCGCGGCGCG CAGCCGGGCG CTGGACCGGC TGGCCACGGC GGGCAGCGAG GCCGAGCGCC GCGCCATCCG CCGCGACGCC CTGCGCGAGG CCAGCCGGAC GATGTCGAAC GCCCGGGCCG AAATCGTCAA GCAGATCGAA TTGCTGCGGG TCGAAGACCC CGAACTGGCC CGCACCCATG CCCGGCAGGA GGGGCTGATC CTTGCCACGG TCGAGAAAGC CGATGCCGTC CTGGTGCGCG CGGTCGGGCT TTAG
|
Protein sequence | MPGPLAAQTL PSGPNVVGGS ASVGTPRTGA MRVDQRSDRA IIDWSSFSIG AGGSVRVHQP GRDAALLNRV TGDRPTRIDG FLGANGQVFV VNRNGILVGR EGRIATAGFV GSTLDISNED FTAGRLRFSG DRPGTVENRG HIDIIPGGYA ALLGGRVANS GTIRVPLGTV GLGAGRRAVL NLSGDNFLSV ALPPAEDGEA MALVRQHGRI SADGGVIEIE AATARHAARH AINLTGVTEA RTVSGRSGRV VLGGGDGGRV RVAGRVDVSG GRRAAAPITA ERPAVRPRRG GSITITGDRI ALEGAVLDAS GTGGGGLIRV GGDYRGTGDL PRARRTTLDA RAQLLADGIG RADGGRIIVW SDAHTRFAGG VSARGGTQGG NGGFAEISSR GELAIRSSDV RVSAPRGRPG TVLFDPQNLR IVGEDSYDPD DPTHVLVTDI YSMLLDAGHY ILSTEGEGDD AGDLVVDTPM TLTFAAGAAS HLDLRADNDL LVNAAMSWSG PGQLSLTAGA GITSIGALSW SGATALNMTA GESIALNASV QGPAGALNLQ APLITAMAGV AVDSFRLNGA SQWRQVAATL PGFSARDFAI SDTAGFLRAT GGAGTAASRY VIADVYGLQG IGSAGHAGAH YALGADIAAA GTAAWNLGAG FRPIGSRGTP FSGSLDGAGA GGGHAITGLA QVVSSGPGGL FGTIAGATIR NLRVLDIDLS AQQGFAAVAG GLVGETQAGE APNVIANVLV TGRIEAEMGE DSVSSASFGG LAGVFGNGRI TGAESRVTLA LSGEADGWGD VVLAGGLVGQ SLGGTTITGS RYAGSIESGF DGSEDNSEGF APASRIGGLV GMTDDGDSIA DSTAAAQITQ TGNGNWVIGG LVGENAGALT GVSATGGIAL TQGGTETIRL LALGGLAGTN SGTVSDAWSD VAMDLDTAGY VRAGGLIGTN EGTVGTAYAL GSLGLALSGL PESGSIAEIG GLIGANEGTI ADVFAGNAVD ISGDAAVAAG GLVGWNPGEI ARARASGRLG VALDGTGGPD APRSALGGLA GWNDGGIADA YSLAPVTYSG NLPATIGGLV GSNAGSIERS YAAGRIAATA TPALLRTGGL VGADDEGEGT GSVAVSFWDR QTSGQQQSAG GTGLTTAQLR DTAGFMGRAT GWSFTTTWAP GGNGAYPQIY SIDPVLWTQP TPLTATYGDA LPTPGGTVHG LGRYLFGDQP LVAGIFSLPA GTRNAGTYAI QTAGTVTSPG GTSYDIIASA ASLTINRALL TVSADDLSKI YGERLLFSTG DATAAGLRHD DRLTAVDLAS AGAEAGAWVA GTPYAITAGG ARIGGADGDA TGNYIISYAE GALEVVPRSI IIGARDSLAR FGTAPLLDWA LTGGSLAEGD TITGVLLASD ATPASPPGLY GITASDARML DGAEANYAIA YAPGTLQIVN PAATRGIPIP QNTLPGTALP NPTDGPVPFP ITVAGTASVL PGGPPPVTSG PQDDGLTRLA ALSDEVSQMI DACSQHEGQA EDMLACLSRA LDRYSSALDE LSAELPPSMQ TVSAILRQAS ADIGAARSRA LDRLATAGSE AERRAIRRDA LREASRTMSN ARAEIVKQIE LLRVEDPELA RTHARQEGLI LATVEKADAV LVRAVGL
|
| |