Gene Information |
Locus tag | Mpe_A3467 |
Symbol | |
ID | 4786285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3677154 |
End bp | 3678761 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640092047 |
Product | Flp pilus assembly protein TadD |
Protein accession | YP_001022655 |
Protein GI | 124268651 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02466] conserved hypothetical protein |
| |
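The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 3677154, End bp 3678761.
length = gene_length_bp(3677154, 3678761)
print(length, protein_length_aa(length))  # prints "1608 535"
```

Both results agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields in the record.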
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGAC CCGCCCCCCG CTCCGCACGC GGCCCCCACC CGGCCGAGGT GAACGCCGTG CTCGCGCACT GCCAGGCCGG CCGCTGGCCG GAGGCAGAGA CCGCAGCGGC GCGGCTGTTG AAGAGCCACC CGCAGGATGC GGCCCTGCAC AACCTGCACG GCACGGCCTG CTCGGAACAG CACAAGCTCG ACGCCGCGGC GGCCAGCTTC CGCCGGGCGG CGGCGCTGGC GCCGCAGTCG GCCGAACTGC TGTTCAACCT GGCCGCCACC TGCGGCCGCC TGGGCCGGCT CGACGAGGCC GTGGCCGCCT ACCGGCGGTC CGTCGCGCTG AAGCCCGACT TCGCGGTGGC GCACTACAAC CTCGGCACCG CGCTCAAGGA TTTGCAGCAG CTCGACGAGG CCGTCACCAG CCTGCGGCGC GCGGTGGCGC TGCAGCCGGG CTACGCCGCC GCGCACGCGA ACCTCGGCGC CGTGCGCCAG GCCCAGGGCC ATCTCGACGA CGCCATCGCG TGCTACCGCG CCGCGCTGGC GATCACGCCC ACCGCCCGGG CGCACCTGAG CCTCGCCTCG GCGCTGCGTG CGCACGGCCT GCTCGACGCC GCGGCCGCCA GCCTGCGTGA CGCGCTCGCG CTCGACCCGG CCTACGCCGA CGCCCACAAC AATCTCGGCG AGACGCTGTG GGACCAGGGC CGTGTCGACG ATGCGCTGGC CAGCTACCGC GCCGCACACC GCCTCGATCC CGCGCACCCC GAGGCCAACC ACAACCTCGG CGTCCTGCTC CAGGCCGCCG GGCAATGGGG CGATGCCATC GCCTGCTTCG AGCGTTCGCA GCTGCGCGAC TGGCAGGAGC GGCGCCTCTA CTGCCTGTAC AAGACCGAGC GCTACGCCGA GTTCCGCGCC GCGCTGGCGC CGATGCTGTC CGCCAGCCCG CACCGCTCGC CCTTTCTCGC CACGCTGTCG GCGCACCACG CCGCGAACTT CGCCGAACCC GACCCCTACG GTTTCTGCCG CACGCCGCTC GATTTCGTGC AGCATGCCCG CATCGACGCG TTGGCCGCGC CGGGCAGCGC CCTCGTCACC GAGTTGCTGC GCGACATCGA ACACGCCGAG ATCGCCGAAC GCAAGCAGGG CCGGCTGCAC CACGGCATCC AGTCGGCCGG CAACCTGTTC AAGCGCCCCG AGGACTCGTT CCGCCGCCTC GCGGCGCTGA TCGGGCAGGC GGTCGTCGCC TACCGCGCGA AGTGGGCCGG CGCCGATTGC GAGTACGCGC GCGCCTTCCC CGCCGACCCG GTGTTCAGCA GCTCGTGGTA CGTGAAGATG CGGCAGGGCG GCCACCTCAC CTCGCACATC CACGAGACCG GCTGGCTGAG CGGCGTGGTG TACCTGGCAC TGCCGCCGCG CGCCGAGGGC AGCGACGACG GCTGCATCGA GTTCAGCACC GACGGCGACG GTTATCCGCG CCGGCACGAG CACTTCCCGC GCCGGGTGCT GGCGCCTCAG GTCGGCGACC TGGTGCTGTT CCCGTCGTCG CTGTTCCACC GCACGCTGCC GTTCCGCGCC GACGCCGACC GCGTGTGCAT CGCCTTCGAT ATCGCGCCCG GTCCCTAG
|
Protein sequence | MQRPAPRSAR GPHPAEVNAV LAHCQAGRWP EAETAAARLL KSHPQDAALH NLHGTACSEQ HKLDAAAASF RRAAALAPQS AELLFNLAAT CGRLGRLDEA VAAYRRSVAL KPDFAVAHYN LGTALKDLQQ LDEAVTSLRR AVALQPGYAA AHANLGAVRQ AQGHLDDAIA CYRAALAITP TARAHLSLAS ALRAHGLLDA AAASLRDALA LDPAYADAHN NLGETLWDQG RVDDALASYR AAHRLDPAHP EANHNLGVLL QAAGQWGDAI ACFERSQLRD WQERRLYCLY KTERYAEFRA ALAPMLSASP HRSPFLATLS AHHAANFAEP DPYGFCRTPL DFVQHARIDA LAAPGSALVT ELLRDIEHAE IAERKQGRLH HGIQSAGNLF KRPEDSFRRL AALIGQAVVA YRAKWAGADC EYARAFPADP VFSSSWYVKM RQGGHLTSHI HETGWLSGVV YLALPPRAEG SDDGCIEFST DGDGYPRRHE HFPRRVLAPQ VGDLVLFPSS LFHRTLPFRA DADRVCIAFD IAPGP
|
| |
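The sequences above are printed in space-separated groups of 10 characters, so they need to be cleaned before any computation. A minimal sketch of removing the spacing and computing GC content, assuming the plain definition (G plus C bases divided by total bases). The helper names are hypothetical, and the example uses only the first two groups of the gene sequence, so its result is not the whole-gene value reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base groups of the gene sequence listed in the record.
example = "ATGCAACGAC CCGCCCCCCG"
print(len(clean_sequence(example)), round(gc_percent(example)))  # prints "20 75"
```

Passing the full Gene sequence field to `gc_percent` would give the whole-gene figure to compare against the GC content field.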