Gene Mpe_A3467 details


Gene Information

Locus tag: Mpe_A3467
Symbol:
ID: 4786285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3677154
End bp: 3678761
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 73%
IMG OID: 640092047
Product: Flp pilus assembly protein TadD
Protein accession: YP_001022655
Protein GI: 124268651
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID: [TIGR02466] conserved hypothetical protein
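
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the gene length counts the stop codon while the protein length does not. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3677154, 3678761)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

With the values from this record, the sketch gives 1608 bp and 535 aa, which agrees with the Gene Length and Protein Length fields.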


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGAC CCGCCCCCCG CTCCGCACGC GGCCCCCACC CGGCCGAGGT GAACGCCGTG 
CTCGCGCACT GCCAGGCCGG CCGCTGGCCG GAGGCAGAGA CCGCAGCGGC GCGGCTGTTG
AAGAGCCACC CGCAGGATGC GGCCCTGCAC AACCTGCACG GCACGGCCTG CTCGGAACAG
CACAAGCTCG ACGCCGCGGC GGCCAGCTTC CGCCGGGCGG CGGCGCTGGC GCCGCAGTCG
GCCGAACTGC TGTTCAACCT GGCCGCCACC TGCGGCCGCC TGGGCCGGCT CGACGAGGCC
GTGGCCGCCT ACCGGCGGTC CGTCGCGCTG AAGCCCGACT TCGCGGTGGC GCACTACAAC
CTCGGCACCG CGCTCAAGGA TTTGCAGCAG CTCGACGAGG CCGTCACCAG CCTGCGGCGC
GCGGTGGCGC TGCAGCCGGG CTACGCCGCC GCGCACGCGA ACCTCGGCGC CGTGCGCCAG
GCCCAGGGCC ATCTCGACGA CGCCATCGCG TGCTACCGCG CCGCGCTGGC GATCACGCCC
ACCGCCCGGG CGCACCTGAG CCTCGCCTCG GCGCTGCGTG CGCACGGCCT GCTCGACGCC
GCGGCCGCCA GCCTGCGTGA CGCGCTCGCG CTCGACCCGG CCTACGCCGA CGCCCACAAC
AATCTCGGCG AGACGCTGTG GGACCAGGGC CGTGTCGACG ATGCGCTGGC CAGCTACCGC
GCCGCACACC GCCTCGATCC CGCGCACCCC GAGGCCAACC ACAACCTCGG CGTCCTGCTC
CAGGCCGCCG GGCAATGGGG CGATGCCATC GCCTGCTTCG AGCGTTCGCA GCTGCGCGAC
TGGCAGGAGC GGCGCCTCTA CTGCCTGTAC AAGACCGAGC GCTACGCCGA GTTCCGCGCC
GCGCTGGCGC CGATGCTGTC CGCCAGCCCG CACCGCTCGC CCTTTCTCGC CACGCTGTCG
GCGCACCACG CCGCGAACTT CGCCGAACCC GACCCCTACG GTTTCTGCCG CACGCCGCTC
GATTTCGTGC AGCATGCCCG CATCGACGCG TTGGCCGCGC CGGGCAGCGC CCTCGTCACC
GAGTTGCTGC GCGACATCGA ACACGCCGAG ATCGCCGAAC GCAAGCAGGG CCGGCTGCAC
CACGGCATCC AGTCGGCCGG CAACCTGTTC AAGCGCCCCG AGGACTCGTT CCGCCGCCTC
GCGGCGCTGA TCGGGCAGGC GGTCGTCGCC TACCGCGCGA AGTGGGCCGG CGCCGATTGC
GAGTACGCGC GCGCCTTCCC CGCCGACCCG GTGTTCAGCA GCTCGTGGTA CGTGAAGATG
CGGCAGGGCG GCCACCTCAC CTCGCACATC CACGAGACCG GCTGGCTGAG CGGCGTGGTG
TACCTGGCAC TGCCGCCGCG CGCCGAGGGC AGCGACGACG GCTGCATCGA GTTCAGCACC
GACGGCGACG GTTATCCGCG CCGGCACGAG CACTTCCCGC GCCGGGTGCT GGCGCCTCAG
GTCGGCGACC TGGTGCTGTT CCCGTCGTCG CTGTTCCACC GCACGCTGCC GTTCCGCGCC
GACGCCGACC GCGTGTGCAT CGCCTTCGAT ATCGCGCCCG GTCCCTAG
 
Protein sequence
MQRPAPRSAR GPHPAEVNAV LAHCQAGRWP EAETAAARLL KSHPQDAALH NLHGTACSEQ 
HKLDAAAASF RRAAALAPQS AELLFNLAAT CGRLGRLDEA VAAYRRSVAL KPDFAVAHYN
LGTALKDLQQ LDEAVTSLRR AVALQPGYAA AHANLGAVRQ AQGHLDDAIA CYRAALAITP
TARAHLSLAS ALRAHGLLDA AAASLRDALA LDPAYADAHN NLGETLWDQG RVDDALASYR
AAHRLDPAHP EANHNLGVLL QAAGQWGDAI ACFERSQLRD WQERRLYCLY KTERYAEFRA
ALAPMLSASP HRSPFLATLS AHHAANFAEP DPYGFCRTPL DFVQHARIDA LAAPGSALVT
ELLRDIEHAE IAERKQGRLH HGIQSAGNLF KRPEDSFRRL AALIGQAVVA YRAKWAGADC
EYARAFPADP VFSSSWYVKM RQGGHLTSHI HETGWLSGVV YLALPPRAEG SDDGCIEFST
DGDGYPRRHE HFPRRVLAPQ VGDLVLFPSS LFHRTLPFRA DADRVCIAFD IAPGP
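
The sequences above are printed in blocks of 10 residues, 60 per line. As a minimal sketch of how such a listing might be processed, the code below strips the spacing, computes GC content, and translates a coding sequence. The function names are hypothetical. The codon table is the standard amino acid assignment, which NCBI translation table 11 shares with table 1 (the two differ only in their permitted start codons).

```python
# Standard codon-to-amino-acid assignment, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored; a codon with
    characters other than A, C, G, T raises KeyError.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = clean("ATGCAACGAC CCGCCCCCCG CTCCGCACGC")
print(translate(first_blocks))  # MQRPAPRSAR
```

The first 30 bases translate to MQRPAPRSAR, the first block of the protein sequence above. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field, although that comparison is not carried out here.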