Gene Mpe_B0367 details

Gene Information

Locus tag: Mpe_B0367
Symbol: traB
ID: 4787995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 317220
End bp: 318428
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID: 640092799
Product: sex pilus assembly protein
Protein accession: YP_001023377
Protein GI: 124262907
COG category:
COG ID:
TIGRFAM ID:
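
The start, end, and length fields above agree with one another if the coordinates are 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon.

    ASSUMPTION: the gene length includes exactly one terminal stop codon,
    which is not translated into a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(317220, 318428)
print(length, protein_length_aa(length))  # prints: 1209 402
```

Both results match the listed Gene Length (1209 bp) and Protein Length (402 aa).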


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0565513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCCCA AGTCGATCAA GGACTTCTGG GTCACCGCAG GGCCCAAGAA GCGAAACATG 
GTGCTGATGG CTGGAGCCGC CTTCGCGATG ATCGTCGTCG CCACGGTGAT GGACTCCGGT
TCGTCGAAGG GTGGGCCGGC GCGCAAGTCG CCCGTCGACA CCAGCCGCAC GCAGCTGATG
CTTCCGAAGG CGCCGGACAA CTCCGTCGAA GCGCTTGCGG CGGACAGCCG CGCGCAATCC
GAGCAGCTCA CGCGTCTGCA GGAGCAGCTG AAGAAGGAGA CGGCCGACAA GGAACTGCTG
CTCAAGCGGC TGGACGAGGG TGATCGCGGC CGCAAGCCGG ACGCAGTCAC CACCGACCTG
TTGAACGAGG TCGTTGCCCT CAAGACCAAG ATCCAGGAAA TCGAGACGCG TGGCGCTCCT
GTTGCCCAGG CTGCCGCTTC GTCGCCTTCG CTGGGCGACC CGCTTCCTGG CGCCAGCGTA
CCGATGGCGG AGCCGCCGGC TCCGGCCGAG CCGGTCAACC GGTTGCGCGT CAGCGGCGAG
GCCAAGAAGA TCGATCGCAA GGCCGCAGTC TCCGAAGACA AGCCGGTTGC CTACATCCCG
GCCGGCTCGT TCCTGGAGGC CTCTCTCCTG AACGGTATGG ATGCGCCGGT TTCGTCGGTC
GCGCAGAAGA ACCCGGTGCC CGCGGTGATG CGAGTCAAGA CCGAGGCGGT CCTGCCGAAC
CACTTTTCCC AGGACGTCAA GGAGTGCTTC GTCCTCGTGA GCGGCTTCGG CGTGCTCAGC
AGCGAACGCG TCCAACTGCG CACGGAGACG ATCTCGTGCG TCAAGGAAGA CGGCAAGGCC
ATCGAAGCCA AGATCGACGG CTACGTCGTC GGCGAAGACG GCAGTGTCGG ACCTCGTGGC
CGCCTCGTGA GCAAGCAGGG CCAGCTGATC GCTCGCTCGC TGGCCGCTGG CGTGCTCGCG
GGCTTCGGCG AGGCGCTGAC TCCTCAGGCC GTTCCGCAGC TGAGCCTGTC GCCCAGCGGC
ACCACCCCGA CCACTCGCCT CGACGCCCAG ACCTTCGCGG CGACCGGCGT GGCGCGCGGC
TTCTCCGATG CATCGAAGGC GGTCTCCGGC TTCTTCCTCG AGATGGCGCG CGAAGCAACC
CCTGTCGTCG AGATCAACGC CGGTCGCAAG CTGACGATCG TCGTCATCAA GGGCTTCGAA
CTCAAGTAA
 
Protein sequence
MLPKSIKDFW VTAGPKKRNM VLMAGAAFAM IVVATVMDSG SSKGGPARKS PVDTSRTQLM 
LPKAPDNSVE ALAADSRAQS EQLTRLQEQL KKETADKELL LKRLDEGDRG RKPDAVTTDL
LNEVVALKTK IQEIETRGAP VAQAAASSPS LGDPLPGASV PMAEPPAPAE PVNRLRVSGE
AKKIDRKAAV SEDKPVAYIP AGSFLEASLL NGMDAPVSSV AQKNPVPAVM RVKTEAVLPN
HFSQDVKECF VLVSGFGVLS SERVQLRTET ISCVKEDGKA IEAKIDGYVV GEDGSVGPRG
RLVSKQGQLI ARSLAAGVLA GFGEALTPQA VPQLSLSPSG TTPTTRLDAQ TFAATGVARG
FSDASKAVSG FFLEMAREAT PVVEINAGRK LTIVVIKGFE LK