Gene M446_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2028 
Symbol 
ID6129495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2263123 
End bp2264748 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content76% 
IMG OID641642258 
Producttype II secretion system protein E 
Protein accessionYP_001768926 
Protein GI170740271 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0405936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGCGT TCGAGCGGTC CGTCACGGGA GAGGCCCTGC CGCCGGAGGC GGGGGGCGCG 
GAGGCGCTCT GGCGCGCGGG CGGCGTCCCG GCGGGGGACC TCGCCCAGGC GCTCGCCGCG
CATCACGGCC TGCCCCGCCT CGACCGCGAC GCGGTCGCGG CCCTGCCGGA CCTGACCGCC
GGGCTGTCCC GCCGCTTCCT CGCCGACGCC TTCCTGTACC CGGTCGCCGG GCCGGAGGGG
CCGCTCCTCC TGGTGGCCGA TCCCGGAAAC GAGGAGGCGA TCCGCGCCGT GGCGCTCGCC
CTCGGCTGCG CGCCCCGGCT CGGGGTCCTC TCCTTCGAGG AGATCGGCGA ACTCATCGCC
CGGGCCGAGG GCGCCACCGC CGCGCCCGAG GCGCGGGCGC GCCCCGCCCA CGCGGATCTC
GGCCTCACGG ACGACGTCGA GGCGCTGCAG GACCTCGCCC GGGGCGCCCC GATCGTGCGG
GCCATCGACG CGCTCCTGGA GCGCGCCGTC GAGGTCGGGG CGACCGACAT CCACCTGGAG
ACCGGCCGCG AGGAGTTGCG GGTGCGGCTG CGGATCGACG GGCGGCTGCG CCTGCACCAG
ACCCTGCCGA AGGACATGGC CCCGGCGATC ATCTCGCGGG TGAAGATCCT GGCCGGGCTC
GACATCGCCG AGCGGCGCCT GCCGCAGGAC GGGCGCACCA ATGTGCGGGT GCGCGCGAGC
GAGGCGGATC TGCGCGTCGC CGTGATGCCG ACCCTCTACG GCGAGACGGC GGTGCTGCGC
ATCCTGGTGA AGGACGCGCG CCTCCTCGAC TTCGCCCGGG TCGGCCTCTC GGCCCGCGAC
CGGGACGCGC TGGAGCGGAT GCTCGGCGAG CCGCACGGGC TGATCATCGT CACCGGGCCG
ACCGGCAGCG GCAAGACCAC CACGCTCGCC ACCGCGGTCT CGCTCCTCAA CGACCCGGCC
CGCAAGATCG TCACGGTCGA GGACCCGATC GAGTACCAGA TCCCCGGCAT CCACCAGACC
CAGATCAAGC CCGGCATCGG GCTCACCTTC GCGAACGCGC TGCGCTCCTT CCTGCGCCAC
GATCCGGACG TGATCATGGT CGGCGAGATG CGCGACCGCG AGACGGCGGC GATCGGCATC
CAGGCGGCGC TCACCGGCCA CCTCGTGCTG ACCACCCTCC ATACCAACAG CGCCCCGGAC
GCGGTCATCC GCCTCGCCGA CATGGGCGTC GAACCCTACC TGATCGCCGC GTCCCTGCGG
GGGGTCGTCG GGCAGCGCCT GGTGCGGCGC CTGTGCGAGC GCTGCCGGGC GCCGGATCCG
GACGGGGGCG CGGCGCTCGA CGCGGTCTGC GCCCGCCGGG GCTTCGCGCG GCCGGGGGGC
GGGCGGGTCC ATCGCCCGGT CGGCTGCCCG CATTGCGGCG GCAGCGGCTT TCGCGGCCGG
GTCGGGGTCT TCGAGGTGAT GCCGGTGGAC GAGGCGCTCA CCGGCCTGAT CCGGCGCGAG
CCCGACCCGC TGGTGCTGCT GCGCGCCGCC CGCGAGGCCG GGATGACGAC GATGCTGGAG
GACGGCCTCG CCAAGGCCGC CGACGGGCTC ACCTCGCTCG ACGAGGTGAT GCGGATGACC
GGCTAG
 
Protein sequence
MGAFERSVTG EALPPEAGGA EALWRAGGVP AGDLAQALAA HHGLPRLDRD AVAALPDLTA 
GLSRRFLADA FLYPVAGPEG PLLLVADPGN EEAIRAVALA LGCAPRLGVL SFEEIGELIA
RAEGATAAPE ARARPAHADL GLTDDVEALQ DLARGAPIVR AIDALLERAV EVGATDIHLE
TGREELRVRL RIDGRLRLHQ TLPKDMAPAI ISRVKILAGL DIAERRLPQD GRTNVRVRAS
EADLRVAVMP TLYGETAVLR ILVKDARLLD FARVGLSARD RDALERMLGE PHGLIIVTGP
TGSGKTTTLA TAVSLLNDPA RKIVTVEDPI EYQIPGIHQT QIKPGIGLTF ANALRSFLRH
DPDVIMVGEM RDRETAAIGI QAALTGHLVL TTLHTNSAPD AVIRLADMGV EPYLIAASLR
GVVGQRLVRR LCERCRAPDP DGGAALDAVC ARRGFARPGG GRVHRPVGCP HCGGSGFRGR
VGVFEVMPVD EALTGLIRRE PDPLVLLRAA REAGMTTMLE DGLAKAADGL TSLDEVMRMT
G