Gene Mnod_3047 details


Gene Information

Locus tag: Mnod_3047
Symbol:
ID: 7303974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 3141319
End bp: 3142788
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 76%
IMG OID: 643600746
Product: type IV / VI secretion system protein, DotU family
Protein accession: YP_002498291
Protein GI: 220922989
COG category: [M] Cell wall/membrane/envelope biogenesis
[S] Function unknown
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein
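The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3141319, 3142788)
print(length, protein_length_aa(length))  # 1470 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.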


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.166638
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCGC CCTTCGATCC CTTCGGCCGC TCGGACCGCA CCATCATCCT GCCGAACCCG 
GCCGGACGGC GTGCGCCGCA GGCGGCCCTG CAGGCGACGA GCCCAGCCCC CGAGGCGCCG
GTCGCCCGGC GCGTCCCGGC GAGCCTCCAG GCGCCCTTCC CGGCGCCCAG CTTCGCGGCG
CCGCCCGCGA TGGGCGAGGA CGCCTGGGCG CGCCCCGATC CGCTGCCGCC CGCCCGTGAG
CCCGCGCCGC CGGGGCGCGC CCTGGTGCTC AGGCGCGACG TGGTGGTGGC GCCGAACGAG
AACCCGTTCC TGCGCGCGGC CGGGCCGCTC CTCCTGCTGA TCGGGCGCCT GCGCGTCCAG
CTCTCGCGCG CCTCCTTCGC CAACCTGATG GAGCAGGTCG CCGCGGCGAT CGAGGAGTTC
GAGCGCGAGG TGCGCGGCGC GGGCGCCTCG CCCGAGCAGA CGCGGACCGC CAAGTACGTC
GTCTGCGCCA CGGCCGACGA CGTGGTGCAG AACATCCCGA CCGAGGACCG GCACGTCTGG
ACGCAGTACT CGATGCTTAG CCGCTTCTTC GGCGAGCGCG TCGGCGGCGT GCGCTTCTTC
GAGGAGCTGG AGCGGGCCAA GCTCGACCCG GCCGGCAACT ACGCGCTGCT GGAACTGCAG
CATGCCTGCC TCGCGCTCGG CTTCCAGGGC ATCCACCGCA CCTCCGCGGG CGGGGCCGCC
GCGCTCCAGG CGATCCAGCG CAATCTCTAC GAGACGCTGC GGCGGGCCCG CCCCGCCCCG
GCCGAGATCT CGCCGCGCTG GCAGGGCCAG GACATCCCGG CGGCCGCAGC CCGGCCGGCG
GTCCCGCTCT GGACGGTGGC GGCGGTCACG GCGGCGGCGC TGCTCGCCCT CTACCTCGCC
CTGCGACTCC TGCTGGCGCG CGACGCCGAC ACGACCGCCG AGACCCTCGT CACCCTCCAC
CCGACGACCG AACTCGGCAT TCAGCGGCGC GCCCCGGTTC CGCCGCCCCC ACCTCCGCCT
CCGCCGCCTC CGAGCGGCCC GGCCGCGGCC CTGCGCGACG CGCTCGCCGC CGATGCCTCG
GCCGGGCGGG TCACGGTCGA GGAGACGAAC TCCCAGGTCG TCGTGCGGCT CGCCGCCGCG
CTCTTCGCAC CCGGCGACGC GGCCGTGACG GCGGAGTTCC GCCCGCTGCT GCAGCGCGTC
GCCGGCCTGA TCGCCCGCGA GCCGGGGCCG ATCCGGATCG TCGGCCACAC CGACAGCGCG
CCGGTCCGCA ACGGGCGCTT CGCCTCGAAC TTCGACCTCT CGGTCGCGCG CGCCAAGGCG
GTCGCAGCGG CGATCCGGGC AGCGCCGGAG AAGCCCGAGC GCCTTGAGGT CGAAGGCAAG
GGGCCGGACG CGCCCGTGGC CCCGAACGAC ACCGTCGAAG GACGCGCCCG CAACCGGCGC
GTCGAGATCC TCATCCCCCG CGGCAGCTGA
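The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute the GC fraction. Only the first row of the sequence is used here for brevity; running it on the full sequence would be the way to compare against the reported GC content.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence shown above.
first_row = "ATGAACGCGC CCTTCGATCC CTTCGGCCGC TCGGACCGCA CCATCATCCT GCCGAACCCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```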
 
Protein sequence
MNAPFDPFGR SDRTIILPNP AGRRAPQAAL QATSPAPEAP VARRVPASLQ APFPAPSFAA 
PPAMGEDAWA RPDPLPPARE PAPPGRALVL RRDVVVAPNE NPFLRAAGPL LLLIGRLRVQ
LSRASFANLM EQVAAAIEEF EREVRGAGAS PEQTRTAKYV VCATADDVVQ NIPTEDRHVW
TQYSMLSRFF GERVGGVRFF EELERAKLDP AGNYALLELQ HACLALGFQG IHRTSAGGAA
ALQAIQRNLY ETLRRARPAP AEISPRWQGQ DIPAAAARPA VPLWTVAAVT AAALLALYLA
LRLLLARDAD TTAETLVTLH PTTELGIQRR APVPPPPPPP PPPPSGPAAA LRDALAADAS
AGRVTVEETN SQVVVRLAAA LFAPGDAAVT AEFRPLLQRV AGLIAREPGP IRIVGHTDSA
PVRNGRFASN FDLSVARAKA VAAAIRAAPE KPERLEVEGK GPDAPVAPND TVEGRARNRR
VEILIPRGS
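The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the conventional compact TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that is not an issue). The function name is illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its amino acid; "*" marks a stop codon.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGAACGCGC CCTTCGATCC CTTCGGCCGC"))  # MNAPFDPFGR
```

The output matches the first ten residues of the protein sequence, and translating the final codons (CGC GGC AGC TGA) gives RGS followed by the stop, which agrees with the end of the protein.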