Gene M446_1123 details

Gene Information

Locus tag: M446_1123
Symbol:
ID: 6132010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1248316
End bp: 1251504
Gene Length: 3189 bp
Protein Length: 1062 aa
Translation table: 11
GC content: 74%
IMG OID: 641641413
Product: outer membrane autotransporter
Protein accession: YP_001768085
Protein GI: 170739430
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
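
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1248316
END_BP = 1251504
GENE_LENGTH_BP = 3189
PROTEIN_LENGTH_AA = 1062


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 3189 bp, which corresponds to 1063 codons, or 1062 residues plus one stop codon.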


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.948475
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.876639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTTCC TGTCGGGTGT CTCGCTTGCG GCCGTGATCA CGGCCATCAT GGGAGTCGGG 
GCGGCGCGGG CCCAGGCCGT CAACACGACG CCGGCCAACG TCAACGTGCT GAACCTGCTG
GCGCCCTTCC TGTCGCTCGC CAGCACCCCG GTCGGCCGGC AGACGCTTCA GCTCAACCTC
TCGCAGGCGA TCGCGCAGAA CCAGAACGCG ACGACGGCGC AGAGGCAGCT CGCGATCAGC
GACAAGGCGC TGCCGGGCAG CGCGGCCTTC GTCACCGCCA TCACCCTCGC CAACGGCACG
ACGGTCACTC TCGGGCCGGC CGACAATCTC GCGGGCGGCC TGCCGCTCCA GGCGGTCCAG
GGCGCGGGCA CGCTGAACCC GGTCCAGCCG GTCGGCGGTC TCGGCCCGCA GCTCGGCAGC
CTCTACCAGG CCGGCATCCG GGCGAGCACC GCCACGACGG GTCCGCTCGC CGCGACCTTC
GCCCTGCTCA ACACGGCCTA CTCGGTGATG GGCACCGATC TCGGCGTCGC CAAGAACTAC
TTCGCGAACG GCGCCGCCAC GAACCCGGGC ACGACGCCCG CGAACTACGT GCCGGTCCCG
GCCGTCGCGC CGGCGGGCTC CACCCTGCCC ACCCTGAACG GCCTGCCGAA CACCCGCGAC
AGCGTCTACG ACCTCGCCTA CGGGGTGACG AACACCCAGG CCGGCCAGGA CGTCTACGGC
AGTTCGCGGC CGATCCAGGT CGCGCCCGGC CGCTACGCGA TCTTCGACCC GACGGCGCTC
AACGGCATCG CCACCAACCC GTCCTTCCCG AGCGGGCACA CGCAGTACGC CTTTACGGAC
GGAATCCTGC TCGCCATGCT GGTGCCGCAG CAATACCGCA GCATGCTGTC CCGGGCCGCC
GAATACGCGG ACAGCCGCAT CGTGCTGGGC GTCCACTACC CGCTCGACAT CGTCGCGTCG
CGGGCCTTCT CGGCCTACGA CCTCGCCCAG GCCTTCACCA ACCCGGCCTA CGTCGCCAAC
GCCGCGACCA CCGGCTTGGC GATGAACCTG CCGGGCCTGT TCACGCGCGC CCAGGGCGAG
CTCCAGGGCT ACCTCGCGGC CCAGTGCGGC GCCTCCGTGG CGGCCTGCGC CGCCGGCGCC
GCCAACACGA CCAACACCCC CTACCTGCCC TCGGCCGCCA ATCAGGCGCT CTACCAGTCC
CGCCTGACCT ACGGCCTGCC CACCCTCCCG TTCGACCAAG CGCCCCGCGA GCAGGCCCCG
GCGGGCGGGC CGGACGCCGC GATCCTCCTC GCGCCCCTCT ACGGCGGCAC CGGCGCGGCC
GCGACGCTCG CGCCGAATGG CGGCCTCTCC GGGAACCTCG CCACCGCCAC GATCAACCAG
ATCCTGGTCA ACACCGAGAC CACCGCGCTC GCAGCCTTCT ACGGGCAGCC GCTCAGCTAC
TGGGCGCGCC TCGACCTCTA CGCGGCCGCC GGATACTTCG ACAACGTGGT CGGCACGCTG
CGCATGGACC CGGCCGACCG CCTCACCACC GCGGTGACGA TCGGCGATAC CGGCGCCCTC
TACGCGAACG GCGTGATCGG CGGGCCGGTC ACGGTCGGAG CCGGCGGCCT CCTCGGCGGC
ACCGGCACGG TCGGCGGGAT CGTGGCCCAG GCCGGCGGCA CCGTGGCGCC CGGCACCTCG
ATCGGCACCC TGACCGTCGC CGGGACCGTC GCCTTCGCGG CCGGCTCGAC CTACCGGGTC
GAGGCCAACG CGGCCGGGCA GGCCGACCGC CTCGCCGCCA CCGGCACCGC CGCCCTCGCC
GGGGGCACCG TCGCGGTTCT GGCCCAGGCC GGCACCTACG CCCCGCGCAC CCTCTACCCG
ATCCTGACGG CGGGCGGCGG GGTGAGCGGC AGCTTCGCGG GCGTCACCGC CAACTTCGCC
TTCCTCACCC CGACCCTGCG CTACCAGGCG AACGAGGTCG ACCTGACGCT CACCCGCAAC
GACGTTCCCT TCGCGGCAGT GGCCCGGACC CGCAACCAGG CCGCCGCGGC GAACGGCATC
CAGGCGAGCG GCGGGGCGAG CGCCGTCACG GCCCGCACCG TCGGGCTGAC CACGCCGGAG
GCGGTCGGCG CCTTCCAGGC CCTCAGCGGC GACATCCACG CGAGCAGCGT CTCCGCCGCC
TCCGAGACCG CCTTCTTCGT GCGGGAGGCG ATCCTCGACC GCCTGCGCCG GGGCGAGGCG
GGCGTGCGCG ACTACGGCAG CCTGCCGGCC AGTTACACGG CCGACCTGCC CGGGCGGGCC
GCCCCGGCGG CGCCCGTGCC GGTCCGGGTG CTCGACCCGC GGGTCTTCGG CCTCTGGGGC
CAGGGCTTCG GCTCCTTCGG CGAGGCGCGC AGCGACGGGA ACGCCGTGGC GGTGAGCCGC
GACACGGCCG GCTTCGTGCT CGGCGCCGAC CTGCGGCTGG GGAACGGGCT CACGCTCGGC
GTCGCGGGCG GCTACACGAC GACGAGCCTC GACACGCCCG GCCGGGTGCA GTCGGGCACG
ATCGAGAGCG GCTTCGGGGG CGTGTACGGC GGCTACGAGG CGGGTCCGTT CGCGCTGCGC
CTCGGGGCGG TCTATGCCGG GGACAGCCTG CGCACGCGCC GCAGCGTGAC CTTCCCGGGC
GTGGCCGAGA CCGAGGCGGC GCGCTACGGC GGCGCGACGG TGCAGGGCTT CGGCGAGATC
GGCTACAGGA TCGTGCTGGG CGGCGCGCCG TCCGTCGCGG GCAAGGATCC GCTCGCGCCC
ATCCCGACCT TCATCGAGCC CTTCGTGGGT GGCGCCTCCG TGAGCATCGA CCGCGACCGC
TTCGCGGAGA CCGGCGGGGT GGGGGCCCTC ACCGGCGCCG CCCGGACGGC GGAGATCCCG
ACGCTGACGG CGGGCATGCG GGCGCAGACG GGCCTCGACC TGGGCTTCGG CGCGCCGGTG
ATCCTGCACG GGCTGCTCGG CTATCGCCGG GCCTTCGGGG ACGTGGTGCC GACCGCGCTG
CTCGCCTTCG GGACGGCGCC GGGCTTCGTC ACGGCGGGCA TCCCGATCGA CCGCGACGCC
CTCCTGGCGC GGGCCGGCCT CTCGCTGCGG CTCTCGGAGC GGGCGACGCT CGACGTGTCC
TACACGGGGC AGGTCGGGCC GCGGGCGCAG GACCACGCCG TGAAGGGCGG CTTCACCTAC
CGGTTCTGA
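
A GC content figure like the one listed in the Gene Information section can be recomputed from the gene sequence. The sketch below is a minimal illustration: the helper name `gc_percent` is hypothetical, and it assumes the percentage is the count of G and C bases over all bases, ignoring the whitespace used to group the sequence into blocks of ten. How the source database rounds the value is not stated here.

```python
def gc_percent(sequence):
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First ten-base block of the gene sequence above.
    print(gc_percent("ATGAGGTTCC"))  # prints 50.0
```

Passing the full gene sequence text to `gc_percent`, block spacing included, gives the value to compare against the GC content field.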
 
Protein sequence
MRFLSGVSLA AVITAIMGVG AARAQAVNTT PANVNVLNLL APFLSLASTP VGRQTLQLNL 
SQAIAQNQNA TTAQRQLAIS DKALPGSAAF VTAITLANGT TVTLGPADNL AGGLPLQAVQ
GAGTLNPVQP VGGLGPQLGS LYQAGIRAST ATTGPLAATF ALLNTAYSVM GTDLGVAKNY
FANGAATNPG TTPANYVPVP AVAPAGSTLP TLNGLPNTRD SVYDLAYGVT NTQAGQDVYG
SSRPIQVAPG RYAIFDPTAL NGIATNPSFP SGHTQYAFTD GILLAMLVPQ QYRSMLSRAA
EYADSRIVLG VHYPLDIVAS RAFSAYDLAQ AFTNPAYVAN AATTGLAMNL PGLFTRAQGE
LQGYLAAQCG ASVAACAAGA ANTTNTPYLP SAANQALYQS RLTYGLPTLP FDQAPREQAP
AGGPDAAILL APLYGGTGAA ATLAPNGGLS GNLATATINQ ILVNTETTAL AAFYGQPLSY
WARLDLYAAA GYFDNVVGTL RMDPADRLTT AVTIGDTGAL YANGVIGGPV TVGAGGLLGG
TGTVGGIVAQ AGGTVAPGTS IGTLTVAGTV AFAAGSTYRV EANAAGQADR LAATGTAALA
GGTVAVLAQA GTYAPRTLYP ILTAGGGVSG SFAGVTANFA FLTPTLRYQA NEVDLTLTRN
DVPFAAVART RNQAAAANGI QASGGASAVT ARTVGLTTPE AVGAFQALSG DIHASSVSAA
SETAFFVREA ILDRLRRGEA GVRDYGSLPA SYTADLPGRA APAAPVPVRV LDPRVFGLWG
QGFGSFGEAR SDGNAVAVSR DTAGFVLGAD LRLGNGLTLG VAGGYTTTSL DTPGRVQSGT
IESGFGGVYG GYEAGPFALR LGAVYAGDSL RTRRSVTFPG VAETEAARYG GATVQGFGEI
GYRIVLGGAP SVAGKDPLAP IPTFIEPFVG GASVSIDRDR FAETGGVGAL TGAARTAEIP
TLTAGMRAQT GLDLGFGAPV ILHGLLGYRR AFGDVVPTAL LAFGTAPGFV TAGIPIDRDA
LLARAGLSLR LSERATLDVS YTGQVGPRAQ DHAVKGGFTY RF