Gene M446_4029 details

Gene Information

Locus tag: M446_4029
Symbol:
ID: 6132880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4493313
End bp: 4495481
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 72%
IMG OID: 641644186
Product: hypothetical protein
Protein accession: YP_001770826
Protein GI: 170742171
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
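
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4493313
end_bp = 4495481
gene_length_bp = 2169
protein_length_aa = 722

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length is consistent:", expected_protein_length == protein_length_aa)
```

Under those assumptions, all three checks should agree with the listed values.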


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGC TCCCATCGGA AACCGGTCTG GCGCCTGCCA GGCCGCACAC GGCCTCGGGG 
GGGAGCGTCC GCGACACGGG CGCGACCCTC CTCTTCGTGG TGTTCAAGTG GCTGCCGCAG
ATGGCCGCGG TCGTCGGGAT CGGGATCCTG TGCGGAGTGG GCTACCTGAG CCTGGTGCGC
GGGAACATGT TCCAGGCCAA CGCGAAGCTC TACGTTCGCG TCGGCGTCGA TCAGACGCCC
TCGCCGCTGC TCGGCTCGGA CCGCAACGTC ACGTTCCTGG CGCAAACCCG CGGTGCCGTG
CAGTCGGAGA TGGATCTCAT GCGCAACGAG GTGCTGGTCT CGCGTCTGAT CGCGGACCTG
AACCTCGGCG CGCCGGAGGT GCGTCCGGAG CCGACGGGCC TCTACGGTCG CCTGAAGCGC
TTCGGCTCCG ACCTTTACCA GGGTATGCGC GACGGGGCCG ATGCCGTCCT GATCATGGCT
GGCCTGAAGA CGCCGCTGTC GCGCTCGGCC GCGGTCAGCC AGGTCTTCGC CCAGTCGCTG
ATCCTCGACA ATGCGCCGGG CTCCGACGTC ATCTCGGTCA GCCTGCGCTG GCCCGGCGAG
GCGCAGGCGG TGCAGCTCCT CGACCGTTTC CTGCAGATCT ACACGAACTT TCGCTCGGCG
GTGTTCGAGG GAGGCGGCGA GATCGACTTC CTCAGCACCA AGCGCGACGC CGCCAAGGCG
GCCGTCGAGG CGGTCGAGGC GGAGATGGCG GTGTTCGAGC GCGAGCACGA CACCCGCAAT
GCGGCGAGCC GGCTGCCCCT TCTGGAGGCC GACCTCGTGG AGGCGCAGCG CGCGCTCGAA
CGCCAGTCCC TCGAACTGGA CTTCGCGCGT CGGCGCTTCG AACGGCTCAG CCGCGTCCTG
GTCCGGACGG CCAGCCAGCG CGAGCCGGTG GGCCTCGGAA CCTTTCCGCC GAACTCGCCG
GCCCTCAGCA TGGCGCCCTC GATGGTGGCG CTGCTCGGCG AGCGCGAGCG TCTCCTCGTC
AACAACTCGG CCAACGCGCC CGAGGTGAGA GAAGTCGAGG CCAAGCTCGG CGCGCTGACG
GTCGTGCTGC TCCGCCAGCT CGAGGCCGAG GTTCAGGATC TCGTCCACGT CGAGGCGGCG
GCGCGAGAGC GCGTGGAGAA GGTCTCGACG GCGTTGCGCG AGTTCCAGGA TGCGGCGACC
GGCTGGAAGT CGCTGGAGCG GCGGCGCGAA CTCGCTGAGG CGCGCTACCG CGATGCCGAG
AAGCGTCTCG CGGAGGCGCG CGACATCGCC GCTCTCAGGA ACGCGCGGCT CTCCAACGTG
GTCGTCGTGC AGCCCGCCGC CGCCGAGGGC ACGCCCATCG GGCTTCGCAA GCTCTCGATG
CTCGGCATCA TCACGCTGTG CTCGGGCGTG CTCGCCTGCG GCTGGGCGCT CGTGCGCGAA
GTCTTCGACG GACGCCTCTA CCGGGGCGAG GAGGCTGCCG CCGCGCTCGG CCTGCCGCTG
GTCGGCGAGG TGCCGGCGCG GGAGCGCCCG CTGCAGGTCT GGTCGCCGAG CGATCCCGAC
TCGGCGGCGC GGGTCGCCCT CGACCGACTC GTCGTCACGG TCAGCGAGCG CCTGCGCGGC
GCGCGGCCCA CCATCCTGGC GATCGCCGCC GCCGAGGCGG ATGAGGGTGC TTCCACCGTC
GCCCTCGCGC TCGCTCACGG CATGGCCCGG CGCGGCCGCG TGCCGGTGCG CCTCATCTCG
GGCTCGACGA GCCAGGACCT GCTGCACCAC GCCCGGGACC TGCGCGTTCG CATGGACCCG
TTGCCGTCCT CGCCGGATCT GCCGACCGGC CTGGCCCTGA CCGCCGTCGG CGACCGCTTG
GTCGTGGCAA CCTGGGACGA CGCGGAGGCC GCCGCGGCGT TCCTGCGCGA CGGCTTCGCG
AGCTGCCCAG GCCTGCAAGG GGATGCCAGC CTCGTCATCC TCGACCTGCC GCCGCTGTCC
GGCGATGCGG AGGCACCCCT CTGCGCGAGC CGGGCGGATG CGACGCTCCT CGTCGTGCGC
GCCGGCCGGC ACGGCGCCCA GCGGCACATC GCGGCACTCG AGGCCTTGAG GTGGCTCGGC
ACGGAGCCGA TCGGCATCGT CCTCAACGGC GTGCGCCGCT TCGTTCCCGC TCGCCTGGAG
CGAGTCTGA
 
Protein sequence
MTKLPSETGL APARPHTASG GSVRDTGATL LFVVFKWLPQ MAAVVGIGIL CGVGYLSLVR 
GNMFQANAKL YVRVGVDQTP SPLLGSDRNV TFLAQTRGAV QSEMDLMRNE VLVSRLIADL
NLGAPEVRPE PTGLYGRLKR FGSDLYQGMR DGADAVLIMA GLKTPLSRSA AVSQVFAQSL
ILDNAPGSDV ISVSLRWPGE AQAVQLLDRF LQIYTNFRSA VFEGGGEIDF LSTKRDAAKA
AVEAVEAEMA VFEREHDTRN AASRLPLLEA DLVEAQRALE RQSLELDFAR RRFERLSRVL
VRTASQREPV GLGTFPPNSP ALSMAPSMVA LLGERERLLV NNSANAPEVR EVEAKLGALT
VVLLRQLEAE VQDLVHVEAA ARERVEKVST ALREFQDAAT GWKSLERRRE LAEARYRDAE
KRLAEARDIA ALRNARLSNV VVVQPAAAEG TPIGLRKLSM LGIITLCSGV LACGWALVRE
VFDGRLYRGE EAAAALGLPL VGEVPARERP LQVWSPSDPD SAARVALDRL VVTVSERLRG
ARPTILAIAA AEADEGASTV ALALAHGMAR RGRVPVRLIS GSTSQDLLHH ARDLRVRMDP
LPSSPDLPTG LALTAVGDRL VVATWDDAEA AAAFLRDGFA SCPGLQGDAS LVILDLPPLS
GDAEAPLCAS RADATLLVVR AGRHGAQRHI AALEALRWLG TEPIGIVLNG VRRFVPARLE
RV
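
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first line of the gene sequence, not the full gene, so its GC fraction is not expected to equal the whole-gene figure listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used as a small sample.
first_line = "ATGACCAAGC TCCCATCGGA AACCGGTCTG GCGCCTGCCA GGCCGCACAC GGCCTCGGGG"

cleaned = clean_sequence(first_line)
print(len(cleaned))          # number of bases in the sample line
print(gc_fraction(cleaned))  # GC fraction of the sample line only
```

To reproduce the whole-gene GC content, the same two functions could be applied to all gene sequence lines joined together.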