Gene M446_2958 details

Gene Information

Locus tag: M446_2958
Symbol:
ID: 6130846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3280713
End bp: 3282992
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 76%
IMG OID: 641643149
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_001769804
Protein GI: 170741149
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
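
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks that relationship; it assumes 1-based inclusive coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length. The function name cds_lengths is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with one stop codon that is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
gene_len, prot_len = cds_lengths(3280713, 3282992)
print(gene_len, prot_len)  # 2280 759
```

The result agrees with the listed gene length of 2280 bp and protein length of 759 aa.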


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0453248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGCG CGTCTCCCGC CATCCCGTCA CCGCCGCCGG GCGCCGTGAC CGATCCCCTC 
CCGCGCCATG CGGAGCGCGC GGCGGGCGAG GGCGTCGCGC TCGGCGACAT CGGCCGGCTG
CTGCGGCGGC ACTGGCTGGC GATCCTGCTC CCGACCCTGG CGGCGCTGGC GGCCGCGATC
GCCTTCGTTC AGCTCGTCCC GCCCCGCTAC ACCGGCGAGG CCAAGCTGCT GCTGGAGAGC
CGCGACAGCG CCCTGACGCG GCTGCAGCAG GAGCGCGGCG ACGCCGCCCA GCCGATCGAC
GAGCAGGCGG TGGCGAGCCA GGTCCAGGTG GTGATGTCCC GCGACCTCGC CCGCGAGGCG
ATCCGGCGCC TGAACCTCGT CGGCAACCGG GAGTTCGACC CGACCGTGGA GGGAATCGGC
TCCCTGCAGC GGCTGCTGAT CATGCTCGGG CTGGTGCCGA GCCCCCTCGA CCGGGACCCC
GAGGACCGGG TGCTGGAGCG CTACTTCGAC CGGCTGCTGG TCTACTCGGC GGGCAAGTCG
CGCATCCTGA CGATCGAGTT CCGCTCCCGC GACCCGGACC TCGCCGCCCG CGGCGCCAAC
ACGATCGCGG ACCTCTACCT CGCCTCGCTC GCCGCCGCGA AGGTCGACAC CGCGCGCTTC
GCCTCGACCT GGCTCGGCAG CAACGTCGAG ACCCTGCGCA GCCGCGTCGC CGAGGCGGAG
GCCAAGGTCG AGGCGTTCCG CGCCCGCAAC GGCCTGATCG GCGGCCTCGC CGGCAGCGGC
GGGGCGCAGC CGATCGGCGC GCAGCAATTG ACCGAGCTCT CCAGCCAGCT CACCCAGGCC
CGCGCCGCCC AGGCCGACGC TGCGGCCAAG GCCAAGCTCA TCAAGGACAT GATTCGCGAC
GGGCGCGCCT TCGAGATCCC GGACGTCGCC AATAACGAGC TGATCCGCCG CCTCGTCGAG
CAGCGCATCA CCCTGCGCGC GCAGCTCGCG CTGGAGGGCC GCACCCTGCT GCCGCAGCAC
CCGCGCATGA AGGAACTGAC CGCGCAGGTC ACCGACCTGG AGGGGCAGAT CCGCGCGGCC
GCCGACCGCA CGGTGCGCAC CCTCGAGAAC GACGCCCGCA TCGCCGGCAG CCGGGTCGAG
AGCCTGCAGG CCGCGGTCGA CGCGCAGCGC GAGGTCGTGG CCAAGGGCAA CAGCAGCGAG
GTGCAGCTGC GCGCCCTCGA GCGCGAGGCG AAGGTCCAGC GCGAGCAGCT CGAATCCTAC
CTCGCGCGCT ACCGTGAGGC CGCCGCGCGG GATGCCGAGA GCGCCGCCCC GGCCGATGCC
CGGGTGGTGT CGCGGGCGAT CGTGCCCGAC ACGCCGTCCT TCCCCAAGAA GCTGCCGATC
ATCGGCTTCA CCACCGCCGT CGCCTTCCTG CTCGCGGCCG GCAGCGTGCT CGCGCGCAGC
CTCATCGCCG ACGATCCCGA CGACCTGCGC GGCCGCGGCC GGCCGCGGCC GCGTCGCGCT
GCGGTGCTCA CCGATGTGAG CGCGCGCCCC GACGCGCCCG CGCCCGCGCC CGCGGCGGGC
GGGGCGGCCG CCGCCCTCGC CGAACCGGAG GCCGGGGCCG CCGAGGAGGA GGATCTCGGG
TGGGGCGCGC CGCCCGCTTC CGCGCAGGAT CCCCTCCCCG CCCCCGCGAT GCGGGAGGAG
CCTTACGAAT TCGACGCACT GGTGGCCCGG CTCGCCTCCG TGGAGAGCGC GGGGGACGGG
CGGCGCGTCC TGATCCTCGG ATCGGGCGGC GCGGGCGAGC CCGAGGCGCT CGCGCTCGCG
CTCGGGCGGT CGCTCGCGCG CTCGGCGCGG GCGGTGCTGC TGCCACTGGA CGCGGCGGCG
GGCGGCGCGC CGGGCCTGAC CGACATCGTG CTCGGCGAGG CGCCGTTCGG GGAGGCGATC
CAGCGCGACC CGGGCTCCCG CCTCCACCGG GTCGGCCCCG GCTCGCACGC GCCGGGCGTG
CTCCTCGACG AGCGCGAGGG CCTGGGCCTG ACCTTCGACG CCCTCGGGCA GGCTTACGAC
TGGATCCTCT GCGTGCTGCG CGACGACGGC CGCGACCCGG GGACGCGCGC CCTGGCCTCG
GCGACCTGCC TCTGGATGGA CGCGGTCGTC ATCGCCTCGA ACGCGCCGGC GGACGATGCC
GACCTCGTCG CCCTCTACGC GCTCGCCGAG GGCGCCGGCG TGCCGGAGGT GATCGTGGCG
CAGGACCGGC CGCCGCCCCC GGTCGCGGTG CCGGCCTACC CGCTGCGGCG CTCCGCCTGA
 
Protein sequence
MLRASPAIPS PPPGAVTDPL PRHAERAAGE GVALGDIGRL LRRHWLAILL PTLAALAAAI 
AFVQLVPPRY TGEAKLLLES RDSALTRLQQ ERGDAAQPID EQAVASQVQV VMSRDLAREA
IRRLNLVGNR EFDPTVEGIG SLQRLLIMLG LVPSPLDRDP EDRVLERYFD RLLVYSAGKS
RILTIEFRSR DPDLAARGAN TIADLYLASL AAAKVDTARF ASTWLGSNVE TLRSRVAEAE
AKVEAFRARN GLIGGLAGSG GAQPIGAQQL TELSSQLTQA RAAQADAAAK AKLIKDMIRD
GRAFEIPDVA NNELIRRLVE QRITLRAQLA LEGRTLLPQH PRMKELTAQV TDLEGQIRAA
ADRTVRTLEN DARIAGSRVE SLQAAVDAQR EVVAKGNSSE VQLRALEREA KVQREQLESY
LARYREAAAR DAESAAPADA RVVSRAIVPD TPSFPKKLPI IGFTTAVAFL LAAGSVLARS
LIADDPDDLR GRGRPRPRRA AVLTDVSARP DAPAPAPAAG GAAAALAEPE AGAAEEEDLG
WGAPPASAQD PLPAPAMREE PYEFDALVAR LASVESAGDG RRVLILGSGG AGEPEALALA
LGRSLARSAR AVLLPLDAAA GGAPGLTDIV LGEAPFGEAI QRDPGSRLHR VGPGSHAPGV
LLDEREGLGL TFDALGQAYD WILCVLRDDG RDPGTRALAS ATCLWMDAVV IASNAPADDA
DLVALYALAE GAGVPEVIVA QDRPPPPVAV PAYPLRRSA
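
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The sketch below shows one way to do that, compute GC content, and translate the nucleotide sequence. It is illustrative only: the helper names (clean, gc_percent, translate) are hypothetical, and the codon table is the standard one, whose amino acid assignments are shared by translation table 11. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. Ambiguous bases such as N are not supported and would raise a KeyError.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGCTCCGCG CGTCTCCCGC CATCCCGTCA CCGCCGCCGG GCGCCGTGAC CGATCCCCTC"
dna = clean(first_row)
print(translate(dna))  # MLRASPAIPSPPPGAVTDPL
print(round(gc_percent(dna), 1))
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same functions would allow the listed GC content and protein length to be checked in the same way.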