Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_2958 |
Symbol | |
ID | 6130846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3280713 |
End bp | 3282992 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641643149 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001769804 |
Protein GI | 170741149 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0453248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCGCG CGTCTCCCGC CATCCCGTCA CCGCCGCCGG GCGCCGTGAC CGATCCCCTC CCGCGCCATG CGGAGCGCGC GGCGGGCGAG GGCGTCGCGC TCGGCGACAT CGGCCGGCTG CTGCGGCGGC ACTGGCTGGC GATCCTGCTC CCGACCCTGG CGGCGCTGGC GGCCGCGATC GCCTTCGTTC AGCTCGTCCC GCCCCGCTAC ACCGGCGAGG CCAAGCTGCT GCTGGAGAGC CGCGACAGCG CCCTGACGCG GCTGCAGCAG GAGCGCGGCG ACGCCGCCCA GCCGATCGAC GAGCAGGCGG TGGCGAGCCA GGTCCAGGTG GTGATGTCCC GCGACCTCGC CCGCGAGGCG ATCCGGCGCC TGAACCTCGT CGGCAACCGG GAGTTCGACC CGACCGTGGA GGGAATCGGC TCCCTGCAGC GGCTGCTGAT CATGCTCGGG CTGGTGCCGA GCCCCCTCGA CCGGGACCCC GAGGACCGGG TGCTGGAGCG CTACTTCGAC CGGCTGCTGG TCTACTCGGC GGGCAAGTCG CGCATCCTGA CGATCGAGTT CCGCTCCCGC GACCCGGACC TCGCCGCCCG CGGCGCCAAC ACGATCGCGG ACCTCTACCT CGCCTCGCTC GCCGCCGCGA AGGTCGACAC CGCGCGCTTC GCCTCGACCT GGCTCGGCAG CAACGTCGAG ACCCTGCGCA GCCGCGTCGC CGAGGCGGAG GCCAAGGTCG AGGCGTTCCG CGCCCGCAAC GGCCTGATCG GCGGCCTCGC CGGCAGCGGC GGGGCGCAGC CGATCGGCGC GCAGCAATTG ACCGAGCTCT CCAGCCAGCT CACCCAGGCC CGCGCCGCCC AGGCCGACGC TGCGGCCAAG GCCAAGCTCA TCAAGGACAT GATTCGCGAC GGGCGCGCCT TCGAGATCCC GGACGTCGCC AATAACGAGC TGATCCGCCG CCTCGTCGAG CAGCGCATCA CCCTGCGCGC GCAGCTCGCG CTGGAGGGCC GCACCCTGCT GCCGCAGCAC CCGCGCATGA AGGAACTGAC CGCGCAGGTC ACCGACCTGG AGGGGCAGAT CCGCGCGGCC GCCGACCGCA CGGTGCGCAC CCTCGAGAAC GACGCCCGCA TCGCCGGCAG CCGGGTCGAG AGCCTGCAGG CCGCGGTCGA CGCGCAGCGC GAGGTCGTGG CCAAGGGCAA CAGCAGCGAG GTGCAGCTGC GCGCCCTCGA GCGCGAGGCG AAGGTCCAGC GCGAGCAGCT CGAATCCTAC CTCGCGCGCT ACCGTGAGGC CGCCGCGCGG GATGCCGAGA GCGCCGCCCC GGCCGATGCC CGGGTGGTGT CGCGGGCGAT CGTGCCCGAC ACGCCGTCCT TCCCCAAGAA GCTGCCGATC ATCGGCTTCA CCACCGCCGT CGCCTTCCTG CTCGCGGCCG GCAGCGTGCT CGCGCGCAGC CTCATCGCCG ACGATCCCGA CGACCTGCGC GGCCGCGGCC GGCCGCGGCC GCGTCGCGCT GCGGTGCTCA CCGATGTGAG CGCGCGCCCC GACGCGCCCG CGCCCGCGCC CGCGGCGGGC GGGGCGGCCG CCGCCCTCGC CGAACCGGAG GCCGGGGCCG CCGAGGAGGA GGATCTCGGG TGGGGCGCGC CGCCCGCTTC CGCGCAGGAT CCCCTCCCCG CCCCCGCGAT GCGGGAGGAG CCTTACGAAT TCGACGCACT GGTGGCCCGG CTCGCCTCCG TGGAGAGCGC GGGGGACGGG CGGCGCGTCC TGATCCTCGG ATCGGGCGGC GCGGGCGAGC CCGAGGCGCT CGCGCTCGCG CTCGGGCGGT CGCTCGCGCG CTCGGCGCGG GCGGTGCTGC TGCCACTGGA CGCGGCGGCG GGCGGCGCGC CGGGCCTGAC CGACATCGTG CTCGGCGAGG CGCCGTTCGG GGAGGCGATC CAGCGCGACC CGGGCTCCCG CCTCCACCGG GTCGGCCCCG GCTCGCACGC GCCGGGCGTG CTCCTCGACG AGCGCGAGGG CCTGGGCCTG ACCTTCGACG CCCTCGGGCA GGCTTACGAC TGGATCCTCT GCGTGCTGCG CGACGACGGC CGCGACCCGG GGACGCGCGC CCTGGCCTCG GCGACCTGCC TCTGGATGGA CGCGGTCGTC ATCGCCTCGA ACGCGCCGGC GGACGATGCC GACCTCGTCG CCCTCTACGC GCTCGCCGAG GGCGCCGGCG TGCCGGAGGT GATCGTGGCG CAGGACCGGC CGCCGCCCCC GGTCGCGGTG CCGGCCTACC CGCTGCGGCG CTCCGCCTGA
|
Protein sequence | MLRASPAIPS PPPGAVTDPL PRHAERAAGE GVALGDIGRL LRRHWLAILL PTLAALAAAI AFVQLVPPRY TGEAKLLLES RDSALTRLQQ ERGDAAQPID EQAVASQVQV VMSRDLAREA IRRLNLVGNR EFDPTVEGIG SLQRLLIMLG LVPSPLDRDP EDRVLERYFD RLLVYSAGKS RILTIEFRSR DPDLAARGAN TIADLYLASL AAAKVDTARF ASTWLGSNVE TLRSRVAEAE AKVEAFRARN GLIGGLAGSG GAQPIGAQQL TELSSQLTQA RAAQADAAAK AKLIKDMIRD GRAFEIPDVA NNELIRRLVE QRITLRAQLA LEGRTLLPQH PRMKELTAQV TDLEGQIRAA ADRTVRTLEN DARIAGSRVE SLQAAVDAQR EVVAKGNSSE VQLRALEREA KVQREQLESY LARYREAAAR DAESAAPADA RVVSRAIVPD TPSFPKKLPI IGFTTAVAFL LAAGSVLARS LIADDPDDLR GRGRPRPRRA AVLTDVSARP DAPAPAPAAG GAAAALAEPE AGAAEEEDLG WGAPPASAQD PLPAPAMREE PYEFDALVAR LASVESAGDG RRVLILGSGG AGEPEALALA LGRSLARSAR AVLLPLDAAA GGAPGLTDIV LGEAPFGEAI QRDPGSRLHR VGPGSHAPGV LLDEREGLGL TFDALGQAYD WILCVLRDDG RDPGTRALAS ATCLWMDAVV IASNAPADDA DLVALYALAE GAGVPEVIVA QDRPPPPVAV PAYPLRRSA
|
| |