Gene Information |
Locus tag | M446_4029 |
Symbol | |
ID | 6132880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 4493313 |
End bp | 4495481 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641644186 |
Product | hypothetical protein |
Protein accession | YP_001770826 |
Protein GI | 170742171 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGC TCCCATCGGA AACCGGTCTG GCGCCTGCCA GGCCGCACAC GGCCTCGGGG GGGAGCGTCC GCGACACGGG CGCGACCCTC CTCTTCGTGG TGTTCAAGTG GCTGCCGCAG ATGGCCGCGG TCGTCGGGAT CGGGATCCTG TGCGGAGTGG GCTACCTGAG CCTGGTGCGC GGGAACATGT TCCAGGCCAA CGCGAAGCTC TACGTTCGCG TCGGCGTCGA TCAGACGCCC TCGCCGCTGC TCGGCTCGGA CCGCAACGTC ACGTTCCTGG CGCAAACCCG CGGTGCCGTG CAGTCGGAGA TGGATCTCAT GCGCAACGAG GTGCTGGTCT CGCGTCTGAT CGCGGACCTG AACCTCGGCG CGCCGGAGGT GCGTCCGGAG CCGACGGGCC TCTACGGTCG CCTGAAGCGC TTCGGCTCCG ACCTTTACCA GGGTATGCGC GACGGGGCCG ATGCCGTCCT GATCATGGCT GGCCTGAAGA CGCCGCTGTC GCGCTCGGCC GCGGTCAGCC AGGTCTTCGC CCAGTCGCTG ATCCTCGACA ATGCGCCGGG CTCCGACGTC ATCTCGGTCA GCCTGCGCTG GCCCGGCGAG GCGCAGGCGG TGCAGCTCCT CGACCGTTTC CTGCAGATCT ACACGAACTT TCGCTCGGCG GTGTTCGAGG GAGGCGGCGA GATCGACTTC CTCAGCACCA AGCGCGACGC CGCCAAGGCG GCCGTCGAGG CGGTCGAGGC GGAGATGGCG GTGTTCGAGC GCGAGCACGA CACCCGCAAT GCGGCGAGCC GGCTGCCCCT TCTGGAGGCC GACCTCGTGG AGGCGCAGCG CGCGCTCGAA CGCCAGTCCC TCGAACTGGA CTTCGCGCGT CGGCGCTTCG AACGGCTCAG CCGCGTCCTG GTCCGGACGG CCAGCCAGCG CGAGCCGGTG GGCCTCGGAA CCTTTCCGCC GAACTCGCCG GCCCTCAGCA TGGCGCCCTC GATGGTGGCG CTGCTCGGCG AGCGCGAGCG TCTCCTCGTC AACAACTCGG CCAACGCGCC CGAGGTGAGA GAAGTCGAGG CCAAGCTCGG CGCGCTGACG GTCGTGCTGC TCCGCCAGCT CGAGGCCGAG GTTCAGGATC TCGTCCACGT CGAGGCGGCG GCGCGAGAGC GCGTGGAGAA GGTCTCGACG GCGTTGCGCG AGTTCCAGGA TGCGGCGACC GGCTGGAAGT CGCTGGAGCG GCGGCGCGAA CTCGCTGAGG CGCGCTACCG CGATGCCGAG AAGCGTCTCG CGGAGGCGCG CGACATCGCC GCTCTCAGGA ACGCGCGGCT CTCCAACGTG GTCGTCGTGC AGCCCGCCGC CGCCGAGGGC ACGCCCATCG GGCTTCGCAA GCTCTCGATG CTCGGCATCA TCACGCTGTG CTCGGGCGTG CTCGCCTGCG GCTGGGCGCT CGTGCGCGAA GTCTTCGACG GACGCCTCTA CCGGGGCGAG GAGGCTGCCG CCGCGCTCGG CCTGCCGCTG GTCGGCGAGG TGCCGGCGCG GGAGCGCCCG CTGCAGGTCT GGTCGCCGAG CGATCCCGAC TCGGCGGCGC GGGTCGCCCT CGACCGACTC GTCGTCACGG TCAGCGAGCG CCTGCGCGGC GCGCGGCCCA CCATCCTGGC GATCGCCGCC GCCGAGGCGG ATGAGGGTGC TTCCACCGTC GCCCTCGCGC TCGCTCACGG CATGGCCCGG CGCGGCCGCG TGCCGGTGCG CCTCATCTCG GGCTCGACGA GCCAGGACCT GCTGCACCAC GCCCGGGACC TGCGCGTTCG CATGGACCCG 
TTGCCGTCCT CGCCGGATCT GCCGACCGGC CTGGCCCTGA CCGCCGTCGG CGACCGCTTG GTCGTGGCAA CCTGGGACGA CGCGGAGGCC GCCGCGGCGT TCCTGCGCGA CGGCTTCGCG AGCTGCCCAG GCCTGCAAGG GGATGCCAGC CTCGTCATCC TCGACCTGCC GCCGCTGTCC GGCGATGCGG AGGCACCCCT CTGCGCGAGC CGGGCGGATG CGACGCTCCT CGTCGTGCGC GCCGGCCGGC ACGGCGCCCA GCGGCACATC GCGGCACTCG AGGCCTTGAG GTGGCTCGGC ACGGAGCCGA TCGGCATCGT CCTCAACGGC GTGCGCCGCT TCGTTCCCGC TCGCCTGGAG CGAGTCTGA
|
Protein sequence | MTKLPSETGL APARPHTASG GSVRDTGATL LFVVFKWLPQ MAAVVGIGIL CGVGYLSLVR GNMFQANAKL YVRVGVDQTP SPLLGSDRNV TFLAQTRGAV QSEMDLMRNE VLVSRLIADL NLGAPEVRPE PTGLYGRLKR FGSDLYQGMR DGADAVLIMA GLKTPLSRSA AVSQVFAQSL ILDNAPGSDV ISVSLRWPGE AQAVQLLDRF LQIYTNFRSA VFEGGGEIDF LSTKRDAAKA AVEAVEAEMA VFEREHDTRN AASRLPLLEA DLVEAQRALE RQSLELDFAR RRFERLSRVL VRTASQREPV GLGTFPPNSP ALSMAPSMVA LLGERERLLV NNSANAPEVR EVEAKLGALT VVLLRQLEAE VQDLVHVEAA ARERVEKVST ALREFQDAAT GWKSLERRRE LAEARYRDAE KRLAEARDIA ALRNARLSNV VVVQPAAAEG TPIGLRKLSM LGIITLCSGV LACGWALVRE VFDGRLYRGE EAAAALGLPL VGEVPARERP LQVWSPSDPD SAARVALDRL VVTVSERLRG ARPTILAIAA AEADEGASTV ALALAHGMAR RGRVPVRLIS GSTSQDLLHH ARDLRVRMDP LPSSPDLPTG LALTAVGDRL VVATWDDAEA AAAFLRDGFA SCPGLQGDAS LVILDLPPLS GDAEAPLCAS RADATLLVVR AGRHGAQRHI AALEALRWLG TEPIGIVLNG VRRFVPARLE RV
|
| |
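The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch of such a check, not part of the original record. The helper names (`clean`, `gene_length`, `gc_fraction`, `translate`) are hypothetical. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA even though the strand is `-`), and it uses the standard codon assignments, which translation table 11 shares for sense codons. Alternative start codons permitted by table 11 are not handled, since this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same sense-codon assignments; only the
# set of permitted start codons differs, which is not modelled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: 4495481 - 4493313 + 1 = 2169 bp.
print(gene_length(4493313, 4495481))
# First 30 bases of the gene sequence field.
print(translate("ATGACCAAGC TCCCATCGGA AACCGGTCTG"))
```

With the values in this record, the coordinate span gives 2169 bp, and 2169 / 3 = 723 codons, one of which is the stop codon, consistent with the stated protein length of 722 aa. Passing the full gene sequence field to `translate` and `gc_fraction` would allow the protein sequence and GC content fields to be compared in the same way.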