Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0950 |
Symbol | |
ID | 6131631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1076397 |
End bp | 1077404 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641641259 |
Product | extensin family protein |
Protein accession | YP_001767933 |
Protein GI | 170739278 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00924534 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGTG GCCTCGTTGC GTTCTCAGCC TTCACGCTCT TCAGCCTGGG GCTCACCGGA TGCGCGCTCA AACCCTTTGA GCAGCGCGAA CCCTGGCGTA CGGAGGCCGA GGAGGCCTGC CTCGCCCGCG GCCTCGTCAA GCCGACCGAG GACATCGTCC CCGTGAAGGA GATCGACGGG CCGGGAATCT GCGGGATGGT GCATCCGTTC CGCGTCACCC GGCTCGCGGG CGGCACGGTC GCGCTCAAGC AGCGCATGAC GCTCGCCTGC CCGATGATCC CGGAGGTGGA GGCGTGGCTC GCCGGCACGG TTCAGCCGGC CGCGCAGCTG TATTTCGGCC AGCCGGTGGT GGAGATCAAT TCCGGCTCCT ATTCCTGCCG CGGCCGCAAC AATCAGGTCG GTGCGAAGCT CTCGGAGCAT TCCTTCGGCA ACGCGGTCGA CGTGATGTCC TTCCGGCTCG CGGACGGTTC CGTCGTGACG GTGAAGGGCG GCTGGCGCGG CAGCGAGGCC GAGCAGGGCT TCCTGCGCGA GGTCTTCCTG GGGGCCTGCA ACCACTTCAC CACGGTGCTG GCGCCGGGGT CGAACGTCTA CCACTACGAC CACCTGCACC TCGACCTCGC CCGCCACGAC CCGCGCGGGC TGCGGCGGAT CTGCAAGCCC CTGATCAAGT TCCAGTCGCA GCTGCCGCCG CCCGGCTCGC CCCTGTCGCC GATCCGCAAG AAGCCGCCGG CGTGGCAGCC CGCGCCCGAT CCGGCGCCGA TCGACGTCGA GGAGGACGAT CCCTACGGCG TGTCGCCGAT GAGCCGGCGC GAGAGCCCGG GCCAGACCCG CGTCGTCCGC GCGCCGGCCC CCCCGCCCGT GCAGGCCTAC GCGCCGCGGC CGGCGCCGAC CCGCAGCGCC GCCGCGCAGG AGCCGCCCGT GGCGCCGGGC TTCGGCCCGG CGGCGCGGGC GATTTCCGCG CCGCTGCCGC TCAATTCTCC CGCTTGGTCG TCCGGACCGA TCTACTGA
|
Protein sequence | MRRGLVAFSA FTLFSLGLTG CALKPFEQRE PWRTEAEEAC LARGLVKPTE DIVPVKEIDG PGICGMVHPF RVTRLAGGTV ALKQRMTLAC PMIPEVEAWL AGTVQPAAQL YFGQPVVEIN SGSYSCRGRN NQVGAKLSEH SFGNAVDVMS FRLADGSVVT VKGGWRGSEA EQGFLREVFL GACNHFTTVL APGSNVYHYD HLHLDLARHD PRGLRRICKP LIKFQSQLPP PGSPLSPIRK KPPAWQPAPD PAPIDVEEDD PYGVSPMSRR ESPGQTRVVR APAPPPVQAY APRPAPTRSA AAQEPPVAPG FGPAARAISA PLPLNSPAWS SGPIY
|
| |