Gene Information |
Locus tag | M446_1077 |
Symbol | |
ID | 6131532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1200063 |
End bp | 1201085 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641641368 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001768040 |
Protein GI | 170739385 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
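The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length_bp(1200063, 1201085)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives the listed gene length of 1023 bp and protein length of 340 aa.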
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.257076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.422372 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCTG AGGCGACGAG GCGGGCGCGG CCGCTTCCCG CCGCGATCAC CCGCGGCGCC TTCCTGGCCG CCCTCGCCGG CGGCGTCGCC GCGGCGGCCG GGACGGGGGC GGGCCCGGCG CGGGCCGGCG GGGCGGTCCG CTACCCCGGC GTCAACCTGT CGGGCGGGGA GTTCGGCGAC ATCGGCCGCC CCCTCGGCCA GGGCTACATC TACCCGCCGA ACGAGAGTTT CGCCTACTAC GCCGGGCGCG GCATGAAGCT CGTGCGGATC CCGTTCAAGA TCGAGCGGGT GCAGCCGGAG CCCCTCGGCG CCCTCTCGGT CCGGGACGCG GACGAACTCG CGCGCTGCGT GCGCGCGGCC AAGGCCGCCG GGCTCCTGGT GGTCCTCGAC GCGCACAATT TCGGCAAGCG CGACGGAAAG CCGATCGAGG CGCGGGACCT CACCAATCTC TGGTCGCGGC TCGCCGCGCG GTTCCGGGAC GAGCCGTCGG TGGCCTACGG CCTCATGAAC GAGCCGGTGG CCTTCGCGCC GCCCGCCTGG CGCCCGGTCG TCGACGCCCT CGTCAAGGCC ATCCGCGACG GCGGCTCGCG GCAGCTCCTG ATGGTCCCCG GCGCCGGCTG GAGCGGCGCC CATTCCTGGG TGTCGGACGG CAACGCCGCG GCCTTCGAGG ATTTCCAGGA CCCGCACTTC CTCTTCGAGG TCCACCAGTA CCTCGACCGG GACAATTCGG GCTCGAACCC GCAGGATTAC GCCCCGGGCG CCGGCGCGAC CCGGCTCGCT GCCTTCACGG ACTGGGCCCG GCGGCGCGGC GCCAAGGCCT TCCTGGGCGA GTTCGGCTTC GCCCTGCCCG CGGGCGAGGC CGAGGCGCGG GCGCTCCTCT CCTTCGTGGC CGCCCACACG GATGTCTGGC AGGCCTACGC CTACTGGGCC GGCGGACCGT GGTGGGGCGA TTACGCGTTC AGCATCGAGC CCGGCAAAGA GGGCGACAAG CCGCAGATGG CCCTGCTGAG GCACTTCATG TGA
|
Protein sequence | MPSEATRRAR PLPAAITRGA FLAALAGGVA AAAGTGAGPA RAGGAVRYPG VNLSGGEFGD IGRPLGQGYI YPPNESFAYY AGRGMKLVRI PFKIERVQPE PLGALSVRDA DELARCVRAA KAAGLLVVLD AHNFGKRDGK PIEARDLTNL WSRLAARFRD EPSVAYGLMN EPVAFAPPAW RPVVDALVKA IRDGGSRQLL MVPGAGWSGA HSWVSDGNAA AFEDFQDPHF LFEVHQYLDR DNSGSNPQDY APGAGATRLA AFTDWARRRG AKAFLGEFGF ALPAGEAEAR ALLSFVAAHT DVWQAYAYWA GGPWWGDYAF SIEPGKEGDK PQMALLRHFM
|
| |
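The sequence fields are printed in space-separated blocks of ten characters, so the blocks have to be joined before any calculation. The sketch below shows one way to do that, then compute GC content and check that the gene ends in a stop codon of translation table 11 (TAA, TAG, or TGA). The helper names are hypothetical, and the code assumes the input holds only the sequence text copied from the "Gene sequence" field.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code)
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean_sequence(raw)[-3:] in TABLE_11_STOPS


# Usage with a short illustrative fragment (first and last blocks of the gene sequence)
fragment = "ATGCCCTCTG TGA"
print(round(gc_percent(fragment), 1), ends_with_stop(fragment))
```

Passing the full "Gene sequence" field to `gc_percent` should return a value that can be compared with the "GC content" field of the record.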