Gene M446_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1077 
Symbol 
ID6131532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1200063 
End bp1201085 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content74% 
IMG OID641641368 
Productglycoside hydrolase family protein 
Protein accessionYP_001768040 
Protein GI170739385 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.257076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.422372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCTG AGGCGACGAG GCGGGCGCGG CCGCTTCCCG CCGCGATCAC CCGCGGCGCC 
TTCCTGGCCG CCCTCGCCGG CGGCGTCGCC GCGGCGGCCG GGACGGGGGC GGGCCCGGCG
CGGGCCGGCG GGGCGGTCCG CTACCCCGGC GTCAACCTGT CGGGCGGGGA GTTCGGCGAC
ATCGGCCGCC CCCTCGGCCA GGGCTACATC TACCCGCCGA ACGAGAGTTT CGCCTACTAC
GCCGGGCGCG GCATGAAGCT CGTGCGGATC CCGTTCAAGA TCGAGCGGGT GCAGCCGGAG
CCCCTCGGCG CCCTCTCGGT CCGGGACGCG GACGAACTCG CGCGCTGCGT GCGCGCGGCC
AAGGCCGCCG GGCTCCTGGT GGTCCTCGAC GCGCACAATT TCGGCAAGCG CGACGGAAAG
CCGATCGAGG CGCGGGACCT CACCAATCTC TGGTCGCGGC TCGCCGCGCG GTTCCGGGAC
GAGCCGTCGG TGGCCTACGG CCTCATGAAC GAGCCGGTGG CCTTCGCGCC GCCCGCCTGG
CGCCCGGTCG TCGACGCCCT CGTCAAGGCC ATCCGCGACG GCGGCTCGCG GCAGCTCCTG
ATGGTCCCCG GCGCCGGCTG GAGCGGCGCC CATTCCTGGG TGTCGGACGG CAACGCCGCG
GCCTTCGAGG ATTTCCAGGA CCCGCACTTC CTCTTCGAGG TCCACCAGTA CCTCGACCGG
GACAATTCGG GCTCGAACCC GCAGGATTAC GCCCCGGGCG CCGGCGCGAC CCGGCTCGCT
GCCTTCACGG ACTGGGCCCG GCGGCGCGGC GCCAAGGCCT TCCTGGGCGA GTTCGGCTTC
GCCCTGCCCG CGGGCGAGGC CGAGGCGCGG GCGCTCCTCT CCTTCGTGGC CGCCCACACG
GATGTCTGGC AGGCCTACGC CTACTGGGCC GGCGGACCGT GGTGGGGCGA TTACGCGTTC
AGCATCGAGC CCGGCAAAGA GGGCGACAAG CCGCAGATGG CCCTGCTGAG GCACTTCATG
TGA
 
Protein sequence
MPSEATRRAR PLPAAITRGA FLAALAGGVA AAAGTGAGPA RAGGAVRYPG VNLSGGEFGD 
IGRPLGQGYI YPPNESFAYY AGRGMKLVRI PFKIERVQPE PLGALSVRDA DELARCVRAA
KAAGLLVVLD AHNFGKRDGK PIEARDLTNL WSRLAARFRD EPSVAYGLMN EPVAFAPPAW
RPVVDALVKA IRDGGSRQLL MVPGAGWSGA HSWVSDGNAA AFEDFQDPHF LFEVHQYLDR
DNSGSNPQDY APGAGATRLA AFTDWARRRG AKAFLGEFGF ALPAGEAEAR ALLSFVAAHT
DVWQAYAYWA GGPWWGDYAF SIEPGKEGDK PQMALLRHFM