Gene M446_4602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4602 
Symbol 
ID6134194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5060834 
End bp5061820 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content71% 
IMG OID641644741 
Productglutathione S-transferase domain-containing protein 
Protein accessionYP_001771376 
Protein GI170742721 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0435] Predicted glutathione S-transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.957897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.248638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTGC TCGTCAACGG CGTCTGGCAG GATCGCTGGT ACGACACGAA GGAGACCGGC 
GGGCGCTTCG TGCGCAAGGA CTCCGCCTTC CGCAACTGGG TGACGGCGGA CGGCGCGCCC
GGCCCCACCG GCACCGGCGG CTTCCGGGCC GAGCCCGGGC GCTACCACCT CTACGTCTCC
CTCGCCTGCC CCTGGGCGCA CCGCACGCTG ATCGTGCGCG CCCTGAAGGG GCTGGAGCAG
GCGATCTCGG TCGCGGTCGT CGATCCGCTG ATGGGCGCCG AGGGCTGGGT CTTCGGCGAT
TCGCCCGGGG CCGGGCCCGA CACCGTGAAC GGGGCCGCGC GCCTGTCCGA GATCTACGTC
ATGGCCGATC CGCACTACAC GGGGCGGGTC ACCGTCCCGG TCCTGTGGGA CCGGGAGCGC
CGCACGATCG TGTCGAACGA ATCGGCCGAG ATCATCCGGA TGCTGAACCG CGCCTTCGAC
GGCGCGGGCG CGCGCGGCCC GGACCTCTGC CCGGACGCCC TGCGGGAGGC GATCGACGCC
CTCAACGCCC GCGTCTACGA CCGGGTCAAC AACGGGGTCT ACAAGGCGGG CTTCGCCACC
AGCCAGGCGG CCTACGCGGA GGCGGTGACG GCCCTGTTCG AGGAACTCGA CGCCCTGGAC
GCGCGGCTCG ACCGCGGTCG GTTCCTGTTC GGCCCGACGC TGACGGAGGC GGATATCCGG
CTCTTCACCA CGCTCGTCCG CTTCGACCCG GTCTATGTCG GGCACTTCAA GTGCAACCTG
CGGCGGATCG CGGATTATCC CAACCTCGCG CCCTACCTGC GCGACCTCTA CCAGCATCCG
GCGATCCGCC CGACCGTGGA CCTGACCCAC ATCAAGCGCC ACTATTACGA GAGCCACGCC
AGCATCAACC CGACCGGGAT CGTCCCGCTG GGCCCGATCC TCGATTACGA CGCGCCGCAC
GGGCGGGCTG AGCGCTTCGC GGCCTGA
 
Protein sequence
MGLLVNGVWQ DRWYDTKETG GRFVRKDSAF RNWVTADGAP GPTGTGGFRA EPGRYHLYVS 
LACPWAHRTL IVRALKGLEQ AISVAVVDPL MGAEGWVFGD SPGAGPDTVN GAARLSEIYV
MADPHYTGRV TVPVLWDRER RTIVSNESAE IIRMLNRAFD GAGARGPDLC PDALREAIDA
LNARVYDRVN NGVYKAGFAT SQAAYAEAVT ALFEELDALD ARLDRGRFLF GPTLTEADIR
LFTTLVRFDP VYVGHFKCNL RRIADYPNLA PYLRDLYQHP AIRPTVDLTH IKRHYYESHA
SINPTGIVPL GPILDYDAPH GRAERFAA