Gene M446_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2421 
Symbol 
ID6132794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2683842 
End bp2684915 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content78% 
IMG OID641642643 
ProductGp37Gp68 family protein 
Protein accessionYP_001769311 
Protein GI170740656 
COG category[S] Function unknown 
COG ID[COG4422] Bacteriophage protein gp37 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.139852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGGT CCGAGCACGG CGATGCGCCG CCGGACGCGA CCTGGAACGT CATCACCGGC 
TGCTCGGCGG TCTCCGCCGG CTGCCTCCGC TGCGACGCGA TGCGGCTGTC CGGCACCCGG
CTCCGGCACC ACCCCTCGCG GGCGGGGCTG ACGCGGGACG GCGAGGCGGG TCCCGTCTGG
ACGGGGGAGG TGCGGTTCAA CGAGGCGTGG CTGGACCGGC CGCTGCGCTG GCGCCGGCCG
CGGATGATCC GCGTCTGCGC GCACGGCGAC CTGTTCGCGG AGGCCGTCCC GGAGAGGTGG
ATCGACCGCG TCTTCGCCGT CATGGCGCTG GCGCCGCACC ACACCTTCCA GGTGCTGACG
AAGCGGTCGG CGCGCATGCG GGCCTACCTC GGCGATCGCG GCCGCGGCGG CCGGATCATG
GACGAGCTCG GGCACGCCGC CCGCGCGCAT GCCGGCGCCC GGGCGCTGGC GGACGGGCTG
AGGGACCTCC TCGTGGTGCA GCGCAGGCCG CTGCCGAACG TCTGGCTCGG CGTCTCGGCC
GAGGACCAGG GCCGCGCCAA CGAGCGGATC CTCGACCTGC TCGCGACGCC CGCCGCCCTG
CGCTGGATCT CGGCGGAGCC GCTGCTCGGC CCCGTCGACC TGCGCGCGCT CCGAATCGCC
GCCCATCTCG ACCTCGACGC GCTGACGGGC GCCGTCTCGG CGAGCGCGAG GCTCGATGGA
GGGGCCGACT TCCGCACCGG CCTCGCCGCG CTCCCGCTGC TGCCCGAGGG GCGGCAGGGC
CTCGACTGGG TGGTCGTCGG CGGCGAGACG GGACCGGGGA GCCGGCCGAT GCATCCGGAT
TGGGCCCGCG CGCTGCGGGA CCAGTGCGCG GCGGCCGGAG TCGCCTTCCG CTTCCATCAG
CACGGGGACG GGAGCGTGGG CGGCGGACCG CCCGAGCCCG ACGGAGGGCT CGCGGAGGGC
GGGCGGACCG GCGGGAGGGC CGCCCGTCGC CTCCTCGACG GCCGCAGCTG GGCCGAGGTG
CCGGACGGCC GCCGGCCCCC CGAGACGCGC CGCGGGGAAG CGGCGCGCGG TTGA
 
Protein sequence
MTRSEHGDAP PDATWNVITG CSAVSAGCLR CDAMRLSGTR LRHHPSRAGL TRDGEAGPVW 
TGEVRFNEAW LDRPLRWRRP RMIRVCAHGD LFAEAVPERW IDRVFAVMAL APHHTFQVLT
KRSARMRAYL GDRGRGGRIM DELGHAARAH AGARALADGL RDLLVVQRRP LPNVWLGVSA
EDQGRANERI LDLLATPAAL RWISAEPLLG PVDLRALRIA AHLDLDALTG AVSASARLDG
GADFRTGLAA LPLLPEGRQG LDWVVVGGET GPGSRPMHPD WARALRDQCA AAGVAFRFHQ
HGDGSVGGGP PEPDGGLAEG GRTGGRAARR LLDGRSWAEV PDGRRPPETR RGEAARG