Gene M446_3617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3617 
Symbol 
ID6129449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4034963 
End bp4036099 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content73% 
IMG OID641643784 
Producthypothetical protein 
Protein accessionYP_001770432 
Protein GI170741777 
COG category[S] Function unknown 
COG ID[COG3287] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0308687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0797948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCGC ATGCCTGCGG CGTGCAGACG GCGTGGACCG AGGCGGACGG GGCCGCGCAG 
GCGGCCGAGG CCATCGCCGC CGCCCTCGAC CGGGACGAGG TCGGCCACCT CCTGGTGTTC
TTCTCCGCGG AATACGACGC CGCCGCCCTG GCCGAGGCGC TGCGGACCCG GTTTCCGGGC
ATCGGGGTCG CGGGCTGCAC CGCCTCGGGA GAGATCTGCG CCGCGGGCGG CCTGGAGCGG
GGTCTCGTCG CCGTCGCCTT CCCGCGCGAG GGCTTCCGCG TGGTCTCGAC GGTCCTCACC
GGCATCGATT GCCTCGACGG GGAGGCGACG GTCGCGGCGA TGCGTGGGCT GCGCGCGCGG
CTCGGCCAGG CCGCCCCCGT CCCGCTGCAC CGCTTCGCCC TGTCGCTGAT CGACGGCCGC
ACCCACGCGG AGGAGAGGGT CATCTCCGCG GTGGCCTGGG GCCTCGACGC CATCCCCCTG
GTGGGCGGTT CGGCCGGCGA CGCCCTGACC TTCTCGCGCA CCGCGCTGAT CCACGACGGG
CAGGCGTACC GCAACGCCGC CGTGGTGGCG GTCGTGGAGA CGACCTACCC GGTCGAGATC
TTCAAGATCG ACAATTTCGA GCCGACGCCG GTGAAGTTCG TGGTGACCGA GACCGACGCG
GCCAACCGCA CGGTGCGGGA GCTCAACGCC GAGCCGGCGG CCGCCGAGTA CGCGCGCGCG
GTCGGCCTGC GCCCGGGCGA GCTCTCGCCG ATGACCTTCG CGACCCACCC GCTGGTGGTG
CGGGTGGGGG GCGACTATTT CTGCCGCGCG ATCCGCCGCC TCAACCCGGA CGGCTCGCTC
GGCCTGTTCT GCGCGATCGA CGAGGGGGTG GTGCTGACCC TGGCGCGCCA GCGCGACCTC
CTGGCCTCGA CCGAGGAGGC TCTGGTCGAC CTCGACGCGC GCCTGGGCGG CCTCGACCTC
GTGATCGGCT TCGAGTGCGT GCTGCGCCGG CTCGACGCCG AGATGCACCA GATCCGACAC
CCGATCTCGG AACTCTATCG GAAGTACGGT GTGGTCGGCT TCGAGACGTT CGGGGAGCAG
TACCGATCGA CCCACCTCAA CCAGACCTTC ACGGGGATCG CGATCGGCCG GATGTGA
 
Protein sequence
MRAHACGVQT AWTEADGAAQ AAEAIAAALD RDEVGHLLVF FSAEYDAAAL AEALRTRFPG 
IGVAGCTASG EICAAGGLER GLVAVAFPRE GFRVVSTVLT GIDCLDGEAT VAAMRGLRAR
LGQAAPVPLH RFALSLIDGR THAEERVISA VAWGLDAIPL VGGSAGDALT FSRTALIHDG
QAYRNAAVVA VVETTYPVEI FKIDNFEPTP VKFVVTETDA ANRTVRELNA EPAAAEYARA
VGLRPGELSP MTFATHPLVV RVGGDYFCRA IRRLNPDGSL GLFCAIDEGV VLTLARQRDL
LASTEEALVD LDARLGGLDL VIGFECVLRR LDAEMHQIRH PISELYRKYG VVGFETFGEQ
YRSTHLNQTF TGIAIGRM