Gene M446_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0104 
Symbol 
ID6131566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp119917 
End bp122865 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content78% 
IMG OID641640444 
Producthypothetical protein 
Protein accessionYP_001767123 
Protein GI170738468 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.351566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGG CGCTCCTCGC CGCGGGCGGG GGCGCCGCGC GGGCGGAGGC TCCGACTGCC 
CCGGTCGGGG TGATCCAGCT CGACGAGCGG CGCGCGCCCG TGATCCTCGG CAGGGACGGC
GCGCCGGCCG AGGATGCCGA GATCGTCGAC GAGAGCGCGC TGCGCTTCTA CGCCGCCCAG
CGCCAATCCG ACCGGGTCCA GGCCGAGATC GCCCGCCTGC GCCGCCGCTA CCCGAACTGG
AAGGTTCCGG CGGACCTCGA TTCCATCCGC CCGAGCCCGC CCGAGGAGGC CCCGATGTGG
GACCTGTTCA CGGCCGGCCG GTTCAACGAC CTGCGCGCCG CGATCGCGGC CCGCCAGGCG
AGCGATCCCG GCTGGCAGCC CTCGGACGAC CTCGCCCGCA AGCTCAACCG CGGCACGCTG
CGCAGCGAGG TCAAGGCCGC GGCCCGCGAG GGCGCCTGGC CGGAGGTGAT CGCCCGGGTC
CGGGCGGTGC CGGAGGCGAT GCGCGACCTC GACATCGACA TCGCCTGGCT CGTCGCGGAA
GCCTACGCCC GCGGCGGCGA TCCCGGCCAG GCCGCGAGCC TCTACCGGGG CATCCTGGCG
GGGCAGAAGG AATCCGCCCT GCGCCTCGCG ACGATCCAGA AGGCGATGGC GACCCTGCCG
ATCGGCCAGG TCGAGCCGCT CCTCGCCCTC GGTCAGCCGG CCGCGGACGG GAGCAACGAG
TTCGCGGCGC TGCGCCTCGA CGTGACCCGC GCCCGCATCG CCGCCATCCT GCACGACGAG
CCGGTGGGCC GGATCGAGCC GGCCGACCTC GCCGCCCTGA GCGACCACGT GCGCAAGCTC
GGCGAGCCGG ACCAGTTCAG CCTGCTCGGC TGGTACGCCT ACAAGCGGCG GCAATTCCGC
GAGGCGCTGG ACTGGTTCAA GGGCGCGATC GCCCGCGGGG GCAGGGCCAC CGTCGCGACC
GGGCTCGCCC TCTCGCTGCG CGAGGTCGGG CAGGAGCGCG ACGCCGAGGA GGTCGCCTTC
GCGTGGCGCG AGGGATCGGT CACCAACACG GCCCTCTACA TGGACCTGCT GGAGCGCCGT
CTCACCCAGT CCCCGGTGGC ACCCCTGGAG GCGCCGCGCC TGGACCGCTT CGCCAAGCTC
GTCCTCGCGA CCGGCTCGGG CGAGGGCGCC CAGGCGCTCG GCTGGTACGC CTACAATGCC
TGCCAGTTCG ACGCCGCCGC GGAGTGGTTC GAGCGGGCGA GCGCCTGGAA GCCCCGCGAG
GCCGCGATCC TCGGCTACGC CCTCTCGCTG ACCCGGCAGA AGCGCAACCG CGAGTTCCTG
GAAATCGTGA ACCGCTACGA CGGGCTGTTC CCGAAGGTGG TCGGGCTCCT GTTCACGGCG
CAGGACGAGG GCGCCGATCC GGGCCCCTGC GCGCCCCCCC GCTCGGGCCC CGCGCCCCAG
CGCGCGGCCC TGACCCGGCC GGCCGCGCCC CGCGTGCCGA AGCCGACCGC GGCCGCCCCC
GAGACGGTCC TGCCCGTCCG CCGCGGCGAG TTCCCCCAGG CGGTCGCGCC GGAGAACCCG
CTGCGCTTCA CCGCCCAGCC CGCGGGCGAG AAGGGCGGCG AGCGGGACGC CCTGCGCGAC
CTCCGCGCGG GGCCGCCGCC GCTGGTGGCC CGCCGGGTCC CCGGGGTCGG GCCGATGCCC
TACGAGCGCT ACGGCTACGC CCTGCTGCCG AGCTGGAACG GCTCGGACCG GGCGAGCCCG
CAGGACGGCC TGCCGCCCCC CGCGGTCGGA ACCATCGCGC AGGCCGAGGG AGCCGCCGCC
CCGGCCGGCC CCGCCCGCCC CACCCAGACC CGCGACGGGA GATCGCTTCG GCCTCTTCGA
CGCGCCGCCC GCCGCGATGC GGTGAGCGCA GCTTCCGGAT CGCCCATGAC CTCGCCCCGT
CCCTCCCTCG CCGCGGCGTC CCTCCGGATG GTCGCACTCG CCGCCGCGGC GGCGTCGCTC
GCCGCCTGCA CCGCGTCGGG CAGCGGCCGG CGCGTCCTGC GCGGCGCGGA GAGCGCCACG
TCCTTCAGCT CGCTCAACGG CTCGGCCCGG TCGCAGGCCC TCGTGCACCT GCCGGGCGCC
GAGGCCGGGC AGTCCATCCA GGTCGACGGC TACTCGCAGG GCCTGCGCCA GCGGATCAGC
TTCGGCCCGC GGGGCGGGGG TGGCGAGGGC TGGATCGACC TCGCGCTGCG CCGCCGCGGG
GCCGTGGACG GGCCCGCCAT GGCCAAGCCG ACGCGCTCGG GCATCGCGGC GGAACTGGCG
GCGCTGGCCG GAGGATCGGG CTACCGGATC TCGTCGCGCC CGGCCCGGAA CGCCTACGGG
CCGATGGGCT TCGCCCAGAA CGAGCGCTGC GTCTACGCGT GGCAGTGGAT CGAGGCGGTG
CCGAGCCTCG GCGCGGCCTT CCCGCCCGCC GCCTCGCCCG TCACCGCCTC GCTGCGCATC
CACCAGTGCC GCCGCGCGGG CACGCCGTCC GAGGCGCTGA TCGAGAATCT CGCGCGCCTG
CGGCTCGGGT CCGACGGTCC CGTGCCGGCG GACGCGCCCC CGCGCCGCCG GCCGGCATCC
CGTCGCGTCC TGGCGGCCGC GCCGGCCGCG GTCCCGCCCG CGGCGGCCGT CCCGCCGGCG
ATGGCGGCCG CGCCGGTCGC CCCGCGGCCC GCCGCCCCGG ACCGGGCCGC GACCCCGACC
TACCTCGCGC CGCCCGCCGG CCCGCCGGCG GCCGCCTCCC TGGCGCCGCC CGCGGTACCC
CGCCCGGGGC CGGCCGGCGC CACGCTCGCC CAGCCCCGCT TCATCACGGA TACCCTCCCG
ACCCCCGGGA TCGGCCGCAT GCCCGAGGCG GCCCCCGCGC CGCGCCCGCC CGCCCGCGCC
GAGCCGATCT CGGGCGAATT GCCGGCCCAG GCCTATCGCG GCCCGGCACC GCCGCCCTTC
GGCTGGTAA
 
Protein sequence
MLAALLAAGG GAARAEAPTA PVGVIQLDER RAPVILGRDG APAEDAEIVD ESALRFYAAQ 
RQSDRVQAEI ARLRRRYPNW KVPADLDSIR PSPPEEAPMW DLFTAGRFND LRAAIAARQA
SDPGWQPSDD LARKLNRGTL RSEVKAAARE GAWPEVIARV RAVPEAMRDL DIDIAWLVAE
AYARGGDPGQ AASLYRGILA GQKESALRLA TIQKAMATLP IGQVEPLLAL GQPAADGSNE
FAALRLDVTR ARIAAILHDE PVGRIEPADL AALSDHVRKL GEPDQFSLLG WYAYKRRQFR
EALDWFKGAI ARGGRATVAT GLALSLREVG QERDAEEVAF AWREGSVTNT ALYMDLLERR
LTQSPVAPLE APRLDRFAKL VLATGSGEGA QALGWYAYNA CQFDAAAEWF ERASAWKPRE
AAILGYALSL TRQKRNREFL EIVNRYDGLF PKVVGLLFTA QDEGADPGPC APPRSGPAPQ
RAALTRPAAP RVPKPTAAAP ETVLPVRRGE FPQAVAPENP LRFTAQPAGE KGGERDALRD
LRAGPPPLVA RRVPGVGPMP YERYGYALLP SWNGSDRASP QDGLPPPAVG TIAQAEGAAA
PAGPARPTQT RDGRSLRPLR RAARRDAVSA ASGSPMTSPR PSLAAASLRM VALAAAAASL
AACTASGSGR RVLRGAESAT SFSSLNGSAR SQALVHLPGA EAGQSIQVDG YSQGLRQRIS
FGPRGGGGEG WIDLALRRRG AVDGPAMAKP TRSGIAAELA ALAGGSGYRI SSRPARNAYG
PMGFAQNERC VYAWQWIEAV PSLGAAFPPA ASPVTASLRI HQCRRAGTPS EALIENLARL
RLGSDGPVPA DAPPRRRPAS RRVLAAAPAA VPPAAAVPPA MAAAPVAPRP AAPDRAATPT
YLAPPAGPPA AASLAPPAVP RPGPAGATLA QPRFITDTLP TPGIGRMPEA APAPRPPARA
EPISGELPAQ AYRGPAPPPF GW