Gene M446_5142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5142 
Symbol 
ID6131708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5651448 
End bp5653043 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content76% 
IMG OID641645277 
Product4-phytase 
Protein accessionYP_001771902 
Protein GI170743247 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.303319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0209906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCAG CCTGCCCCGC CCTCGTCGGC CTCGTCCTCG CCGCCCTCGC GGCCGCGGCG 
GGCGTTGCCG GGCGGGCGCG GGCCGCGGAG GTGCCCGACG ACGTGCTGGT GGTCGGGCAG
TCGGCCGAGC CCGCCTCCCT CGATCCCGGC GTCACCACCG CGACGAACGA CGCGCGCATC
CTCGTCAACC TCTACGACGG GCTGGTGCGC ACCAAGCCCG GCAGCCTGGA GATCGAGCCC
GCGCTCGCCG AGAGCTGGAG CCTCTCGGAG GATGGCCGCC GCTACACGTT CCGGCTGCGC
GCGGGCGTGC GCTTCCACGA CGGCAGCCCG CTCGACGCGC GGGCCGTGAC CTTCACCTTC
GGGCGGCTCC TCGAACCCGC CCACCCGGCG GCGGCGACCG GCCCCTTCCC GCTCGCCTTC
CTGTTCCGCG CGGTGGAGCG GGTCGAGGCC CTCGATCCCC GGACGGTGCG CTTCACCCTG
CGCCAGCCCT TCGCGCCCTT CCTGGCCAAC CTCGCGACGC CGACCGGGCT CATCGTCCCG
CCCGGGGCCG TGATGGCGCG GGGGAAGGAT TTCGGGCGCA ACCCGGTCGG GACCGGGCCG
TTCCGCTTCG AGGCGTGGCA GAGCAGCCGC AAGGTGACGC TCGCCCGCAA CCCGGGTTAC
TGGGGCGGGC CGGCCGCCTC GCGGCTCGTG ATCTTCCGCC CGCTCGCCGA CCCGAACACC
CGCGCGACCG AGATGCTGGC GGGCGACGTC GACGTCGTGG CGGAGATGCC GCCCGACGCC
CTCGCGCTGT TCCGGCACCG GGCCGGGTTC TCGGTCGCGG AGGCGGTCGG GCCCCACCTC
TGGTACCTGA TCCTGAATAT GCGGGCCGGG CCGCTGCGGG ACCGCCGGGT GCGCGAGGCG
GTGAACTGGG CCATCGACCG GCGGGCGCTG GCCGAGCACG TGCTGCAGGG CACGGCCGTG
CCGGCGCGCG GGATCATCGC CCCGGCCTTC GCGGGCACCT ACGATCCCGA CCTCGCGGGC
TACGGCCACG ATCCCGCCCG CGCCCGCGCC CTGCTGCGCG AGGCCGGGGC GGAGGGGGCG
CGGCTCACGC TCCTCGTCGC CGAGGGCGGG TCGGGGATGC TCGACCCGGT GGCGATGGGC
ACGGCGATCC AAGCCGACCT CGCCCGCGTG GGCCTCGACG TCCGGCTCGT CACCTACGAG
TGGAACGCCT ACCTGGCCCG GGTCAATCGC GGCCTCGGCG AGGACGCCGA CATGGCCGAC
ATGGCCGAGA TGGCCTGGAT GACCAACGAT CCCGACCAAT TGCCCTCGCT CGCCCTCGCG
AGCGACGCGC TGCCGGGGAA GGGCGGCTTC AACGCGGGCG GCTACGCCAA CCCGGATCTC
GACCGGCTCC TCGACGAGGC CCGCCGCAGC ACCGACCCGG CGCGCCGCCG GGATCTCGAC
CGGGCCGCGG AGCGCCTCGT CGTGGCGGAC GCGCCCTTCG CGGTCGTGGT CCACGGCAAG
CAGGCGGCGG TGGTGCGGGA GGCCGTGCGC GGCTTCGCCC TCGACCCGAC CTTCACGGCC
CGCCTCGCCG GCGTGCGCAA GCGCGAGGGG CCGTGA
 
Protein sequence
MSPACPALVG LVLAALAAAA GVAGRARAAE VPDDVLVVGQ SAEPASLDPG VTTATNDARI 
LVNLYDGLVR TKPGSLEIEP ALAESWSLSE DGRRYTFRLR AGVRFHDGSP LDARAVTFTF
GRLLEPAHPA AATGPFPLAF LFRAVERVEA LDPRTVRFTL RQPFAPFLAN LATPTGLIVP
PGAVMARGKD FGRNPVGTGP FRFEAWQSSR KVTLARNPGY WGGPAASRLV IFRPLADPNT
RATEMLAGDV DVVAEMPPDA LALFRHRAGF SVAEAVGPHL WYLILNMRAG PLRDRRVREA
VNWAIDRRAL AEHVLQGTAV PARGIIAPAF AGTYDPDLAG YGHDPARARA LLREAGAEGA
RLTLLVAEGG SGMLDPVAMG TAIQADLARV GLDVRLVTYE WNAYLARVNR GLGEDADMAD
MAEMAWMTND PDQLPSLALA SDALPGKGGF NAGGYANPDL DRLLDEARRS TDPARRRDLD
RAAERLVVAD APFAVVVHGK QAAVVREAVR GFALDPTFTA RLAGVRKREG P