Gene M446_2210 details

Gene Information

Locus tag: M446_2210
Symbol:
ID: 6132952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2462285
End bp: 2463877
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 68%
IMG OID: 641642437
Product: extracellular solute-binding protein
Protein accession: YP_001769105
Protein GI: 170740450
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
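
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2462285
END_BP = 2463877

def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1

def protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS, not counting the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1593
print(protein_length(length_bp))  # 530
```

Both results agree with the Gene Length and Protein Length fields.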


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.046819
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCGG GTCTGACGAG GCGGATGGCG GTGGCGGCCG CGGCGGCGCT GGCACTGACG 
GGCGGCCCGG CCGCCGCGCA GGGCGGGGCC GCCCAGGGCG GGAAGGTGCT CAAGGTCGTG
ATGCATTCGG GCCTGCGCAT CACCGACCCG ATCATCACCA CCGCCTACAT CGCCCGCAAT
CACGGCTACA TGATCTACGA CACGCTCTTC GCCACCGACG AGCGGTTCGA GATCAAGCCC
CAGATGGTCA AGGACTACGC CGTCAGCGAC GACAAGCTGA CCTACACGTT CACCCTGCGC
GACGGGCTCA AGTTCCATGA CGGGGCGCCC GTCACGGCCG AGGATTGCAT CGCGTCGATC
CAGCGCTGGG GCAAGCGCGA CGGCATGGGC CAGAAGCTGA TGGAGTACAC CGCCGCCCTG
AAGGCCGTGG ACGACAGGAC CTTCACGCTC ACCCTGAAGC AGCCCTACGG CCTCGTGCTC
GCCTCCCTGG GCAAGCCGTC GTCGAACGTG CCCTTCATCA TGCCCAGGCG CATCGCCGAG
ACCCCGGCCG ACAGGAACGT GCCGGAGGAG ATCGGCTCCG GCCCGTTCCG CTTCGTCAAG
GCCGAGTTCC AGCCCGGCCT CAAGGCCGTC TACGAGAAGA ACCCCGACTA CGTGCCGCGC
GCCGAGCCGC CGAGCTGGCT CGCGGGCGGC AAGGTCGTCA AGCTCGACCG CGTCGAGTGG
ATCAACATCC CAGACTACCA GACCGCCGTG AACGCCCTCA TCAACGGCGA AATCGACTAC
ATCGAGCAGC CCCCGCACGA CTTCCTGCCG ATCCTGAAGG ACGCCAAGGG CGTCGTGATC
AACAACTACA ATCCGCTCGG CTTCTCCGGC ATGGTGCGGA TGAACTGGCT CAACCCGCCC
TTCGACAATC CCAAGATCCG GCAGGCGGTG ATGCTCGCGC TCACCCAGCA GGATTACCTC
GACGCGCAGA TCGGCAATCC GGACTACATG CAGCTCTGCA TGGCGCTGTT CGTCTGCGGC
ACGCCGAACG CCACAGAGGC GGGCGCGCCC AAGACCGACC TCGCCCGCGC CAAGCAGCTC
CTCAAGGAGG GCGGCTATGA CGGGCGGCCG GTGGTGATCA TGCAGCCGAC CGACCTCGCC
ATCGTGGCGC CGCTCGGCCC CGTCACCGCC CAGGCCCTGC GCGCCATCGG CATGAAGGTC
GACCTGCAAT CGATGGACTG GCAGACCCTG GTCGGGCGCC GGGCCAAGCA GGATCCCGTC
GACCAGGGCG GCTGGAACAT CTTCCACACC ACCTGGGTCA ACGCCGACAT GCTGAACCCG
ATCGCCAATG TCGGGGTGAA CGGCAAGGGC AGGACCGGCG GCTGGTTCGG CTGGGCCGAG
GACAAGGAGA TCGAGGCGAT GCGGGACGCC TATGCCCGCG AGACCGACCC GGCCAAGCAG
AAGCAGATCG CCGCCGACGT GCAGAAGCGG GCCTTCGAGG TCGGCATGTA CTACCCGACC
GGGCAGTACA CGGCGCCGCT GGCGGTGCGG GCCAGCCTCA AGGGGATCCT GCAGGGGCCG
GCGCCGGTGT TCTGGAACAT CGAGAAGCCG TGA
 
Protein sequence
MRAGLTRRMA VAAAAALALT GGPAAAQGGA AQGGKVLKVV MHSGLRITDP IITTAYIARN 
HGYMIYDTLF ATDERFEIKP QMVKDYAVSD DKLTYTFTLR DGLKFHDGAP VTAEDCIASI
QRWGKRDGMG QKLMEYTAAL KAVDDRTFTL TLKQPYGLVL ASLGKPSSNV PFIMPRRIAE
TPADRNVPEE IGSGPFRFVK AEFQPGLKAV YEKNPDYVPR AEPPSWLAGG KVVKLDRVEW
INIPDYQTAV NALINGEIDY IEQPPHDFLP ILKDAKGVVI NNYNPLGFSG MVRMNWLNPP
FDNPKIRQAV MLALTQQDYL DAQIGNPDYM QLCMALFVCG TPNATEAGAP KTDLARAKQL
LKEGGYDGRP VVIMQPTDLA IVAPLGPVTA QALRAIGMKV DLQSMDWQTL VGRRAKQDPV
DQGGWNIFHT TWVNADMLNP IANVGVNGKG RTGGWFGWAE DKEIEAMRDA YARETDPAKQ
KQIAADVQKR AFEVGMYYPT GQYTAPLAVR ASLKGILQGP APVFWNIEKP
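
The sequences above are printed in 10-character blocks separated by spaces, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs in the set of permitted start codons, which is not handled here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence listed above.
print(translate("ATGCGGGCGG GTCTGACGAG GCGGATGGCG"))  # MRAGLTRRMA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` can be compared against the GC content field in the Gene Information section.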