Gene M446_3927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3927 
Symbol 
ID6134899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4375378 
End bp4376979 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content71% 
IMG OID641644085 
Productextracellular solute-binding protein 
Protein accessionYP_001770727 
Protein GI170742072 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0370442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCA GATCGCTGCT GAAGGCAATG GCCGGCGCGG GCGCGCTCGC CGCCGGACCC 
TCCTTCCCGG CCCCGGCCCT CGCCCAGGGC GCGGCCAAGA CCCTGCGCTT CGTCCCGCAG
GCGAACCTCG CCAATTTCGA CCCGATCTGG GGGACCCAGT ACGTGGTGCG CAACGCCGCG
GCCCTGGTCT GGGACACGCT CTACGGGGTC GACGCGCAGC TGCGGCCCCA GCGCCAGATG
GTCGAATCCG AGACCGTCTC GTCGGACGGG CTGACCTGGA CCTTCACCCT GCGGCCGGGC
TTGAAGTTCC ACGACGGCGA GCCGGTGCGG GCGCGGGACG CGGTGGCGAG CCTCGTGCGC
TGGTCGGCCC GCGACCCGAT GGGGCTGATG ATCCGGGCGA TCCAGGCGGA GCTCTCGGCG
GTCGACGACC GCAGCTTCCG CTGGGTGCTG ACCAAGCCCT ACCCGAAGAT GCTCCTGGCG
CTCGCCAAGA ACAACGCGCC CTGCTCCTTC GTGATGCCCG AGCGCATCGC CCAGACCGAC
CCGTTCAAGC AGATCACCGA GTATGTCGGC TCCGGGCCGA TGCGCTTCGC CCGCGACGAG
TGGGTGCCGG GGGCGCGGGC GGTGTTCACG CGCTTCGCCG ATTACGTCCC GCGCCAGGAG
CCGGCCTCCT GGCTCGCGGG CGGCAAGCAG ATCGCCTTCG ACCGGGTCGA GTGGATCATC
ATGCCGGACC CGGCCAGCGC CTCGGCGGCC CTGCAGAACG GCGAGGTCGA TTGGTGGGAG
AACCCGATCG CCGACCTCGT CCCGCTGCTC AAGAAGAACC GCAACATCCA GGTCGACATC
GCCGACCCGC TCGGCAACGT CGGCTCGTTC CGGATGAACA CGCTGCACCC GCCCTTCAAC
AACCAGCTGG TGCGCCGCGC GGTCCTGATG GCGATGAACC AGGAGGACTA CATGCGGGCG
ATCGTCGGCG ACGACGACGC GCTGTGGAAG CCGCTGCCCG GCTACTTCAC GCCCGGGACG
CCGCTCTACA ACGAGGAGGG CGGCGAGGTG GTCAAGCCCG GCGGCGACCT CGCGGCGGCC
AGGAAGCTCC TGGCCGAGAG CGGCTACAAG GGCGAGCCGG TGACCTGCGT GGTGGCGCAG
GACCAGCCGA TCACCAAGGC GCAGGGCGAC GTCACCGCCG ACCTGCTCAA GAAGCTCGGC
ATGAACGTCG ACTTCGTGGC GACCGACTGG GGCACCGTCG GCGCCCGCCG CGCCTCCAAG
GCGCCGCCCA AGGACGGCGG CTGGAGCATG TTCCACACCT GGCATGCCGG GGCGGATTGC
CTGAGCCCGG TCGGCTACAC GGCGATCCGG GCCAACGGCG ACAAGGCGTG GTTCGGCTGG
CCCGACAGCC CGCCGGTGGA GGCCGCGATC ACCGGCTGGT TCGAGGCGGC GACGCCGGAG
GACGAGAAGG CCGCCATGCG CCGCCTCAAC AAGGCCGCCC TCGACTACGT GGTCTACGTG
CCGACCGGCT TCTTCCTCAC CTACCAGGCG TGGCGGACAT CGCTGAGCGG CGTCACCAAG
GGCCCCCTGC CCTTCTTCTG GGGCGTGTCG AAATCGGCGT GA
 
Protein sequence
MDRRSLLKAM AGAGALAAGP SFPAPALAQG AAKTLRFVPQ ANLANFDPIW GTQYVVRNAA 
ALVWDTLYGV DAQLRPQRQM VESETVSSDG LTWTFTLRPG LKFHDGEPVR ARDAVASLVR
WSARDPMGLM IRAIQAELSA VDDRSFRWVL TKPYPKMLLA LAKNNAPCSF VMPERIAQTD
PFKQITEYVG SGPMRFARDE WVPGARAVFT RFADYVPRQE PASWLAGGKQ IAFDRVEWII
MPDPASASAA LQNGEVDWWE NPIADLVPLL KKNRNIQVDI ADPLGNVGSF RMNTLHPPFN
NQLVRRAVLM AMNQEDYMRA IVGDDDALWK PLPGYFTPGT PLYNEEGGEV VKPGGDLAAA
RKLLAESGYK GEPVTCVVAQ DQPITKAQGD VTADLLKKLG MNVDFVATDW GTVGARRASK
APPKDGGWSM FHTWHAGADC LSPVGYTAIR ANGDKAWFGW PDSPPVEAAI TGWFEAATPE
DEKAAMRRLN KAALDYVVYV PTGFFLTYQA WRTSLSGVTK GPLPFFWGVS KSA