Gene M446_0892 details


Gene Information

Locus tag: M446_0892
Symbol:
ID: 6129194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1004572
End bp: 1006362
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 67%
IMG OID: 641641202
Product: extracellular solute-binding protein
Protein accession: YP_001767876
Protein GI: 170739221
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
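
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1004572
end_bp = 1006362

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1791
print(protein_length_aa)  # 596
```

Under those assumptions the computed values agree with the listed Gene Length (1791 bp) and Protein Length (596 aa).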


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.353929
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACGAAC GTGACCTGCG GACGCTGATC GGCGCGGTGA AGGACGGACG GCTCTCCCGG 
CGGCACTTCG TGCAGCGGAT GATCGCGCTC GGCCTCACCG CCCCCATGGC CGGCATGATG
CTGGCGCAGG AGGGCGTCGC CCAGACCGCC GAGGGGACAT CCGCCTACAA GCCCGCCAAG
GCCGGGGGCG GGGGGGCGCT CAAGCTGCTG TTCTGGCAGG CCGCGACCCT GATCAACCCG
CATTTCGCGG TCGGCACCAA GGACCAGGAG GGCTCGCGCA TCTTCTACGA GCCCCTCGCC
GCCTGGGACG CCGAGGGCAA CCTGTTCCCG GTGCTCGCCG CCGCGATCCC GAGCCGCCAG
AACGGCGGCG TCGCCGAGGA TGGCCGCTCG GTGACCTGGA AGCTCAAGCC CGGCGTGACG
TGGCACGACG GCAAGCCCTT CTCGGCCGAC GACGTCGTCT TCACCTGGCA ATACGCCGCC
AACCCCGCCA CCGCGGCGGT GACCTCCGGC AGCTACAAGG ACATCAAGGT CGAGAAGGTC
GACGACCTCA CCGTCAAGGT TCTGTTCGAC AAGCCGACGC CGTTCTGGGC CGACGCCTTC
GTGTCCTCGG CCGGGATGAT CATCCCCAAG CACCATTTCG AGGCCTATAT CGGCGACAAG
TCGCGCGACG CGCCCGCCAA CCTCGCGCCG GTCGGCACCG GCCCCTACAA GTTCGCGGAG
TTCCGCCCCG GCGACATCGT GCGCGGCGTG CGCAACCCCG ACTACCATAT GCCGAACCGG
CCCTCCTTCG ACACGATCGA GATGAAGGGC GGCGGCGACG CGGTCTCGGC GGCCCGCGCC
GTGCTGCAGA CCGGCGAGTA CGACTACGCC TGGAACATGC AGGTCGAGGA CGAGATCCTC
AAGCGCCTCG AGGCCGAGGG CAGGGGCCGC GTCGAGATCG TCTACGGCGG CAACCTCGAA
TTCATCCTGC TCAACGCCAC CGATCCCTGG ACCGAGGTCG ACGGCGAGCG CGCCTCGCTC
AAGACCAAGC ACCCGGCCTT CTCGGACCCG GCGGTGCGCA AGGCCATGAA CCTGATCGTC
AACCGGGCGG CCGTGCAGCA ATTCATCTAC GGTCGCACCG GCCGGGCGAC CGCCAACGTG
CTCAACGGCC CCGAGCGCTT CCGCTCGAAG AACACCAGCT TCGCCTTCGA CACCGACAAG
GCCGCGCAGA TCCTGGAGGA GGCCGGCTGG AAGAAGGGCG GCGACGGCAT CCGCGCCAAG
GACGGAAAGA AACTCAAATT CGTCTACCAG ACCTCCATCA ACGCCCCCCG CCAGAAGACC
CAGGCCATCG TCAAGCAGGC TGCCCAAAAG GCCGGCATCG ACATGGAACT GAAATCGATT
CCCGGCTCCG TGTTCTTCTC CTCGGACGTC GCCAACCCGG ACACCTACCC GCACTTCTAC
GCCGACATGG AGATGTACAC CTGGAACATG GCGCAGGCCG ATCCGGGCGT GTTCATGCTG
CAATACGTGT CCTGGGAGGC GGCCACCAAG GAGAACAAGT GGCAGGGCCG CAACATCTGC
CGGATGCGCA ACGACGAGGC CGATGCCTGC TACCGCGCGG CGCAGGGCGA ACTCGACGCG
GTCAAGCGCG CGGCCCTCTT CATCAAGATG AACGACATCG TGGCCTCCGA ATACGTCATG
CCGCTCCTCC ACCGGGCCCA GGTCTCGGCG GTCGGGGCCA AGCTCCAGGC GCCGTCGAGC
GGCTGGGACA ATTCGCTCGC CTTCCTGTTC GACTGGTACA AGGAGGCGTG A
 
Protein sequence
MNERDLRTLI GAVKDGRLSR RHFVQRMIAL GLTAPMAGMM LAQEGVAQTA EGTSAYKPAK 
AGGGGALKLL FWQAATLINP HFAVGTKDQE GSRIFYEPLA AWDAEGNLFP VLAAAIPSRQ
NGGVAEDGRS VTWKLKPGVT WHDGKPFSAD DVVFTWQYAA NPATAAVTSG SYKDIKVEKV
DDLTVKVLFD KPTPFWADAF VSSAGMIIPK HHFEAYIGDK SRDAPANLAP VGTGPYKFAE
FRPGDIVRGV RNPDYHMPNR PSFDTIEMKG GGDAVSAARA VLQTGEYDYA WNMQVEDEIL
KRLEAEGRGR VEIVYGGNLE FILLNATDPW TEVDGERASL KTKHPAFSDP AVRKAMNLIV
NRAAVQQFIY GRTGRATANV LNGPERFRSK NTSFAFDTDK AAQILEEAGW KKGGDGIRAK
DGKKLKFVYQ TSINAPRQKT QAIVKQAAQK AGIDMELKSI PGSVFFSSDV ANPDTYPHFY
ADMEMYTWNM AQADPGVFML QYVSWEAATK ENKWQGRNIC RMRNDEADAC YRAAQGELDA
VKRAALFIKM NDIVASEYVM PLLHRAQVSA VGAKLQAPSS GWDNSLAFLF DWYKEA
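
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on translation table 11 using the same amino acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Amino acid assignments in NCBI base order T, C, A, G
# (shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above; both start in frame
# because every full line holds 60 bases (20 codons).
first_line = "ATGAACGAAC GTGACCTGCG GACGCTGATC GGCGCGGTGA AGGACGGACG GCTCTCCCGG"
last_line = "GGCTGGGACA ATTCGCTCGC CTTCCTGTTC GACTGGTACA AGGAGGCGTG A"

print(translate(first_line))  # MNERDLRTLIGAVKDGRLSR
print(translate(last_line))   # GWDNSLAFLFDWYKEA
```

Both outputs match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.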