Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0892 |
Symbol | |
ID | 6129194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1004572 |
End bp | 1006362 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641641202 |
Product | extracellular solute-binding protein |
Protein accession | YP_001767876 |
Protein GI | 170739221 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.353929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC GTGACCTGCG GACGCTGATC GGCGCGGTGA AGGACGGACG GCTCTCCCGG CGGCACTTCG TGCAGCGGAT GATCGCGCTC GGCCTCACCG CCCCCATGGC CGGCATGATG CTGGCGCAGG AGGGCGTCGC CCAGACCGCC GAGGGGACAT CCGCCTACAA GCCCGCCAAG GCCGGGGGCG GGGGGGCGCT CAAGCTGCTG TTCTGGCAGG CCGCGACCCT GATCAACCCG CATTTCGCGG TCGGCACCAA GGACCAGGAG GGCTCGCGCA TCTTCTACGA GCCCCTCGCC GCCTGGGACG CCGAGGGCAA CCTGTTCCCG GTGCTCGCCG CCGCGATCCC GAGCCGCCAG AACGGCGGCG TCGCCGAGGA TGGCCGCTCG GTGACCTGGA AGCTCAAGCC CGGCGTGACG TGGCACGACG GCAAGCCCTT CTCGGCCGAC GACGTCGTCT TCACCTGGCA ATACGCCGCC AACCCCGCCA CCGCGGCGGT GACCTCCGGC AGCTACAAGG ACATCAAGGT CGAGAAGGTC GACGACCTCA CCGTCAAGGT TCTGTTCGAC AAGCCGACGC CGTTCTGGGC CGACGCCTTC GTGTCCTCGG CCGGGATGAT CATCCCCAAG CACCATTTCG AGGCCTATAT CGGCGACAAG TCGCGCGACG CGCCCGCCAA CCTCGCGCCG GTCGGCACCG GCCCCTACAA GTTCGCGGAG TTCCGCCCCG GCGACATCGT GCGCGGCGTG CGCAACCCCG ACTACCATAT GCCGAACCGG CCCTCCTTCG ACACGATCGA GATGAAGGGC GGCGGCGACG CGGTCTCGGC GGCCCGCGCC GTGCTGCAGA CCGGCGAGTA CGACTACGCC TGGAACATGC AGGTCGAGGA CGAGATCCTC AAGCGCCTCG AGGCCGAGGG CAGGGGCCGC GTCGAGATCG TCTACGGCGG CAACCTCGAA TTCATCCTGC TCAACGCCAC CGATCCCTGG ACCGAGGTCG ACGGCGAGCG CGCCTCGCTC AAGACCAAGC ACCCGGCCTT CTCGGACCCG GCGGTGCGCA AGGCCATGAA CCTGATCGTC AACCGGGCGG CCGTGCAGCA ATTCATCTAC GGTCGCACCG GCCGGGCGAC CGCCAACGTG CTCAACGGCC CCGAGCGCTT CCGCTCGAAG AACACCAGCT TCGCCTTCGA CACCGACAAG GCCGCGCAGA TCCTGGAGGA GGCCGGCTGG AAGAAGGGCG GCGACGGCAT CCGCGCCAAG GACGGAAAGA AACTCAAATT CGTCTACCAG ACCTCCATCA ACGCCCCCCG CCAGAAGACC CAGGCCATCG TCAAGCAGGC TGCCCAAAAG GCCGGCATCG ACATGGAACT GAAATCGATT CCCGGCTCCG TGTTCTTCTC CTCGGACGTC GCCAACCCGG ACACCTACCC GCACTTCTAC GCCGACATGG AGATGTACAC CTGGAACATG GCGCAGGCCG ATCCGGGCGT GTTCATGCTG CAATACGTGT CCTGGGAGGC GGCCACCAAG GAGAACAAGT GGCAGGGCCG CAACATCTGC CGGATGCGCA ACGACGAGGC CGATGCCTGC TACCGCGCGG CGCAGGGCGA ACTCGACGCG GTCAAGCGCG CGGCCCTCTT CATCAAGATG AACGACATCG TGGCCTCCGA ATACGTCATG CCGCTCCTCC ACCGGGCCCA GGTCTCGGCG GTCGGGGCCA AGCTCCAGGC GCCGTCGAGC GGCTGGGACA ATTCGCTCGC CTTCCTGTTC GACTGGTACA AGGAGGCGTG A
|
Protein sequence | MNERDLRTLI GAVKDGRLSR RHFVQRMIAL GLTAPMAGMM LAQEGVAQTA EGTSAYKPAK AGGGGALKLL FWQAATLINP HFAVGTKDQE GSRIFYEPLA AWDAEGNLFP VLAAAIPSRQ NGGVAEDGRS VTWKLKPGVT WHDGKPFSAD DVVFTWQYAA NPATAAVTSG SYKDIKVEKV DDLTVKVLFD KPTPFWADAF VSSAGMIIPK HHFEAYIGDK SRDAPANLAP VGTGPYKFAE FRPGDIVRGV RNPDYHMPNR PSFDTIEMKG GGDAVSAARA VLQTGEYDYA WNMQVEDEIL KRLEAEGRGR VEIVYGGNLE FILLNATDPW TEVDGERASL KTKHPAFSDP AVRKAMNLIV NRAAVQQFIY GRTGRATANV LNGPERFRSK NTSFAFDTDK AAQILEEAGW KKGGDGIRAK DGKKLKFVYQ TSINAPRQKT QAIVKQAAQK AGIDMELKSI PGSVFFSSDV ANPDTYPHFY ADMEMYTWNM AQADPGVFML QYVSWEAATK ENKWQGRNIC RMRNDEADAC YRAAQGELDA VKRAALFIKM NDIVASEYVM PLLHRAQVSA VGAKLQAPSS GWDNSLAFLF DWYKEA
|
| |