Gene Information |
Locus tag | M446_1690 |
Symbol | |
ID | 6131944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1888421 |
End bp | 1890019 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641641948 |
Product | extracellular solute-binding protein |
Protein accession | YP_001768617 |
Protein GI | 170739962 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.23662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.232631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCCA AGCCCCTGCT CGCCGCCCTC GCCGGTGCGG CCCTGCTCGC CGCCAGCCCG CTCCAGGCCA AGACCCTGGT CTACTGCTCG GAAGGCTCGC CGGAGAACTT CTACCCCGCG GTCAACACCA CGGGCACCAC CTTCGACGCC AACGCCCAGA TCTACAACAA CGTCGTCGAA TTCGAGCGCG GCGGGACCAA GGTCGTGCCC GGACTCGCCG AGAGCTGGGA CGTCTCGCCG GACGGCACCG TCTACACCTT CCACCTGCGC AAGGGCGTCA AGTGGCACAA CACCAACCGC GCCTTCAAGC CGAGCCGCGA CTTCAACGCG GACGACATCA TGTTCTCGAT CGAGCGGCAA TGGAAGGAGG ATCATCCCTT CTACAAGGTG ACCAGCTCGA ACCACTCCTA TTTCAACGAT ATGGGACTGG CGAAGCTGCT GAAATCGGTC GAGAAGGTCG ACGATCACAC GGTGCGCATC ACCCTCACCC GGCCCGAGGC GCCGTTCCTG TCCGACCTCG CCATGTCGTA CGCGGCGATC CAGTCGAAGG AATACGCCGA CGCCATGCTG AAGGCCGGCA CGCCTGAGAA GATCGATCAG TCGCCGATCG GCACCGGGCC GTTCTACCTC GTCCAGTACC AGAAGGACGC GATCATCCGC TACAAGGCCT TCCCCGAGTA CTGGGGCGGC AAGGCCAAGA TCGACGACCT CGTGTTCGCC ATCACGCCCG ACGCCTCGGT CCGCTGGGCC AAGCTGCAGA AGGGCGAGTG CCACGTGATG CCCTACCCGA ACCCGGCCGA CCTCGACGCC ATCCGCAAGG ACCCGGCCGT GCAGGTGCTG GAGCAGCCGG GGCTGAACAT CGGCTACCTG TCCTACAACG TCACCCGCAA GCCGTTCGAC GACGTGCGCG TGCGCAAGGC CTTCAACATG GCCATCAACA AGAAGGCCAT CATCGACGCG GTCTACCTCT CGACCGGGAT CGCGGCCGTG AACCCGATCC CGCCCTCGAT GTGGTCCTAC AACAAGGACG TCAAGGACGA CCCCTACGAT CCGGAGGCGG CGCGGAAGCT GCTCGCCGAG GCGGGCTTCC CGAACGGTCT CGAGACCGAC ATCTGGGCGA TGCCGGTGCA GCGGCCCTAC AACCCGAACG CCCGACGCAT CGCCGAACTC ATGCAGGCCG ACCTTGCCAA GGTCGGCGTG AAGGCGGAGA TCAAGTCCTA CGAGTGGGGC GAGTACCGCA AACGCATGCA GGCGGGCGAG CACCAGACCG GCATGCTCGG CTGGACCGGC GACAACGGCG ACCCGGACAA CTTCCTGCAC ACGCTGCTCG GCTGCGACGC CGCCAAGAAC AACGGCGGCA ACACGTCGAA GTGGTGCGAC AAGACCTTCG ACGACATCGT GGTCAGGGCC AAGACCCTGA CCGACCAGGC CGAGCGCACC AAGCTCTACG AGCAGGCGCA GGTCCGGTTC AAGGAGGAGG CGCCCTGGTT CACCATCGCG CACGCGGTGC AGCTGAAGCC GGTGCGCAAG GAGGTGATCG ACTTCAAGCT CTCGCCCTTC AGCCGCCACG TCTTCTACGG CGTGGACATC AAGGGCTGA
|
Protein sequence | MTAKPLLAAL AGAALLAASP LQAKTLVYCS EGSPENFYPA VNTTGTTFDA NAQIYNNVVE FERGGTKVVP GLAESWDVSP DGTVYTFHLR KGVKWHNTNR AFKPSRDFNA DDIMFSIERQ WKEDHPFYKV TSSNHSYFND MGLAKLLKSV EKVDDHTVRI TLTRPEAPFL SDLAMSYAAI QSKEYADAML KAGTPEKIDQ SPIGTGPFYL VQYQKDAIIR YKAFPEYWGG KAKIDDLVFA ITPDASVRWA KLQKGECHVM PYPNPADLDA IRKDPAVQVL EQPGLNIGYL SYNVTRKPFD DVRVRKAFNM AINKKAIIDA VYLSTGIAAV NPIPPSMWSY NKDVKDDPYD PEAARKLLAE AGFPNGLETD IWAMPVQRPY NPNARRIAEL MQADLAKVGV KAEIKSYEWG EYRKRMQAGE HQTGMLGWTG DNGDPDNFLH TLLGCDAAKN NGGNTSKWCD KTFDDIVVRA KTLTDQAERT KLYEQAQVRF KEEAPWFTIA HAVQLKPVRK EVIDFKLSPF SRHVFYGVDI KG
|
| |
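The coordinate and length fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the source database: it assumes that `Start bp` and `End bp` are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp: int, end_bp: int, protein_aa: int) -> dict:
    """Cross-check CDS coordinates against the reported protein length.

    Assumes 1-based inclusive coordinates and a CDS that includes
    its stop codon (assumptions, not stated by the record itself).
    """
    # Inclusive coordinates: both end points count as bases.
    gene_length = end_bp - start_bp + 1
    # A complete CDS should be a whole number of codons.
    codons, remainder = divmod(gene_length, 3)
    # The final codon is the stop codon and yields no residue.
    expected_protein_aa = codons - 1
    return {
        "gene_length_bp": gene_length,
        "in_frame": remainder == 0,
        "expected_protein_aa": expected_protein_aa,
        "consistent": remainder == 0 and expected_protein_aa == protein_aa,
    }


# Values taken from the record above (M446_1690).
result = check_cds_lengths(start_bp=1888421, end_bp=1890019, protein_aa=532)
print(result)
```

With the values from this record, the computed gene length is 1599 bp (533 codons), which matches the reported 532 aa protein once the stop codon is excluded.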