Gene Information |
Locus tag | M446_1431 |
Symbol | |
ID | 6132117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1574168 |
End bp | 1575166 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641641711 |
Product | extracellular solute-binding protein |
Protein accession | YP_001768381 |
Protein GI | 170739726 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
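
The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. The function names are hypothetical and not part of any database API. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1574168, 1575166)
print(length, protein_length_aa(length))  # prints: 999 332
```

Under those assumptions the result agrees with the listed Gene Length (999 bp) and Protein Length (332 aa). The strand does not affect the length calculation.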
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.633528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGCC GCAGCTTCCT GGCCGGCGCC ACCGCGCTCG GCCTCGCCGC GCCCGCCCGC GCCGAGACCG TCAAGGTCCG CATCGGCGTC GGCGGCAAGC CGCTCCTGTA CTACCTGCCG CTCACCATCG CGGAGAAGAA GGGCTACTTC GTCGAGGAGG GGGTGGAGGC CGAGATCAAC GATTTCGGCG GCGGCGCCCG CTCGCTCCAG GCGCTGATCG GCGGCTCGGT CGACGTGGTG ACGGGGGCCT ACGAGCACAC GATCCGCATG CAGGCCAAGG GACAGGACGT GCGGGCGGTG TGCGAACTCG GCCGCTACCC GGCGATCGTG ATCGCGGTGC GCAAGGACCT CGCCGGGACG GTGCGGGGCC CGGGCGATCT CAAGGGCCGC AAGATCGGCG TGACGGCGCC CGGCTCCTCG ACGGCGCTCG CGGTGCAGTA CGCGATGATC AAGGCGGGGC TGAAGGCCAC GGACGCGCCG CTCATCGGCG TCGGCGGCGG GGCGGGCGCC ATCGCGGCGA TGAAGAAGGG CGAGATCGAC GCGATCTCCC ACCTCGACCC GGTCATCGCC AAGCTGGAGG CGGACGGCGA CATCGCCGTG ATGATCGACA CCCGCACGGA GGCCGGGACC CGGGCGCTGT TCGGCGGGCC GAACCCGGCG GCGGTGGTCT ACACCAAGCA GGAGTGGATC GAGCGCCACG CCGCCGCGAC CCAGAAGGTG GTCAACGCCT TCGCGAAATC GCTGAAGTGG CTCGCCGCCG CCACTCCCGA GGAGGTCGCC GACACGGTGC CGCCCGCCTA CCATTTCGGC GACCGGCCGC TCTACGTGCA GGCGGTGAAG AACTCGCTCG AGAGCTATTC CCGCACCGGC ATCCCCTCGC AGGAAGGCAT GGCGAGCGTG CTCGACCTCG TGCGCACCCT CGATCCGGAG CTGCAGGGCG CCAAGATCGA CCTCGCGGCG ACGCTGGAGG ACCGCTTCAT CCGCAAGGCG ATGGGCTGA
|
Protein sequence | MDRRSFLAGA TALGLAAPAR AETVKVRIGV GGKPLLYYLP LTIAEKKGYF VEEGVEAEIN DFGGGARSLQ ALIGGSVDVV TGAYEHTIRM QAKGQDVRAV CELGRYPAIV IAVRKDLAGT VRGPGDLKGR KIGVTAPGSS TALAVQYAMI KAGLKATDAP LIGVGGGAGA IAAMKKGEID AISHLDPVIA KLEADGDIAV MIDTRTEAGT RALFGGPNPA AVVYTKQEWI ERHAAATQKV VNAFAKSLKW LAAATPEEVA DTVPPAYHFG DRPLYVQAVK NSLESYSRTG IPSQEGMASV LDLVRTLDPE LQGAKIDLAA TLEDRFIRKA MG
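
The gene and protein sequences in this record can be cross-checked with a short sketch like the one below. All names are hypothetical helpers, not part of any existing library. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA, so no reverse complement is applied despite the "-" strand) and that the 10-character groups are only display formatting. For codon-to-amino-acid assignments, translation table 11 is taken to match the standard code, which is what the lookup string encodes.

```python
from itertools import product

# Codons in TCAG order, paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups of the gene sequence in this record.
print(translate("ATGGACCGCC GCAGCTTCCT"))  # prints: MDRRSF
```

The printed residues match the start of the listed protein sequence (MDRRSF...). Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.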