Gene Information |
Locus tag | Mext_3961 |
Symbol | |
ID | 5835618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4402392 |
End bp | 4403303 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369752 |
Product | extracellular solute-binding protein |
Protein accession | YP_001641403 |
Protein GI | 163853360 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
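The coordinate and length fields above are mutually consistent, and the check is easy to reproduce. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4402392
END_BP = 4403303


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (912 bp) and Protein Length (303 aa) fields.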
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.323027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCTGA GTCATGCGCT TTTCCTCGCC GCCCTCGCGA TTTCGGCGGC CACGGCACCG GTCGGCGCGC AGGAGTTGAG CGGAACCCTC AAGAAGGTGA AGGACACGGG CGCCATCACC ATCGGCTATC GCGACGCCTC GGTGCCGTTC TCCTATCTCG ACGGCAATCA GAAGCCGGTG GGCTACGCCT TCGAGATCTG TCTCAAGGTC GCCGACGCGG TCAAAGCGCA TCTGAAGCTC GACACGCTGG AGGTGCGGCT CAACCCCGTC ACCTCCGCGA CCCGCATCCC GCTGATCGCC AACGGGACGA TCGACCTCGA ATGCGGCTCG ACCACCAACA ACGCCGACCG GCAGCGGCAG GCGGCCTTCA CCAACACCCA CTTCCTCACC GCGACACGCT TCGTCGCCAA GCGGGACAAG GGGCTCGACA AGACCGACGA CCTCAAGGGC CGCACCGTGG TCTCGACCTC GGGCACCACC AACATCCGCC AGATCAACGA GATCAACACC GCCCGGGGCC TCGGCATGCG GATCCTGCCG GCCAAGGACC ACGCCGAGGC CTTCCTGATG GTCGAGACCG GCCGCGCCGA CGCCTTCGTG ACGGACGACG TGCTGCTCGC CGCCCTCGTC GCCGGATCGA AGACGCCCGA CGCCTACGCG ATCTCCTCGG AGGCGCAGTC GCGCCCCGAG CCCTACGGCA TCATGCTGCG CAAGGACGAC CCGGCCTTCA AGGCCGTGGT CGATGCCGCC ACCGCCGCCC TCTACAAGAG CCCGGAGGGG ACGGCGCTCT ACGAGAAGTG GTTCACGCAA GCCATCCCGC CGCGGGGCAT CAACCTGAAG CTCCCGATGA GCGAGGCGAT GCGGAAGGCC TTGGCCAACC CCAGCGACAG CCCTGACCCG GCGGCCTACT GA
|
Protein sequence | MHLSHALFLA ALAISAATAP VGAQELSGTL KKVKDTGAIT IGYRDASVPF SYLDGNQKPV GYAFEICLKV ADAVKAHLKL DTLEVRLNPV TSATRIPLIA NGTIDLECGS TTNNADRQRQ AAFTNTHFLT ATRFVAKRDK GLDKTDDLKG RTVVSTSGTT NIRQINEINT ARGLGMRILP AKDHAEAFLM VETGRADAFV TDDVLLAALV AGSKTPDAYA ISSEAQSRPE PYGIMLRKDD PAFKAVVDAA TAALYKSPEG TALYEKWFTQ AIPPRGINLK LPMSEAMRKA LANPSDSPDP AAY
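The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequences are written in space-separated blocks of ten characters, as in this record, and it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace that separates the blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(b) for b in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    prefix = "ATGCACCTGA GTCATGCGCT TTTCCTCGCC"
    print(translate(prefix))
    print(round(gc_content(prefix), 2))
```

Applied to the first three blocks of the gene sequence, `translate` returns the first block of the protein sequence, MHLSHALFLA. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field, depending on how that field was rounded.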