Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0149 |
Symbol | |
ID | 3102859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 155447 |
End bp | 156865 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637169373 |
Product | chain length determinant protein |
Protein accession | YP_112687 |
Protein GI | 53802563 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03017] chain length determinant protein EpsF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.22388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCA ACCGGCTCAT CCTGATCTTC CTCGCGCACT GGCGGGTCTT CGGTTGGACG CTGGCCGTGA CCGTACTGAC CACGCTGGTG GCCAGCGCCG CGATGTCGAA AACCTATACC GCGTCGACCA CCGTCGTGAT CGATTACAAG GGAGGCGATC CGCTGACCGG CTCGGCGTTT CCGTCCGATC TCATGGCCGG TTACCTCGCG ACCCAGGTGG ACATCATCGG CAGCCACGCC GTGGCGGCAC GCGTCGCGGA TGCGCTGAAA CTCTCGGAGG TTCCCGCTTA CCGCGACCGC TTCGAAAAAG TCGCCAGGAA GACGGGTTCT CGCGCGGTAT TCCGGGATTG GGCGGCGGAC AAGCTGCTGG AAGACCTGGA TGTGTCGCCA TCGCGGGAAA GCGACGTCAT CACGATCTCT TTCGCCGCCT CCGATCCCCT GTTCGCCGCG GATGTGGCGG ATGCCTTCGC CCAGACCAGC ATCCGTGCCA ACGTCGAACT CAAGTTGGAG CCGTTGAAGC GCCAGGCGGC CTGGTTCGAC GAGCAGGTCC TGGCGTTGCG GAGCGCACTG GAAAAAGCGC AGGCCGAACT TTCCAGCTAC CAGATCGCGC ATGGCGTGCT TGCCGCCGGC GACAAGCTGG ACGTGGAAAC CGCTCACTTG TCCGATCTGT CCGCCCAGCT CGCCGCGGCG AAAGGCCAGA TGTACGACGG TGAGGCGCGG GTTCAGCAGG TGAGGGCGGC GGCCCGTGGG GGGGTGGACG CGCTTCCCGA TCTGTTGCAG AACCCTGCCT TGCAGGCGTT GAAGGCCGAG CTGGCGCGCG CCGAGGCCCG GCTCGCCGAA GTCGGTTCGC GCTATGACTG GAACCATCCG CAGCGGCGCG CCGTGGCGGC CGAGGTCGCC AGTCTGCGGC AAAGGCTTGC CGCCGAGGTG GCGAACGCGA CGGGCGCCAT CGAGCGCGCC GCCGAACTCG CCCGCCAACG CGTGGCGGAT CTCGAACGCG CGGTGGTCGA ACAGAGGAAA CGCATACTCG CTCTGGGTGC CGAGCAGGAC AAGCTGAGCG TATTGAAACG AGAGGTGGAG AATGCCCAGC GGGTCTATGA TGCCGCGTTG CAGCGGGCGA GTCAGTTGCA GTTGGAGAGC CGGCTGGAGG GGACCAACAT CGTCGTCCTC TCGCCGGCCG TGCCCCCGCT CAAGCCCAGC AAGCCGAAGG TGGCACTGAA TCTGGCGCTG TCAGTCATCC TGGGCAGCGG GCTGGGGCTG GCGTTCGTCC TCCTGTTCGA GTTCAAGGAC AGGCGTATCC GTTCTGCGGA AGATGTCATC GACGAACTGG GTCTGCCCCT CCTGGCCGAA ATGCCTCCGG AAAGGACGGC GTGGCCGCGC TTCTTCCGGC TTGCCGCGCC GTCCGCACTC AAGGGATGA
|
Protein sequence | MSANRLILIF LAHWRVFGWT LAVTVLTTLV ASAAMSKTYT ASTTVVIDYK GGDPLTGSAF PSDLMAGYLA TQVDIIGSHA VAARVADALK LSEVPAYRDR FEKVARKTGS RAVFRDWAAD KLLEDLDVSP SRESDVITIS FAASDPLFAA DVADAFAQTS IRANVELKLE PLKRQAAWFD EQVLALRSAL EKAQAELSSY QIAHGVLAAG DKLDVETAHL SDLSAQLAAA KGQMYDGEAR VQQVRAAARG GVDALPDLLQ NPALQALKAE LARAEARLAE VGSRYDWNHP QRRAVAAEVA SLRQRLAAEV ANATGAIERA AELARQRVAD LERAVVEQRK RILALGAEQD KLSVLKREVE NAQRVYDAAL QRASQLQLES RLEGTNIVVL SPAVPPLKPS KPKVALNLAL SVILGSGLGL AFVLLFEFKD RRIRSAEDVI DELGLPLLAE MPPERTAWPR FFRLAAPSAL KG
|
| |