Gene MCA0149 details


Gene Information

Locus tag: MCA0149
Symbol:
ID: 3102859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 155447
End bp: 156865
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 66%
IMG OID: 637169373
Product: chain length determinant protein
Protein accession: YP_112687
Protein GI: 53802563
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03017] chain length determinant protein EpsF
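
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(155447, 156865)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both values match the Gene Length (1419 bp) and Protein Length (472 aa) fields.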


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.22388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCA ACCGGCTCAT CCTGATCTTC CTCGCGCACT GGCGGGTCTT CGGTTGGACG 
CTGGCCGTGA CCGTACTGAC CACGCTGGTG GCCAGCGCCG CGATGTCGAA AACCTATACC
GCGTCGACCA CCGTCGTGAT CGATTACAAG GGAGGCGATC CGCTGACCGG CTCGGCGTTT
CCGTCCGATC TCATGGCCGG TTACCTCGCG ACCCAGGTGG ACATCATCGG CAGCCACGCC
GTGGCGGCAC GCGTCGCGGA TGCGCTGAAA CTCTCGGAGG TTCCCGCTTA CCGCGACCGC
TTCGAAAAAG TCGCCAGGAA GACGGGTTCT CGCGCGGTAT TCCGGGATTG GGCGGCGGAC
AAGCTGCTGG AAGACCTGGA TGTGTCGCCA TCGCGGGAAA GCGACGTCAT CACGATCTCT
TTCGCCGCCT CCGATCCCCT GTTCGCCGCG GATGTGGCGG ATGCCTTCGC CCAGACCAGC
ATCCGTGCCA ACGTCGAACT CAAGTTGGAG CCGTTGAAGC GCCAGGCGGC CTGGTTCGAC
GAGCAGGTCC TGGCGTTGCG GAGCGCACTG GAAAAAGCGC AGGCCGAACT TTCCAGCTAC
CAGATCGCGC ATGGCGTGCT TGCCGCCGGC GACAAGCTGG ACGTGGAAAC CGCTCACTTG
TCCGATCTGT CCGCCCAGCT CGCCGCGGCG AAAGGCCAGA TGTACGACGG TGAGGCGCGG
GTTCAGCAGG TGAGGGCGGC GGCCCGTGGG GGGGTGGACG CGCTTCCCGA TCTGTTGCAG
AACCCTGCCT TGCAGGCGTT GAAGGCCGAG CTGGCGCGCG CCGAGGCCCG GCTCGCCGAA
GTCGGTTCGC GCTATGACTG GAACCATCCG CAGCGGCGCG CCGTGGCGGC CGAGGTCGCC
AGTCTGCGGC AAAGGCTTGC CGCCGAGGTG GCGAACGCGA CGGGCGCCAT CGAGCGCGCC
GCCGAACTCG CCCGCCAACG CGTGGCGGAT CTCGAACGCG CGGTGGTCGA ACAGAGGAAA
CGCATACTCG CTCTGGGTGC CGAGCAGGAC AAGCTGAGCG TATTGAAACG AGAGGTGGAG
AATGCCCAGC GGGTCTATGA TGCCGCGTTG CAGCGGGCGA GTCAGTTGCA GTTGGAGAGC
CGGCTGGAGG GGACCAACAT CGTCGTCCTC TCGCCGGCCG TGCCCCCGCT CAAGCCCAGC
AAGCCGAAGG TGGCACTGAA TCTGGCGCTG TCAGTCATCC TGGGCAGCGG GCTGGGGCTG
GCGTTCGTCC TCCTGTTCGA GTTCAAGGAC AGGCGTATCC GTTCTGCGGA AGATGTCATC
GACGAACTGG GTCTGCCCCT CCTGGCCGAA ATGCCTCCGG AAAGGACGGC GTGGCCGCGC
TTCTTCCGGC TTGCCGCGCC GTCCGCACTC AAGGGATGA
 
Protein sequence
MSANRLILIF LAHWRVFGWT LAVTVLTTLV ASAAMSKTYT ASTTVVIDYK GGDPLTGSAF 
PSDLMAGYLA TQVDIIGSHA VAARVADALK LSEVPAYRDR FEKVARKTGS RAVFRDWAAD
KLLEDLDVSP SRESDVITIS FAASDPLFAA DVADAFAQTS IRANVELKLE PLKRQAAWFD
EQVLALRSAL EKAQAELSSY QIAHGVLAAG DKLDVETAHL SDLSAQLAAA KGQMYDGEAR
VQQVRAAARG GVDALPDLLQ NPALQALKAE LARAEARLAE VGSRYDWNHP QRRAVAAEVA
SLRQRLAAEV ANATGAIERA AELARQRVAD LERAVVEQRK RILALGAEQD KLSVLKREVE
NAQRVYDAAL QRASQLQLES RLEGTNIVVL SPAVPPLKPS KPKVALNLAL SVILGSGLGL
AFVLLFEFKD RRIRSAEDVI DELGLPLLAE MPPERTAWPR FFRLAAPSAL KG
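
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the method the database itself uses: it builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG), and the helper names `clean`, `gc_content` and `translate` are hypothetical. Only the first 30 bases are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGAGCGCCA ACCGGCTCAT CCTGATCTTC"
print(translate(first_30))  # MSANRLILIF
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein Length fields.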