Gene Information |
Locus tag | MCA1752 |
Symbol | |
ID | 3102519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1868033 |
End bp | 1873798 |
Gene Length | 5766 bp |
Protein Length | 1921 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637170913 |
Product | hypothetical protein |
Protein accession | YP_114191 |
Protein GI | 53803904 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
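The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag MCA1752.
length = gene_length_bp(1868033, 1873798)
print(length)                     # 5766, matching "Gene Length"
print(protein_length_aa(length))  # 1921, matching "Protein Length"
```

Under these assumptions the reported fields agree with each other: 5766 bp is 1922 codons, and dropping the stop codon leaves 1921 residues.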
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGC GCGCAGTGTT CTGGGTCGTG ACGTTGTGCT GGATTCAGGC GGCCGGTGCC GCGCGGCTGG AGCTGTTTTC GCCCGAGGGA ACGGCCAAGG ACGTGCGCCA GGTGACGGCG CGATTCTCCG AACCGATGGT TGCGTTCGGC GATCCCCGGC TCGCCGACCC GTTCACGGTG GATTGCCCGG TCCGGGGGCG CGGACGCTGG GCCGAACCCC GCACCTGGGT CTACGACTTC GAAGCCGACC TGCCCGCCGG CCTTTCCTGC CGATTCCGAC CGAAGGCGCA GTTGAAGGCA TTGAGCGGCG CGCCGGTCGA TCTCGCCAGG GAATACCGCT TCGACACCGG CGGTCCGGTG GTGGTGGGTT CCTATCCCAG CGAAGGCAGC GAGAACGTCG ATGAAAACCA GGTCTTCCTG CTGGCCGCCG CCGCTCCGGC CACGCCGGAA AGCGTCACCG CCCATGCGCG CTGCCGGATC GAGGGCATGG AAGAGGCGAT CGGCGTCGAC GTCCTGACCG GCGAGCGACG GGATGCAGTG TTGGCCCAGC GCCGGAAACT GGGCTGGGGC TACACCGACC TGCTCTGGCC GGACGCCGAT TTCGACTCGC TGGAACCGGA GGAAATCAAG GCCGGCGAAG CACGGCTGGT GCTGGTGCAA TGCCGTCGGG CCATCCCGCC GGAGACGAAG GTCAGCATCT TCTGGGGCAG GGACATCGCC GCGCCCAGCG GCGTCGCGAC GACCGAAGAC CAGGAACTGA GCTTCAAGTC CCGCCCGAGC TTCACCGCCC GCTTCGGTTG CGAGCGGATC AATGCCGAGG CGGACTGCAT TCCGCTCCTG CCCATGTCCC TGCGCTTCGG CGCGCCCGTG CCCGCCGACC AGGCCGCGGC CGTGCGCCTG GTCGATGCCG CCGGCAAGGT CTATCCGGCT GACGGGCTCG ACCCGGGCCG CAAACCTTTC GTCGAGGAAC TGAGCTGGAA AGGCCCCTTC CCGGAAAAGT CCATGCTCCG CATCGAACTG CCCCCAGGGT TCGTCGACGA TGCCGGACGT CCGCTGGAAA ACGCCAGCCG TTATCCGCTG GAAGTCAAGA CCGACGAATA TCCTCCTTTG GCCAAATTTC CCGGCGAGTT CGGCATTCTG GAGCTGAAGG AGGGCGGCGT CCTGCCCGTC ACCGTGCGCC ACATCGAGAA CCCCATGACC GGCCAACGAC TGGCCTCGGG CGGCCCGGCC ATCCCCGGCA AGGTGAAGCG GGTGGCGTTG AGCGACCAGG ACATCGCCGC CTGGATGCGC AAGGTGAAGA AGGCCGCCGA ACGCCGGGGC GAATCAGTCC AGTTGCCCGA CGGCGGCACC GAATGGAAGG AACTCACCGG TACCGAATCG GTGTTCGCCG ATACTCCGGC GGAAACTTTC GAGCTGCCGC GTGCCGAAGC CCAGAAAGAG TTCGAGGTGC TCGGCATTCC CTTGGAAAAC GCGGGCTTTT ACGTGGTCGA ACTGGCCAGC CCACGGCTCG GCGCCGCTTT ATTGGGGGAG GAAAGGCCCC GCTACGTGGC GACCTCGGCG CTGGTCACCA ATCTGTCGGT ACACTTCAAA TGGGGCCGGG AATCCTCGCT GGTCTGGGTG ACGACCCTGG ATTCGGCCCG GCCGGTGCCG AACGCCGATG TCCGCATCAG CCGCTACTGC AAGGACGAAA CGTTGTGGCA GGGCCGGACG GATGCCAACG GCGTGGCGCT CATCCAGGGC CCGGCCCTGC CCAATCCCTC GGATTCCGGC GAGAGCTGCG ACTGGGGCGA CGGCCCCCTC 
ATGGTGAGCG CCCGGACCGA GGACGACATG AGCTTCACCT TGAGCAGCTG GACCAATGGC ATTTCGCCGC AGAACTTCGG CCTGGCGGTG GGCTTCTACG GCAACCCGGA AATCGTCCAC TCGGTGCTGG ACCGAAGCCT GTTCCGAGCC GGAGAGACCG TGTCCATGAA GCACTTCCAG CGCCTGCGCA CGTCCAACGG CTTCGGCCTG CCGGATACCC GACCCGCGAA AATAAAGGTC CGCCACACCG GCAGCGGCCA GGAATACGAA CTGCCGGCCG AGTTCGACGC GCGCGGAATC GCAGAAAATA CCTGGGCGAT TCCGGCCGAA GCCAAGCTCG GCGGTTACGA AATCGTATTC GTCTATGCCG ACGGCAGCGA ATCCGCCACC TCGGCCGAAT TCCGCGTCGA GCAGTTCCGA GTGCCGACCA TGCGCGCCGA CATCCAGCCG CAGTCCGATG CGCTGATCAA CGCCAGGGAA GCCACACTCG ACCTTCACGT CAGCTATCTT TCCGGCGGCG GCGCGGCCAA TGCGCCGGTC AAAGTCCGCA CCCTGGTCGA GCCGCGCGCG GTGTCATTCC CCGGTTATCC GGATTTCGAT TTCGAGGTGC AGGCCATCGA GGAAGGACTG CGCGACAGCG GTTACAGCGA GGAAGCCGAG AAAACCCCCG GGGCGAGCGG TCCCGCCCAG GTGCTGCCGC TGTCGCTGGA CAAGGGCGGC GCGGCGCGGG TGACGGTTCC CAACCTGCCG CGCCAGCACA CGCCGCAGGA CCTGGTGGCC GAACTCGAAT ACCAGGATGC CAACGGCGAA CTGTTGACCG TGGCCCGCCG TATCCCACTG TGGCCGGCCG GCCTCAGCCT GGGAATCAAG ACCGATGGCT GGGTGGCAAC CAGAGACCAA GTGCGGTTCC AGGTACTGGC GCTGGATCTG ACCGGGAAAC CCGCCGTCGG CAAAGAGGTC CGGGTCGACC TGTACTCGCG CACCACTTAT TCTCACCGCA AGCGCCTGAT CGGCGGCTTT TACGCCTATG ACGACAAGAC CGAAGTCAAA CGCCTCGGCA GCGAGTGCAA AGGCAAGACC AACGATCAGG GCCTCCTCCT CTGCGTCCTC GCTCCCGGCG TATCCGGCGA AATCCTGCTC CAGGCCACGG CCGACGACGG CACCGGCAAC CAGGCGCTGG GCACGGCTTC GATCTGGGTC GCCAGTGAGG ACGAATGGTG GTTCGAGAAC GGCCCCAGCG ACCGCATGGA CGTGATTGCC GAGAAGCGCG CCTACGAGCT GGGCGACAAG GCCCGCTTGC AGGTGCGCAT GCCGTTCCGC GAAGCCACGG CTCTGGTGAC GATCGAACGG GAAGGCATCG TCGACAGCTT CATCACCCCG CTGTCCGGCC GCTCTCCGGT GGTCGAAGTG CCCATCAAGG AAGCCTACGC GCCCAACGTG TTCGTCTCCG TGCTGGCCGT GCGGGGTCGG GTGGGGCCGG TGCAGACCTG GATCGCGGAC ATGGCCCGAA AGTTCAGGCT GCCGTGGCAG CCAGACGGCG GCAAGGCCAC GACGATGATC GACCTGAGCA AGCCCGCCTA CCGCATGGGG CTCGCCCAGA TCGACGTGGG CTGGGCACCC AACCGGCTCG ATGTGAAAGT GCAGGCCGAC CGGGACACCT ACAAGGTCCG GGAAACCGCC AAGGTGAAGG TGACGGTGCA GCGGGCCACG GGCGGCGCGC CGGCCGACGC CGAGATCGCC CTGGCGGCCG TCGACGAGGG GCTGCTCGAG ATCAAGCCCA ACGAAAGCTG GAACCTGCTG GACAAGATGA 
TGGGCCGGCG CGGCATCGAG GTGCTCACCT CCACCGCGCA GATGCAGGTC GTCGGCAAGC GCCACTATGG CCGCAAGGCC GTGCCGCATG GCGGCGGCGG CGGCCGCCAG GCGGCGCGCG AGCTGTTCGA CACCCTGCTG CTGTGGAAAG GCCGGGTGGC GCTGGACACC AGGGGCGAAG CCGAGATCGA GGTGCCGCTC AACGACTCGC TCACCGCCTT CCGGCTGGTG GCCATCGCCA ACGCGGGCAG CGGCCTGTTC GGCACCGGCA AGACCTCGAT CCGCACCACC CAGGACCTGA TGCTCTATTC CGGCCTGCCG CCGATGGTCC GCGAAGGCGA CCGCTTCCAC GCCATCTTCA ACGTGCGCAA CGCCACCGAC CGCAAGATCG CGGGCCGGCT CAGCGCCCGA TTGACGCCGA AAGGGACGGC GGCGGGCCAG GACCTTCCGC CCCGGCCCTT CGAACTCGAG GCCGGCACGG CGGGCGATTT CTCCTGGGAA GTCGACGTCC CGGTCGACGT CGCCGGGCTG GATTGGGAAA TCGCCGCCGC GGAAGACGGC GGGGCCGCCA CCGACCGGCT GAAAACCTCG CAGCAGGTGC AGCCCGCCGT GCCCGTCCGG GTGTTCCAGG CGACGCTGAC CCGCATCGAC AAGCCGCTCT CGCTCGCGGT GCAAAGGCCC GAGGACGCCG TGCCGGGCCG CGGCGGCGTG CGCGTCTCGC TGCGCGCCCG ACTCGGCGAT GGCCTCGGCG GGGTGATCGA ATACATGACG CACTACCCCT ACTCCTGCCT GGAGCAGAAG GTTTCCAAGG CCGTCGCGCT GCGCGACAAG CGCCTATGGA ACGAGATCGT CGCCGACCTG CCCGCGTACC AGGATGCCGA CGGCCTGTTC CGCTATTTCC CGGGTGACGA CACCCACGGC AGCGACGTGC TGACCGCCTA TGTCCTGGCG ATCGCGCAGG AAGCCGGCTG GGAACTGCCC GAGAATGCCC TGGGCCGGGC CAAGGAGGCG TTGAAAGGCT TCGTCGAGGG CCGTGTGCTG CGCGACTCCG CCCTGCCCAC CGCCGACCTC TCCATTCGCA AGCTGGCGGC CGTCGAGGCC CTGGCCCGTT ATGAAGAAGC GATGCCGGAG CTGCTGGACA GTATCCGCAT CGAGCCCAGG CTCTGGCCCA CCTCGGCGGT ATTGGACTGG CTGTCGGTGT TGCAGCGGGT CGAAGCCATC CCCGATCGCA GCGCCAAGCT CAAGGAAGCG GAACAAACCC TCCGTGCCCG CCTCAACTTC CAGGGCACTA CCCTGAACTT CTCCAGCGAG AAAAACGATG CCCTCTGGTG GTTGATGATC TCGCCCGACC TCAATGCGGT GCGGGCCGTC CTGGGCGTGA TGGAACTGCC GGGCTGGCGC GAAGACATTC CCCGGCTGGT GACCGGCGCC TTGGGCCGCC AGACCAAGGG CCGCTGGAAC ACCACGACGG CAAACGCCTG GGGCCGGCTG GCGATGGAGA GGTTCAGCGA GGCATTCGAA TCGGTGCCGG TCACCGGCTA CACCGAGGCG AATCTCGGCG CAGTGAAAAA AGGCATGCAA TGGACCGAGG AAACCCGGCG CAGCGCCCTG AACCTGCCCT GGCCCGACGG CCCCGGCACG CTGGAGGTGC GCCATCAGGG CACGGGCAAA CCCTGGGCCA TCGTCCAGAG CCGCGCCGCG ATCCCCCTGA AAGAGCCACT GTTCACCGGT TTCTCGATCA AACGAACGGT CACCCCGGTC GAACGGAAGC AGCCCGGCGC CTGGAACCGG GGTGACGTGG CACGCGTTAC 
GCTGGAACTC GACGCCCAGT CCGACATGAG CTGGGTGGTG GTGGACGACC CCATTCCCGC CGGCGCCAGC ATCCTCGGCG GCGGCCTGGG ACGCGACTCC GCCCTGCTGA CCCGCGGCGA ACGCCAGACC GGCTGGACCT GGCCCATCTA CGAGGAGCGC CGCTTCGACG CCTATCGCGC GTATTACGAC TATGTGCCAA AGGGAAAATG GAACATCGAA TATACCGTGC GCTACAACAA CCCAGGCAGC TTTGAACTGC CACCGACGCG CGTCGAGGCC ATGTATGCGC CGGAAATGTT CGGGGAGCTG CCGAATAAGG CTTTGGTGAT CGGGGTGACC CCATGA
|
Protein sequence | MSMRAVFWVV TLCWIQAAGA ARLELFSPEG TAKDVRQVTA RFSEPMVAFG DPRLADPFTV DCPVRGRGRW AEPRTWVYDF EADLPAGLSC RFRPKAQLKA LSGAPVDLAR EYRFDTGGPV VVGSYPSEGS ENVDENQVFL LAAAAPATPE SVTAHARCRI EGMEEAIGVD VLTGERRDAV LAQRRKLGWG YTDLLWPDAD FDSLEPEEIK AGEARLVLVQ CRRAIPPETK VSIFWGRDIA APSGVATTED QELSFKSRPS FTARFGCERI NAEADCIPLL PMSLRFGAPV PADQAAAVRL VDAAGKVYPA DGLDPGRKPF VEELSWKGPF PEKSMLRIEL PPGFVDDAGR PLENASRYPL EVKTDEYPPL AKFPGEFGIL ELKEGGVLPV TVRHIENPMT GQRLASGGPA IPGKVKRVAL SDQDIAAWMR KVKKAAERRG ESVQLPDGGT EWKELTGTES VFADTPAETF ELPRAEAQKE FEVLGIPLEN AGFYVVELAS PRLGAALLGE ERPRYVATSA LVTNLSVHFK WGRESSLVWV TTLDSARPVP NADVRISRYC KDETLWQGRT DANGVALIQG PALPNPSDSG ESCDWGDGPL MVSARTEDDM SFTLSSWTNG ISPQNFGLAV GFYGNPEIVH SVLDRSLFRA GETVSMKHFQ RLRTSNGFGL PDTRPAKIKV RHTGSGQEYE LPAEFDARGI AENTWAIPAE AKLGGYEIVF VYADGSESAT SAEFRVEQFR VPTMRADIQP QSDALINARE ATLDLHVSYL SGGGAANAPV KVRTLVEPRA VSFPGYPDFD FEVQAIEEGL RDSGYSEEAE KTPGASGPAQ VLPLSLDKGG AARVTVPNLP RQHTPQDLVA ELEYQDANGE LLTVARRIPL WPAGLSLGIK TDGWVATRDQ VRFQVLALDL TGKPAVGKEV RVDLYSRTTY SHRKRLIGGF YAYDDKTEVK RLGSECKGKT NDQGLLLCVL APGVSGEILL QATADDGTGN QALGTASIWV ASEDEWWFEN GPSDRMDVIA EKRAYELGDK ARLQVRMPFR EATALVTIER EGIVDSFITP LSGRSPVVEV PIKEAYAPNV FVSVLAVRGR VGPVQTWIAD MARKFRLPWQ PDGGKATTMI DLSKPAYRMG LAQIDVGWAP NRLDVKVQAD RDTYKVRETA KVKVTVQRAT GGAPADAEIA LAAVDEGLLE IKPNESWNLL DKMMGRRGIE VLTSTAQMQV VGKRHYGRKA VPHGGGGGRQ AARELFDTLL LWKGRVALDT RGEAEIEVPL NDSLTAFRLV AIANAGSGLF GTGKTSIRTT QDLMLYSGLP PMVREGDRFH AIFNVRNATD RKIAGRLSAR LTPKGTAAGQ DLPPRPFELE AGTAGDFSWE VDVPVDVAGL DWEIAAAEDG GAATDRLKTS QQVQPAVPVR VFQATLTRID KPLSLAVQRP EDAVPGRGGV RVSLRARLGD GLGGVIEYMT HYPYSCLEQK VSKAVALRDK RLWNEIVADL PAYQDADGLF RYFPGDDTHG SDVLTAYVLA IAQEAGWELP ENALGRAKEA LKGFVEGRVL RDSALPTADL SIRKLAAVEA LARYEEAMPE LLDSIRIEPR LWPTSAVLDW LSVLQRVEAI PDRSAKLKEA EQTLRARLNF QGTTLNFSSE KNDALWWLMI SPDLNAVRAV LGVMELPGWR EDIPRLVTGA LGRQTKGRWN TTTANAWGRL AMERFSEAFE SVPVTGYTEA NLGAVKKGMQ WTEETRRSAL NLPWPDGPGT LEVRHQGTGK PWAIVQSRAA IPLKEPLFTG FSIKRTVTPV ERKQPGAWNR 
GDVARVTLEL DAQSDMSWVV VDDPIPAGAS ILGGGLGRDS ALLTRGERQT GWTWPIYEER RFDAYRAYYD YVPKGKWNIE YTVRYNNPGS FELPPTRVEA MYAPEMFGEL PNKALVIGVT P
|
| |
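The sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be normalized before any computation. The sketch below shows one possible way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGAGCATGC GCGCAGTGTT\n")
print(fragment)              # ATGAGCATGCGCGCAGTGTT
print(gc_percent(fragment))  # 55.0 for this 20-base fragment only
```

Applying `gc_percent` to the full cleaned gene sequence would give the figure to compare against the reported GC content, which the record appears to round to a whole percent.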