Gene MCA1752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1752 
Symbol 
ID3102519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1868033 
End bp1873798 
Gene Length5766 bp 
Protein Length1921 aa 
Translation table11 
GC content67% 
IMG OID637170913 
Producthypothetical protein 
Protein accessionYP_114191 
Protein GI53803904 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGC GCGCAGTGTT CTGGGTCGTG ACGTTGTGCT GGATTCAGGC GGCCGGTGCC 
GCGCGGCTGG AGCTGTTTTC GCCCGAGGGA ACGGCCAAGG ACGTGCGCCA GGTGACGGCG
CGATTCTCCG AACCGATGGT TGCGTTCGGC GATCCCCGGC TCGCCGACCC GTTCACGGTG
GATTGCCCGG TCCGGGGGCG CGGACGCTGG GCCGAACCCC GCACCTGGGT CTACGACTTC
GAAGCCGACC TGCCCGCCGG CCTTTCCTGC CGATTCCGAC CGAAGGCGCA GTTGAAGGCA
TTGAGCGGCG CGCCGGTCGA TCTCGCCAGG GAATACCGCT TCGACACCGG CGGTCCGGTG
GTGGTGGGTT CCTATCCCAG CGAAGGCAGC GAGAACGTCG ATGAAAACCA GGTCTTCCTG
CTGGCCGCCG CCGCTCCGGC CACGCCGGAA AGCGTCACCG CCCATGCGCG CTGCCGGATC
GAGGGCATGG AAGAGGCGAT CGGCGTCGAC GTCCTGACCG GCGAGCGACG GGATGCAGTG
TTGGCCCAGC GCCGGAAACT GGGCTGGGGC TACACCGACC TGCTCTGGCC GGACGCCGAT
TTCGACTCGC TGGAACCGGA GGAAATCAAG GCCGGCGAAG CACGGCTGGT GCTGGTGCAA
TGCCGTCGGG CCATCCCGCC GGAGACGAAG GTCAGCATCT TCTGGGGCAG GGACATCGCC
GCGCCCAGCG GCGTCGCGAC GACCGAAGAC CAGGAACTGA GCTTCAAGTC CCGCCCGAGC
TTCACCGCCC GCTTCGGTTG CGAGCGGATC AATGCCGAGG CGGACTGCAT TCCGCTCCTG
CCCATGTCCC TGCGCTTCGG CGCGCCCGTG CCCGCCGACC AGGCCGCGGC CGTGCGCCTG
GTCGATGCCG CCGGCAAGGT CTATCCGGCT GACGGGCTCG ACCCGGGCCG CAAACCTTTC
GTCGAGGAAC TGAGCTGGAA AGGCCCCTTC CCGGAAAAGT CCATGCTCCG CATCGAACTG
CCCCCAGGGT TCGTCGACGA TGCCGGACGT CCGCTGGAAA ACGCCAGCCG TTATCCGCTG
GAAGTCAAGA CCGACGAATA TCCTCCTTTG GCCAAATTTC CCGGCGAGTT CGGCATTCTG
GAGCTGAAGG AGGGCGGCGT CCTGCCCGTC ACCGTGCGCC ACATCGAGAA CCCCATGACC
GGCCAACGAC TGGCCTCGGG CGGCCCGGCC ATCCCCGGCA AGGTGAAGCG GGTGGCGTTG
AGCGACCAGG ACATCGCCGC CTGGATGCGC AAGGTGAAGA AGGCCGCCGA ACGCCGGGGC
GAATCAGTCC AGTTGCCCGA CGGCGGCACC GAATGGAAGG AACTCACCGG TACCGAATCG
GTGTTCGCCG ATACTCCGGC GGAAACTTTC GAGCTGCCGC GTGCCGAAGC CCAGAAAGAG
TTCGAGGTGC TCGGCATTCC CTTGGAAAAC GCGGGCTTTT ACGTGGTCGA ACTGGCCAGC
CCACGGCTCG GCGCCGCTTT ATTGGGGGAG GAAAGGCCCC GCTACGTGGC GACCTCGGCG
CTGGTCACCA ATCTGTCGGT ACACTTCAAA TGGGGCCGGG AATCCTCGCT GGTCTGGGTG
ACGACCCTGG ATTCGGCCCG GCCGGTGCCG AACGCCGATG TCCGCATCAG CCGCTACTGC
AAGGACGAAA CGTTGTGGCA GGGCCGGACG GATGCCAACG GCGTGGCGCT CATCCAGGGC
CCGGCCCTGC CCAATCCCTC GGATTCCGGC GAGAGCTGCG ACTGGGGCGA CGGCCCCCTC
ATGGTGAGCG CCCGGACCGA GGACGACATG AGCTTCACCT TGAGCAGCTG GACCAATGGC
ATTTCGCCGC AGAACTTCGG CCTGGCGGTG GGCTTCTACG GCAACCCGGA AATCGTCCAC
TCGGTGCTGG ACCGAAGCCT GTTCCGAGCC GGAGAGACCG TGTCCATGAA GCACTTCCAG
CGCCTGCGCA CGTCCAACGG CTTCGGCCTG CCGGATACCC GACCCGCGAA AATAAAGGTC
CGCCACACCG GCAGCGGCCA GGAATACGAA CTGCCGGCCG AGTTCGACGC GCGCGGAATC
GCAGAAAATA CCTGGGCGAT TCCGGCCGAA GCCAAGCTCG GCGGTTACGA AATCGTATTC
GTCTATGCCG ACGGCAGCGA ATCCGCCACC TCGGCCGAAT TCCGCGTCGA GCAGTTCCGA
GTGCCGACCA TGCGCGCCGA CATCCAGCCG CAGTCCGATG CGCTGATCAA CGCCAGGGAA
GCCACACTCG ACCTTCACGT CAGCTATCTT TCCGGCGGCG GCGCGGCCAA TGCGCCGGTC
AAAGTCCGCA CCCTGGTCGA GCCGCGCGCG GTGTCATTCC CCGGTTATCC GGATTTCGAT
TTCGAGGTGC AGGCCATCGA GGAAGGACTG CGCGACAGCG GTTACAGCGA GGAAGCCGAG
AAAACCCCCG GGGCGAGCGG TCCCGCCCAG GTGCTGCCGC TGTCGCTGGA CAAGGGCGGC
GCGGCGCGGG TGACGGTTCC CAACCTGCCG CGCCAGCACA CGCCGCAGGA CCTGGTGGCC
GAACTCGAAT ACCAGGATGC CAACGGCGAA CTGTTGACCG TGGCCCGCCG TATCCCACTG
TGGCCGGCCG GCCTCAGCCT GGGAATCAAG ACCGATGGCT GGGTGGCAAC CAGAGACCAA
GTGCGGTTCC AGGTACTGGC GCTGGATCTG ACCGGGAAAC CCGCCGTCGG CAAAGAGGTC
CGGGTCGACC TGTACTCGCG CACCACTTAT TCTCACCGCA AGCGCCTGAT CGGCGGCTTT
TACGCCTATG ACGACAAGAC CGAAGTCAAA CGCCTCGGCA GCGAGTGCAA AGGCAAGACC
AACGATCAGG GCCTCCTCCT CTGCGTCCTC GCTCCCGGCG TATCCGGCGA AATCCTGCTC
CAGGCCACGG CCGACGACGG CACCGGCAAC CAGGCGCTGG GCACGGCTTC GATCTGGGTC
GCCAGTGAGG ACGAATGGTG GTTCGAGAAC GGCCCCAGCG ACCGCATGGA CGTGATTGCC
GAGAAGCGCG CCTACGAGCT GGGCGACAAG GCCCGCTTGC AGGTGCGCAT GCCGTTCCGC
GAAGCCACGG CTCTGGTGAC GATCGAACGG GAAGGCATCG TCGACAGCTT CATCACCCCG
CTGTCCGGCC GCTCTCCGGT GGTCGAAGTG CCCATCAAGG AAGCCTACGC GCCCAACGTG
TTCGTCTCCG TGCTGGCCGT GCGGGGTCGG GTGGGGCCGG TGCAGACCTG GATCGCGGAC
ATGGCCCGAA AGTTCAGGCT GCCGTGGCAG CCAGACGGCG GCAAGGCCAC GACGATGATC
GACCTGAGCA AGCCCGCCTA CCGCATGGGG CTCGCCCAGA TCGACGTGGG CTGGGCACCC
AACCGGCTCG ATGTGAAAGT GCAGGCCGAC CGGGACACCT ACAAGGTCCG GGAAACCGCC
AAGGTGAAGG TGACGGTGCA GCGGGCCACG GGCGGCGCGC CGGCCGACGC CGAGATCGCC
CTGGCGGCCG TCGACGAGGG GCTGCTCGAG ATCAAGCCCA ACGAAAGCTG GAACCTGCTG
GACAAGATGA TGGGCCGGCG CGGCATCGAG GTGCTCACCT CCACCGCGCA GATGCAGGTC
GTCGGCAAGC GCCACTATGG CCGCAAGGCC GTGCCGCATG GCGGCGGCGG CGGCCGCCAG
GCGGCGCGCG AGCTGTTCGA CACCCTGCTG CTGTGGAAAG GCCGGGTGGC GCTGGACACC
AGGGGCGAAG CCGAGATCGA GGTGCCGCTC AACGACTCGC TCACCGCCTT CCGGCTGGTG
GCCATCGCCA ACGCGGGCAG CGGCCTGTTC GGCACCGGCA AGACCTCGAT CCGCACCACC
CAGGACCTGA TGCTCTATTC CGGCCTGCCG CCGATGGTCC GCGAAGGCGA CCGCTTCCAC
GCCATCTTCA ACGTGCGCAA CGCCACCGAC CGCAAGATCG CGGGCCGGCT CAGCGCCCGA
TTGACGCCGA AAGGGACGGC GGCGGGCCAG GACCTTCCGC CCCGGCCCTT CGAACTCGAG
GCCGGCACGG CGGGCGATTT CTCCTGGGAA GTCGACGTCC CGGTCGACGT CGCCGGGCTG
GATTGGGAAA TCGCCGCCGC GGAAGACGGC GGGGCCGCCA CCGACCGGCT GAAAACCTCG
CAGCAGGTGC AGCCCGCCGT GCCCGTCCGG GTGTTCCAGG CGACGCTGAC CCGCATCGAC
AAGCCGCTCT CGCTCGCGGT GCAAAGGCCC GAGGACGCCG TGCCGGGCCG CGGCGGCGTG
CGCGTCTCGC TGCGCGCCCG ACTCGGCGAT GGCCTCGGCG GGGTGATCGA ATACATGACG
CACTACCCCT ACTCCTGCCT GGAGCAGAAG GTTTCCAAGG CCGTCGCGCT GCGCGACAAG
CGCCTATGGA ACGAGATCGT CGCCGACCTG CCCGCGTACC AGGATGCCGA CGGCCTGTTC
CGCTATTTCC CGGGTGACGA CACCCACGGC AGCGACGTGC TGACCGCCTA TGTCCTGGCG
ATCGCGCAGG AAGCCGGCTG GGAACTGCCC GAGAATGCCC TGGGCCGGGC CAAGGAGGCG
TTGAAAGGCT TCGTCGAGGG CCGTGTGCTG CGCGACTCCG CCCTGCCCAC CGCCGACCTC
TCCATTCGCA AGCTGGCGGC CGTCGAGGCC CTGGCCCGTT ATGAAGAAGC GATGCCGGAG
CTGCTGGACA GTATCCGCAT CGAGCCCAGG CTCTGGCCCA CCTCGGCGGT ATTGGACTGG
CTGTCGGTGT TGCAGCGGGT CGAAGCCATC CCCGATCGCA GCGCCAAGCT CAAGGAAGCG
GAACAAACCC TCCGTGCCCG CCTCAACTTC CAGGGCACTA CCCTGAACTT CTCCAGCGAG
AAAAACGATG CCCTCTGGTG GTTGATGATC TCGCCCGACC TCAATGCGGT GCGGGCCGTC
CTGGGCGTGA TGGAACTGCC GGGCTGGCGC GAAGACATTC CCCGGCTGGT GACCGGCGCC
TTGGGCCGCC AGACCAAGGG CCGCTGGAAC ACCACGACGG CAAACGCCTG GGGCCGGCTG
GCGATGGAGA GGTTCAGCGA GGCATTCGAA TCGGTGCCGG TCACCGGCTA CACCGAGGCG
AATCTCGGCG CAGTGAAAAA AGGCATGCAA TGGACCGAGG AAACCCGGCG CAGCGCCCTG
AACCTGCCCT GGCCCGACGG CCCCGGCACG CTGGAGGTGC GCCATCAGGG CACGGGCAAA
CCCTGGGCCA TCGTCCAGAG CCGCGCCGCG ATCCCCCTGA AAGAGCCACT GTTCACCGGT
TTCTCGATCA AACGAACGGT CACCCCGGTC GAACGGAAGC AGCCCGGCGC CTGGAACCGG
GGTGACGTGG CACGCGTTAC GCTGGAACTC GACGCCCAGT CCGACATGAG CTGGGTGGTG
GTGGACGACC CCATTCCCGC CGGCGCCAGC ATCCTCGGCG GCGGCCTGGG ACGCGACTCC
GCCCTGCTGA CCCGCGGCGA ACGCCAGACC GGCTGGACCT GGCCCATCTA CGAGGAGCGC
CGCTTCGACG CCTATCGCGC GTATTACGAC TATGTGCCAA AGGGAAAATG GAACATCGAA
TATACCGTGC GCTACAACAA CCCAGGCAGC TTTGAACTGC CACCGACGCG CGTCGAGGCC
ATGTATGCGC CGGAAATGTT CGGGGAGCTG CCGAATAAGG CTTTGGTGAT CGGGGTGACC
CCATGA
 
Protein sequence
MSMRAVFWVV TLCWIQAAGA ARLELFSPEG TAKDVRQVTA RFSEPMVAFG DPRLADPFTV 
DCPVRGRGRW AEPRTWVYDF EADLPAGLSC RFRPKAQLKA LSGAPVDLAR EYRFDTGGPV
VVGSYPSEGS ENVDENQVFL LAAAAPATPE SVTAHARCRI EGMEEAIGVD VLTGERRDAV
LAQRRKLGWG YTDLLWPDAD FDSLEPEEIK AGEARLVLVQ CRRAIPPETK VSIFWGRDIA
APSGVATTED QELSFKSRPS FTARFGCERI NAEADCIPLL PMSLRFGAPV PADQAAAVRL
VDAAGKVYPA DGLDPGRKPF VEELSWKGPF PEKSMLRIEL PPGFVDDAGR PLENASRYPL
EVKTDEYPPL AKFPGEFGIL ELKEGGVLPV TVRHIENPMT GQRLASGGPA IPGKVKRVAL
SDQDIAAWMR KVKKAAERRG ESVQLPDGGT EWKELTGTES VFADTPAETF ELPRAEAQKE
FEVLGIPLEN AGFYVVELAS PRLGAALLGE ERPRYVATSA LVTNLSVHFK WGRESSLVWV
TTLDSARPVP NADVRISRYC KDETLWQGRT DANGVALIQG PALPNPSDSG ESCDWGDGPL
MVSARTEDDM SFTLSSWTNG ISPQNFGLAV GFYGNPEIVH SVLDRSLFRA GETVSMKHFQ
RLRTSNGFGL PDTRPAKIKV RHTGSGQEYE LPAEFDARGI AENTWAIPAE AKLGGYEIVF
VYADGSESAT SAEFRVEQFR VPTMRADIQP QSDALINARE ATLDLHVSYL SGGGAANAPV
KVRTLVEPRA VSFPGYPDFD FEVQAIEEGL RDSGYSEEAE KTPGASGPAQ VLPLSLDKGG
AARVTVPNLP RQHTPQDLVA ELEYQDANGE LLTVARRIPL WPAGLSLGIK TDGWVATRDQ
VRFQVLALDL TGKPAVGKEV RVDLYSRTTY SHRKRLIGGF YAYDDKTEVK RLGSECKGKT
NDQGLLLCVL APGVSGEILL QATADDGTGN QALGTASIWV ASEDEWWFEN GPSDRMDVIA
EKRAYELGDK ARLQVRMPFR EATALVTIER EGIVDSFITP LSGRSPVVEV PIKEAYAPNV
FVSVLAVRGR VGPVQTWIAD MARKFRLPWQ PDGGKATTMI DLSKPAYRMG LAQIDVGWAP
NRLDVKVQAD RDTYKVRETA KVKVTVQRAT GGAPADAEIA LAAVDEGLLE IKPNESWNLL
DKMMGRRGIE VLTSTAQMQV VGKRHYGRKA VPHGGGGGRQ AARELFDTLL LWKGRVALDT
RGEAEIEVPL NDSLTAFRLV AIANAGSGLF GTGKTSIRTT QDLMLYSGLP PMVREGDRFH
AIFNVRNATD RKIAGRLSAR LTPKGTAAGQ DLPPRPFELE AGTAGDFSWE VDVPVDVAGL
DWEIAAAEDG GAATDRLKTS QQVQPAVPVR VFQATLTRID KPLSLAVQRP EDAVPGRGGV
RVSLRARLGD GLGGVIEYMT HYPYSCLEQK VSKAVALRDK RLWNEIVADL PAYQDADGLF
RYFPGDDTHG SDVLTAYVLA IAQEAGWELP ENALGRAKEA LKGFVEGRVL RDSALPTADL
SIRKLAAVEA LARYEEAMPE LLDSIRIEPR LWPTSAVLDW LSVLQRVEAI PDRSAKLKEA
EQTLRARLNF QGTTLNFSSE KNDALWWLMI SPDLNAVRAV LGVMELPGWR EDIPRLVTGA
LGRQTKGRWN TTTANAWGRL AMERFSEAFE SVPVTGYTEA NLGAVKKGMQ WTEETRRSAL
NLPWPDGPGT LEVRHQGTGK PWAIVQSRAA IPLKEPLFTG FSIKRTVTPV ERKQPGAWNR
GDVARVTLEL DAQSDMSWVV VDDPIPAGAS ILGGGLGRDS ALLTRGERQT GWTWPIYEER
RFDAYRAYYD YVPKGKWNIE YTVRYNNPGS FELPPTRVEA MYAPEMFGEL PNKALVIGVT
P