Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2107 |
Symbol | |
ID | 3102385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2263027 |
End bp | 2267397 |
Gene Length | 4371 bp |
Protein Length | 1456 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637171258 |
Product | nonribosomal peptide synthetase, putative |
Protein accession | YP_114534 |
Protein GI | 53803836 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.960336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCGAGAC CGGGGAAGTT CACTCGAAAC ACGCTCCTCC TCTACCGGCA GCACGAACCG GCTTTGCCTA CGAGATGGCA TGCTCAATGC TTACGCCTCA AAAAAACTCG ACCACCTGTT GCCGGCAACT TTGACTGGGA CCACTGCATG AGTGCGATTA TTTCTTCTCT ATACAGAGTA TCGGACAGAC AATCTTCATC ACCAAAAAAA CATCGAAGCG GCCCATTAGA GCTTGAGTAT GAGCATTACC ATGAGCTTGC AGACCACACG CCGGAACACT TCGACTCGTG CAATAGACTG GAAGATTTTT TTTACAGAAC GACCGACACG GACCCAGAAC AATTTTGCGC CATAGAAGAC GGTATAATAT ATTCGTATCG CGATATAGAC ATCCGCTCAA ACAAACTTGC CTGCTATTTG TTGACGAATG GAATTGTGGC AGGCATGCGC ATAGGATTAA TGCTTGATCC AAGCATAGAT CTATATGTTT GCCTGCTGGC CATACTAAAA GCAGGGGCCG TATATGTTCC CATGGACCCA TCCTTCCCGA TCGACCGTCT CACCTATATA GCCAATGACT CTGAAACGAA GACGATCATC ACTGCCAACT CGATTCCACA AAGCATGGAA GATTTTCCGT GCGAGATCAT TCATATTGAC AGAATAAAAG ATATCGTTGA TTCGTTACCA GAATCCAGGC CCAAGTCGGT TACGTCGGTC GATTTCGAGT GCTATATCTG CTACACATCG GGATCCACCG GAGCACCAAA GGGCATAGCG ATAACCCACT CCAACATATG CAATTTTATT CGAGCCGCAA CACCCATCTA TGGATTCAGG CGCACCGACC TGGTCTATCA GGGCATGAGT ATCGCTTTTG ATTTCTCGGT GGAGGAAATC TGGACATCGT TTGCCGTCGG CGCGACGCTG GTTCCCAGGC CGGCAGGAAT GGAGCGCTTC GGAGAAGGGC TATGCGATTT CCTTAATCAG ATGGGGATAA CCGTCTTATG CTGCGTGCCC ACATTGCTGG CCACCCTGAA TCGTGACATC CCGTCATTGC GACTCTTGAT GGTGGGAGGC GAAGCATGCT CGCGGGCATT GGTTCAGAGA TGGTCCAAAC CAGGGCGGCG CATCCTGAAT ACTTATGGAC CTACCGAGAC GACGGTGACG GCGACCTGGA CGGAGCTCAT GCCAGACAAA CCTGTCACCA TTGGCAAGGC TCTGCCTACC TATTCAGTAT ATCTGCTGGA TGACCGGCTT ATGCCGGTCA ATGGTTCGGA AACAGGAGAG ATCTGTATAG GCGGCCCCGG GGTCGCCAAA GGCTATGTCA ACCGGCCAGA GCTGACTGCG GAACGTTTCC TGCCAGACCC TTTCCGTCCG GCTGGTGAAC ACAGCCGTCT TTATCGTACC GGAGACTTGG GGCGCTATAC TGAAAATGGC GAAATCGAAT TTCTTGGGCG ATGCGACACT CAGGTAAAAA TCCGTGGCTA TCGGATAGAA CTCAGTGAAA TCGAGGAAGT CATCAGAGGC GAGACCGGCG TCAAGGACGT TGTTGTGACC ACACTCGACG GCAATACCGA AGCTCCGGAT CTGGTGGCCT ACGTGATTCT TGCCGGATCG GCTTCTCCCG CCAAGGCCGA TGCCGAGCGT CTCCATCGTG TATGCCGCGA CAGATTACCC TCCTATATGG TGCCGGCATG GATCGAGTTT CTCTCCGATT TCCCCGTTCT GACGAGCGGC AAGGTGGATC GCAAATCTCT CCCCCCACCA AAATCCTCGC GTATCGGCAC GGGTGGTAAC ATCGTCTCTC CAGCCACCTC GAACGAAGCA CGGCTCGTCG AACTTTGGAA ATCAATACTG GGCGTTGACG AGATTTCGGT GGAAGCCGAC TTCTTTACCG ATCTGGGCGG GCACTCCCTG CTCGCTGCCT ATGCAATTGC CGAGCTGCGG CGAGATCCCG ATTACCAAGC ACTGTCTATG GGAGATATTT ATAACTTCCC TACCATTCGT TTGTTGGCCC GACACTGCGA AACGTTGCAA CGGGACACCT GCAAACGGCG GATTCCCGAT ACCACGGATC ACTACCGCAA GGCATCCAAT CTGCAAGTAT ATATCTGCGG CCTGCTTCAG TGTACTTTCA TAGCAATCTA TCTGACCCTT TTCCTTGCTC CTGCCGCATT CCTGCTCTAC GCCATTGATT GGGGGCGGAT ACCCGATTGG ACCTCATCAG CGTTTTGGTC GGACATTTCC GGTGCGATAT CGCACTTCCG CATCGGAAGC CTGAATTTAT ATCCAAATTC GACCGATTCG ACACACCTGG ATCTTCTTCG AATCATATCT CGGTCCGTAT CCCGCATTAC GACTTGGTTC ACTTCCTGGA CGGCTGAAGA CGGGCTCTCG CCAGCGTTCA TTTTCCTGTT GCCGTCACTG ATGCTTGTCA ATTCGCTGCT ATTGCCCATC CTGGCAAAGA AGCTCATCGT CGGCAAACTG ACACCGGGTC GGTATCCATT GTGGGGATCC ACATTCCTCA GGTGGTGGAT CGAGCGTAAG ACGACGCTGA TTGCGCCAAC CTATCTGCTC GCCGGCACGC CTTTCCTGAA TACGTTCATG CGCGCATTGG GTGCTACGAT CGGAAAGAAT GTCCACCTGA TATCGAGTAA CCTGAACAAC CCCGGTCTCG TTGTCATCGG CGAAGGTACC ACGGTCGGCT ACGATACCGA GGTTCAACCC TTTCGGATAG CGGATGGCTG GCTTCATCTC AGTCCGATCA CCATCGGCCG GGACGTCGGC ATCGGCCCGA GGTGTCTGCT CATGGGCGGC TGCGTCCTCG AAGATGCTTC CCGCCTCGGC GCGCTGACGC TGGTACCCGC TGGCCAGCGG ATCGAGGCGG GTCAATACTG GCAGGGCTCT CCCGCTCATC CGGTTTCAGA TCCGCCGCTG AGCGACATAA AGCTCCGCGA GCTGGGAAAC GCGCCAGACA AATGGACCAG GGCGCATTTG TGTGGATTTC TTGGGGGAAT TCTGATGGTT TATCATGCCC CTTTTCATGC CGCCCTTGCT GGTATGCTCC TGGCCTCCGC TGCCTTAAAC CATTGGGGTT TCATCGGCGG ACTGGCCACT GTAGTACCGG CCGGCCTGAT TTTCGTATTG ATGCTTTGCG GATTCATCGC ATTCATAAAG CGGATTTGCC TCCCCCATCT GGAGCCGGGG ATTCATCCTT TGCGCTCGAT GCAGGGGGTC TGCAACTGGC TGTCCGATAA ACTCATGGAA ACCAGCTTGC TCTATACGAA CTCACTCTAC TCCACGCTTT ATACGGCACC ATGGCTACGT ATGTTGGGTG CCAAAGTCGG GGCACGGGCG GAGATTTCCA CTGTCAGCGA CATTGACCCC GATCTGCTCA CTCTGGGAGA TGAGTGCTTC GTTGCCGATA TGGCATCCAT CGGTCCGACC ACACACCTGC GCGGCTGGTT CGAGATCGGG CCGACCACCA TCGGAAAACG CTCGTTCGTC GGAAACGCAG CGTTGGTACC TGCAAACTCT CGCATGGAAG ACAATTCGCT GCTGGGGGTT CAATCGACGA CCGCTGCAGG TGAGATTCCC CCAAACACGT CCTGGCTCGG ATCACCGGCG TTGTACCTGC CCAACCGTGA GGTAGTCCAA GCCGGCGAAG CGCAAACCTA TCGCCCGCCC CTCCTTGCCT ATGCGGTAAG GCTGGTGATC GAGTTTTTCA GGGTGACACT GCCCGAGGGC TTGAGCCTGT TCAGCGCCAG TCTGGTTTTC AGCCTGCTCA GGCAGATACC CGATTCCGTA CCACTTTGGC AGAAGCTGAC GGTTTATCCG ATGGCCATTT TCGGTACGGG TATAGGCCTC GTGGTCCTTG TCGCCATCAT CAAGTGGACC TTGGTGGGCT CCTATCGTCC GCGCAAAGAA CCCAACTGGG CTCTGTTCGT CCGCCGTACC GAACTGGTCA CGGCGCTGTA CGAGAACGTA TCGGTCAGCT TGGTGCTGGA CTGGCTCACG GGCACACCAT GGCTAGCGCC TTTTCTCAGG ATACACGGAG TAAAAATCGG CAAGCGCGTG TATTGTGACT CGACGTTCAT CACTGAATTC GACCTGCTCG AAATTGGAGA CGACTGCGCC ATCGGGAAAG ATACATCATT GCAGACACAT TTATTTGAGG ACAGGGTAAT GAAAATGTCC AAGGTCAAGA TAGAAAGTCA GTCGCAGATA GGATCGCGAT GTATCGTTCT TTACGATGCG ATCGTTTCCA AAGGGAGCTA TATCGAGAAT CTTTCAATGG TGATGAAAGC GGAATTCATA CCCGCACGAT CTCGCTGGAT AGGCATTCCA GCCTATCCCT ATCGTTCCTG A
|
Protein sequence | MPRPGKFTRN TLLLYRQHEP ALPTRWHAQC LRLKKTRPPV AGNFDWDHCM SAIISSLYRV SDRQSSSPKK HRSGPLELEY EHYHELADHT PEHFDSCNRL EDFFYRTTDT DPEQFCAIED GIIYSYRDID IRSNKLACYL LTNGIVAGMR IGLMLDPSID LYVCLLAILK AGAVYVPMDP SFPIDRLTYI ANDSETKTII TANSIPQSME DFPCEIIHID RIKDIVDSLP ESRPKSVTSV DFECYICYTS GSTGAPKGIA ITHSNICNFI RAATPIYGFR RTDLVYQGMS IAFDFSVEEI WTSFAVGATL VPRPAGMERF GEGLCDFLNQ MGITVLCCVP TLLATLNRDI PSLRLLMVGG EACSRALVQR WSKPGRRILN TYGPTETTVT ATWTELMPDK PVTIGKALPT YSVYLLDDRL MPVNGSETGE ICIGGPGVAK GYVNRPELTA ERFLPDPFRP AGEHSRLYRT GDLGRYTENG EIEFLGRCDT QVKIRGYRIE LSEIEEVIRG ETGVKDVVVT TLDGNTEAPD LVAYVILAGS ASPAKADAER LHRVCRDRLP SYMVPAWIEF LSDFPVLTSG KVDRKSLPPP KSSRIGTGGN IVSPATSNEA RLVELWKSIL GVDEISVEAD FFTDLGGHSL LAAYAIAELR RDPDYQALSM GDIYNFPTIR LLARHCETLQ RDTCKRRIPD TTDHYRKASN LQVYICGLLQ CTFIAIYLTL FLAPAAFLLY AIDWGRIPDW TSSAFWSDIS GAISHFRIGS LNLYPNSTDS THLDLLRIIS RSVSRITTWF TSWTAEDGLS PAFIFLLPSL MLVNSLLLPI LAKKLIVGKL TPGRYPLWGS TFLRWWIERK TTLIAPTYLL AGTPFLNTFM RALGATIGKN VHLISSNLNN PGLVVIGEGT TVGYDTEVQP FRIADGWLHL SPITIGRDVG IGPRCLLMGG CVLEDASRLG ALTLVPAGQR IEAGQYWQGS PAHPVSDPPL SDIKLRELGN APDKWTRAHL CGFLGGILMV YHAPFHAALA GMLLASAALN HWGFIGGLAT VVPAGLIFVL MLCGFIAFIK RICLPHLEPG IHPLRSMQGV CNWLSDKLME TSLLYTNSLY STLYTAPWLR MLGAKVGARA EISTVSDIDP DLLTLGDECF VADMASIGPT THLRGWFEIG PTTIGKRSFV GNAALVPANS RMEDNSLLGV QSTTAAGEIP PNTSWLGSPA LYLPNREVVQ AGEAQTYRPP LLAYAVRLVI EFFRVTLPEG LSLFSASLVF SLLRQIPDSV PLWQKLTVYP MAIFGTGIGL VVLVAIIKWT LVGSYRPRKE PNWALFVRRT ELVTALYENV SVSLVLDWLT GTPWLAPFLR IHGVKIGKRV YCDSTFITEF DLLEIGDDCA IGKDTSLQTH LFEDRVMKMS KVKIESQSQI GSRCIVLYDA IVSKGSYIEN LSMVMKAEFI PARSRWIGIP AYPYRS
|
| |