Gene MCA2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2107 
Symbol 
ID3102385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2263027 
End bp2267397 
Gene Length4371 bp 
Protein Length1456 aa 
Translation table11 
GC content54% 
IMG OID637171258 
Productnonribosomal peptide synthetase, putative 
Protein accessionYP_114534 
Protein GI53803836 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily
[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.960336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCGAGAC CGGGGAAGTT CACTCGAAAC ACGCTCCTCC TCTACCGGCA GCACGAACCG 
GCTTTGCCTA CGAGATGGCA TGCTCAATGC TTACGCCTCA AAAAAACTCG ACCACCTGTT
GCCGGCAACT TTGACTGGGA CCACTGCATG AGTGCGATTA TTTCTTCTCT ATACAGAGTA
TCGGACAGAC AATCTTCATC ACCAAAAAAA CATCGAAGCG GCCCATTAGA GCTTGAGTAT
GAGCATTACC ATGAGCTTGC AGACCACACG CCGGAACACT TCGACTCGTG CAATAGACTG
GAAGATTTTT TTTACAGAAC GACCGACACG GACCCAGAAC AATTTTGCGC CATAGAAGAC
GGTATAATAT ATTCGTATCG CGATATAGAC ATCCGCTCAA ACAAACTTGC CTGCTATTTG
TTGACGAATG GAATTGTGGC AGGCATGCGC ATAGGATTAA TGCTTGATCC AAGCATAGAT
CTATATGTTT GCCTGCTGGC CATACTAAAA GCAGGGGCCG TATATGTTCC CATGGACCCA
TCCTTCCCGA TCGACCGTCT CACCTATATA GCCAATGACT CTGAAACGAA GACGATCATC
ACTGCCAACT CGATTCCACA AAGCATGGAA GATTTTCCGT GCGAGATCAT TCATATTGAC
AGAATAAAAG ATATCGTTGA TTCGTTACCA GAATCCAGGC CCAAGTCGGT TACGTCGGTC
GATTTCGAGT GCTATATCTG CTACACATCG GGATCCACCG GAGCACCAAA GGGCATAGCG
ATAACCCACT CCAACATATG CAATTTTATT CGAGCCGCAA CACCCATCTA TGGATTCAGG
CGCACCGACC TGGTCTATCA GGGCATGAGT ATCGCTTTTG ATTTCTCGGT GGAGGAAATC
TGGACATCGT TTGCCGTCGG CGCGACGCTG GTTCCCAGGC CGGCAGGAAT GGAGCGCTTC
GGAGAAGGGC TATGCGATTT CCTTAATCAG ATGGGGATAA CCGTCTTATG CTGCGTGCCC
ACATTGCTGG CCACCCTGAA TCGTGACATC CCGTCATTGC GACTCTTGAT GGTGGGAGGC
GAAGCATGCT CGCGGGCATT GGTTCAGAGA TGGTCCAAAC CAGGGCGGCG CATCCTGAAT
ACTTATGGAC CTACCGAGAC GACGGTGACG GCGACCTGGA CGGAGCTCAT GCCAGACAAA
CCTGTCACCA TTGGCAAGGC TCTGCCTACC TATTCAGTAT ATCTGCTGGA TGACCGGCTT
ATGCCGGTCA ATGGTTCGGA AACAGGAGAG ATCTGTATAG GCGGCCCCGG GGTCGCCAAA
GGCTATGTCA ACCGGCCAGA GCTGACTGCG GAACGTTTCC TGCCAGACCC TTTCCGTCCG
GCTGGTGAAC ACAGCCGTCT TTATCGTACC GGAGACTTGG GGCGCTATAC TGAAAATGGC
GAAATCGAAT TTCTTGGGCG ATGCGACACT CAGGTAAAAA TCCGTGGCTA TCGGATAGAA
CTCAGTGAAA TCGAGGAAGT CATCAGAGGC GAGACCGGCG TCAAGGACGT TGTTGTGACC
ACACTCGACG GCAATACCGA AGCTCCGGAT CTGGTGGCCT ACGTGATTCT TGCCGGATCG
GCTTCTCCCG CCAAGGCCGA TGCCGAGCGT CTCCATCGTG TATGCCGCGA CAGATTACCC
TCCTATATGG TGCCGGCATG GATCGAGTTT CTCTCCGATT TCCCCGTTCT GACGAGCGGC
AAGGTGGATC GCAAATCTCT CCCCCCACCA AAATCCTCGC GTATCGGCAC GGGTGGTAAC
ATCGTCTCTC CAGCCACCTC GAACGAAGCA CGGCTCGTCG AACTTTGGAA ATCAATACTG
GGCGTTGACG AGATTTCGGT GGAAGCCGAC TTCTTTACCG ATCTGGGCGG GCACTCCCTG
CTCGCTGCCT ATGCAATTGC CGAGCTGCGG CGAGATCCCG ATTACCAAGC ACTGTCTATG
GGAGATATTT ATAACTTCCC TACCATTCGT TTGTTGGCCC GACACTGCGA AACGTTGCAA
CGGGACACCT GCAAACGGCG GATTCCCGAT ACCACGGATC ACTACCGCAA GGCATCCAAT
CTGCAAGTAT ATATCTGCGG CCTGCTTCAG TGTACTTTCA TAGCAATCTA TCTGACCCTT
TTCCTTGCTC CTGCCGCATT CCTGCTCTAC GCCATTGATT GGGGGCGGAT ACCCGATTGG
ACCTCATCAG CGTTTTGGTC GGACATTTCC GGTGCGATAT CGCACTTCCG CATCGGAAGC
CTGAATTTAT ATCCAAATTC GACCGATTCG ACACACCTGG ATCTTCTTCG AATCATATCT
CGGTCCGTAT CCCGCATTAC GACTTGGTTC ACTTCCTGGA CGGCTGAAGA CGGGCTCTCG
CCAGCGTTCA TTTTCCTGTT GCCGTCACTG ATGCTTGTCA ATTCGCTGCT ATTGCCCATC
CTGGCAAAGA AGCTCATCGT CGGCAAACTG ACACCGGGTC GGTATCCATT GTGGGGATCC
ACATTCCTCA GGTGGTGGAT CGAGCGTAAG ACGACGCTGA TTGCGCCAAC CTATCTGCTC
GCCGGCACGC CTTTCCTGAA TACGTTCATG CGCGCATTGG GTGCTACGAT CGGAAAGAAT
GTCCACCTGA TATCGAGTAA CCTGAACAAC CCCGGTCTCG TTGTCATCGG CGAAGGTACC
ACGGTCGGCT ACGATACCGA GGTTCAACCC TTTCGGATAG CGGATGGCTG GCTTCATCTC
AGTCCGATCA CCATCGGCCG GGACGTCGGC ATCGGCCCGA GGTGTCTGCT CATGGGCGGC
TGCGTCCTCG AAGATGCTTC CCGCCTCGGC GCGCTGACGC TGGTACCCGC TGGCCAGCGG
ATCGAGGCGG GTCAATACTG GCAGGGCTCT CCCGCTCATC CGGTTTCAGA TCCGCCGCTG
AGCGACATAA AGCTCCGCGA GCTGGGAAAC GCGCCAGACA AATGGACCAG GGCGCATTTG
TGTGGATTTC TTGGGGGAAT TCTGATGGTT TATCATGCCC CTTTTCATGC CGCCCTTGCT
GGTATGCTCC TGGCCTCCGC TGCCTTAAAC CATTGGGGTT TCATCGGCGG ACTGGCCACT
GTAGTACCGG CCGGCCTGAT TTTCGTATTG ATGCTTTGCG GATTCATCGC ATTCATAAAG
CGGATTTGCC TCCCCCATCT GGAGCCGGGG ATTCATCCTT TGCGCTCGAT GCAGGGGGTC
TGCAACTGGC TGTCCGATAA ACTCATGGAA ACCAGCTTGC TCTATACGAA CTCACTCTAC
TCCACGCTTT ATACGGCACC ATGGCTACGT ATGTTGGGTG CCAAAGTCGG GGCACGGGCG
GAGATTTCCA CTGTCAGCGA CATTGACCCC GATCTGCTCA CTCTGGGAGA TGAGTGCTTC
GTTGCCGATA TGGCATCCAT CGGTCCGACC ACACACCTGC GCGGCTGGTT CGAGATCGGG
CCGACCACCA TCGGAAAACG CTCGTTCGTC GGAAACGCAG CGTTGGTACC TGCAAACTCT
CGCATGGAAG ACAATTCGCT GCTGGGGGTT CAATCGACGA CCGCTGCAGG TGAGATTCCC
CCAAACACGT CCTGGCTCGG ATCACCGGCG TTGTACCTGC CCAACCGTGA GGTAGTCCAA
GCCGGCGAAG CGCAAACCTA TCGCCCGCCC CTCCTTGCCT ATGCGGTAAG GCTGGTGATC
GAGTTTTTCA GGGTGACACT GCCCGAGGGC TTGAGCCTGT TCAGCGCCAG TCTGGTTTTC
AGCCTGCTCA GGCAGATACC CGATTCCGTA CCACTTTGGC AGAAGCTGAC GGTTTATCCG
ATGGCCATTT TCGGTACGGG TATAGGCCTC GTGGTCCTTG TCGCCATCAT CAAGTGGACC
TTGGTGGGCT CCTATCGTCC GCGCAAAGAA CCCAACTGGG CTCTGTTCGT CCGCCGTACC
GAACTGGTCA CGGCGCTGTA CGAGAACGTA TCGGTCAGCT TGGTGCTGGA CTGGCTCACG
GGCACACCAT GGCTAGCGCC TTTTCTCAGG ATACACGGAG TAAAAATCGG CAAGCGCGTG
TATTGTGACT CGACGTTCAT CACTGAATTC GACCTGCTCG AAATTGGAGA CGACTGCGCC
ATCGGGAAAG ATACATCATT GCAGACACAT TTATTTGAGG ACAGGGTAAT GAAAATGTCC
AAGGTCAAGA TAGAAAGTCA GTCGCAGATA GGATCGCGAT GTATCGTTCT TTACGATGCG
ATCGTTTCCA AAGGGAGCTA TATCGAGAAT CTTTCAATGG TGATGAAAGC GGAATTCATA
CCCGCACGAT CTCGCTGGAT AGGCATTCCA GCCTATCCCT ATCGTTCCTG A
 
Protein sequence
MPRPGKFTRN TLLLYRQHEP ALPTRWHAQC LRLKKTRPPV AGNFDWDHCM SAIISSLYRV 
SDRQSSSPKK HRSGPLELEY EHYHELADHT PEHFDSCNRL EDFFYRTTDT DPEQFCAIED
GIIYSYRDID IRSNKLACYL LTNGIVAGMR IGLMLDPSID LYVCLLAILK AGAVYVPMDP
SFPIDRLTYI ANDSETKTII TANSIPQSME DFPCEIIHID RIKDIVDSLP ESRPKSVTSV
DFECYICYTS GSTGAPKGIA ITHSNICNFI RAATPIYGFR RTDLVYQGMS IAFDFSVEEI
WTSFAVGATL VPRPAGMERF GEGLCDFLNQ MGITVLCCVP TLLATLNRDI PSLRLLMVGG
EACSRALVQR WSKPGRRILN TYGPTETTVT ATWTELMPDK PVTIGKALPT YSVYLLDDRL
MPVNGSETGE ICIGGPGVAK GYVNRPELTA ERFLPDPFRP AGEHSRLYRT GDLGRYTENG
EIEFLGRCDT QVKIRGYRIE LSEIEEVIRG ETGVKDVVVT TLDGNTEAPD LVAYVILAGS
ASPAKADAER LHRVCRDRLP SYMVPAWIEF LSDFPVLTSG KVDRKSLPPP KSSRIGTGGN
IVSPATSNEA RLVELWKSIL GVDEISVEAD FFTDLGGHSL LAAYAIAELR RDPDYQALSM
GDIYNFPTIR LLARHCETLQ RDTCKRRIPD TTDHYRKASN LQVYICGLLQ CTFIAIYLTL
FLAPAAFLLY AIDWGRIPDW TSSAFWSDIS GAISHFRIGS LNLYPNSTDS THLDLLRIIS
RSVSRITTWF TSWTAEDGLS PAFIFLLPSL MLVNSLLLPI LAKKLIVGKL TPGRYPLWGS
TFLRWWIERK TTLIAPTYLL AGTPFLNTFM RALGATIGKN VHLISSNLNN PGLVVIGEGT
TVGYDTEVQP FRIADGWLHL SPITIGRDVG IGPRCLLMGG CVLEDASRLG ALTLVPAGQR
IEAGQYWQGS PAHPVSDPPL SDIKLRELGN APDKWTRAHL CGFLGGILMV YHAPFHAALA
GMLLASAALN HWGFIGGLAT VVPAGLIFVL MLCGFIAFIK RICLPHLEPG IHPLRSMQGV
CNWLSDKLME TSLLYTNSLY STLYTAPWLR MLGAKVGARA EISTVSDIDP DLLTLGDECF
VADMASIGPT THLRGWFEIG PTTIGKRSFV GNAALVPANS RMEDNSLLGV QSTTAAGEIP
PNTSWLGSPA LYLPNREVVQ AGEAQTYRPP LLAYAVRLVI EFFRVTLPEG LSLFSASLVF
SLLRQIPDSV PLWQKLTVYP MAIFGTGIGL VVLVAIIKWT LVGSYRPRKE PNWALFVRRT
ELVTALYENV SVSLVLDWLT GTPWLAPFLR IHGVKIGKRV YCDSTFITEF DLLEIGDDCA
IGKDTSLQTH LFEDRVMKMS KVKIESQSQI GSRCIVLYDA IVSKGSYIEN LSMVMKAEFI
PARSRWIGIP AYPYRS