Gene Msil_1248 details


Gene Information

Locus tag: Msil_1248
Symbol:
ID: 7091176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1340386
End bp: 1342347
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 65%
IMG OID: 643464589
Product: amino acid adenylation domain protein
Protein accession: YP_002361579
Protein GI: 217977432
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
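
The start, end, gene length and protein length listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1340386, 1342347)
print(length)                  # 1962
print(protein_length(length))  # 653
```

Both values agree with the Gene Length and Protein Length entries in the record.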


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTCAC ACCATTCCGG ACAGCCGCAA CCGGCATCCC CCGCGCCAAG CTTCGATCGC 
CGCCGCACGA TTCATTCTTT CTTGTTGGAG GCGGCGGACA ACTACGCCGA TCGTCCGGCC
ATTCTATATG ATGGGCATAA GACCATTCCC TTCGCGGAGC TCGATCGCCG CTCGAACCGC
TTTGCCCGCT ATCTTGCCGC CAAAGGCGTT CTGCCCGGAT CAATCGTCGG CCTCTATCTG
CCGCGCTCGC CGGAGGCGAT CATCGCGATG ATCGGCGCGC TGAAGACGGG AGCCGCCTTC
GCTCCGCTGG ATCCGTCCTA TCCGGCCGAC CATCTCGCCT TCATCACGGC GGACGCCGCG
CCAGCGGTCG TTGTCTCCGC CGCCTCCATG ACTTCGAATG CGAGCGCAGC CAATCTCTGG
ACTGCGCCGA CGATCCTGAT CGATGCGGAA GCGGCGGCGA TCGCGCACGA GGACGATTCG
CCGCTGCCGG AGGCGGCAAG CGGCGAGAGT CCCGCCTATG TCATGTATAC GTCCGGCACG
ACCGGCCGCC CGAAAGGCGC CGTCGTCCCG CATCGCGCCG TGACGCGGCT CGCCTTCAAC
AGCTTTGCCG ATCTCGGGTC CCGCGACGTC GTTCTGCAAT TCGCGCCGCT CGCCTTCGAC
GCCTCGACCT TCGAGATCTG GAACGCGCTT CTGAATGGGG CCGCGATCGC CATCGTCGCA
GAGAATCATC CAAGCTTCGC CGAACTCGGC GCCGCGATCA AAGACTACGG CGTCACCGCG
GCCTGGCTGA CCGCCAGCCT GTTCCACGCC ATCGTCGATC GCCAGATCGA GATCTTGAAG
CCGCTGCGGC TGCTCCTCGC CGGAGGGGAC GTCCTATCGC CGCGCCATGT GCGCCGCGCG
CTCGACGCGC TGCCGGACTG CCGCCTCGTC AATGGCTATG GCCCGACCGA GAACACAACC
TTCACCTGCT GCTACGAAAT TCCCCGCGAT ATCGCGCCGG ACGCTGCGAT CCCGATCGGC
CGCCCGATCG ATCACACTGA CGTGTATGTG CTCGGCCCGG ACCTCTCCCG CGCCGGCGCC
GGCGAAGAAG GCGAGCTTTT CGCCGGCGGC GAGGGAGTCG CGCTCGGCTA TCTCAATCGT
CCGCAGCTGA CGGCCGAGAA ATTCCTCGCC GATCCCTTCT GCGGCGAGCC GGGGCGCCTC
ATGTATCGGA CCGGCGATCT CGTGCGCCAG CGCGCCGATG GAATCGTCGA ATTTATCGGC
CGCGTCGACC GCCAGGTGAA AATTCGCGGC AAGCGCGTCG AAGTCGATGA GGTCGAAGCC
CTGATCCGGC GCTTGCCGCA GGTCGCCGAC GCCACTGCGC TTGTGCGATC CCGCACAGAT
GGCGAGCGGC AAATCATTGC TTTTGTCACG GCTCAAGGCG GCGCGACGCT TGAACTCGGA
GAATTGCGTC ATAGCATGCT CGAGATTGCG CCTGATTATA TGGTTCCTGC GCATTTCATG
ATCCTTGATG AACTGCCGCG AACGCCAAAC GGCAAGGTCG ACCGCGCGGC TCTGCCCGAG
CTTGGCGGCG CAGACGAACA AACCGCGGCG CCGGCGATTT TGGCCGCCGA CGACGTGGAG
CGGCGGCTCG CCGCCCTCTG GAGCAAAATT CTGAAGGTCG GCTCGGTCGG CCTCGACTCC
AATTTTTTCG ATCTGGGAGG GGCGTCGCTG GACGTAATGG CTTTGCAGGA AGAAATACTG
AAGGAATTTC ACATCGACGC GCCGATGACC GCGCTGTTCG AATTCACGAC TGTCCGTTCG
CTCGCGGCCC ATCTCAAAAG CCGCGCGGTC CCGGCGGCGG AGGGCGCGAT CGAGGCGGCC
TCCGCGGACA GACAGGCGCT GGACGAACTC TCTCTCCGCA AGCAGCGTCA GGCGGAGGCT
CTAAAACGCG CCTCGCGCCG CCGCGCCGTC TCCGTCAACT GA
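
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before any computation such as a GC content check. The sketch below shows one way to do that; the helper names are hypothetical, and the short input is just the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence, upper-cased."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = "GTGAGTTCAC ACCATTCCGG"
print(clean_sequence(first_blocks))
print(gc_content(first_blocks))
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content entry in the record.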
 
Protein sequence
MSSHHSGQPQ PASPAPSFDR RRTIHSFLLE AADNYADRPA ILYDGHKTIP FAELDRRSNR 
FARYLAAKGV LPGSIVGLYL PRSPEAIIAM IGALKTGAAF APLDPSYPAD HLAFITADAA
PAVVVSAASM TSNASAANLW TAPTILIDAE AAAIAHEDDS PLPEAASGES PAYVMYTSGT
TGRPKGAVVP HRAVTRLAFN SFADLGSRDV VLQFAPLAFD ASTFEIWNAL LNGAAIAIVA
ENHPSFAELG AAIKDYGVTA AWLTASLFHA IVDRQIEILK PLRLLLAGGD VLSPRHVRRA
LDALPDCRLV NGYGPTENTT FTCCYEIPRD IAPDAAIPIG RPIDHTDVYV LGPDLSRAGA
GEEGELFAGG EGVALGYLNR PQLTAEKFLA DPFCGEPGRL MYRTGDLVRQ RADGIVEFIG
RVDRQVKIRG KRVEVDEVEA LIRRLPQVAD ATALVRSRTD GERQIIAFVT AQGGATLELG
ELRHSMLEIA PDYMVPAHFM ILDELPRTPN GKVDRAALPE LGGADEQTAA PAILAADDVE
RRLAALWSKI LKVGSVGLDS NFFDLGGASL DVMALQEEIL KEFHIDAPMT ALFEFTTVRS
LAAHLKSRAV PAAEGAIEAA SADRQALDEL SLRKQRQAEA LKRASRRRAV SVN
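
The protein sequence begins with M even though the gene sequence begins with GTG, which encodes valine at internal positions. Under translation table 11 (the table named in the record), GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a hand-built codon table; the function name is hypothetical, and the start-codon set is deliberately limited to three common bacterial starts, which is an assumption (table 11 lists more).

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: only these three start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(raw: str, from_start: bool = True) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and from_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence and its last 12 bases, taken from the record.
print(translate_cds("GTGAGTTCAC ACCATTCCGG ACAGCCGCAA"))  # MSSHHSGQPQ
print(translate_cds("TCCGTCAACT GA", from_start=False))    # SVN
```

The two outputs match the first ten and last three residues of the protein sequence shown above.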