Gene Msil_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2398 
Symbol 
ID7093950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2612146 
End bp2613519 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content66% 
IMG OID643465720 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_002362690 
Protein GI217978543 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCGCGA TCATGCATGT TATCGAAATT GATTTCACGG ACCCGGTCGA AACAGCCGCC 
GCGCTGAGCC GGCTGCCGTT CCTGACCTTT CTCGACAGCG CGATGCCCGA GGATGCGCTC
GGCCGTTACA GTTTTGTCGC CGCCGATCCC TTCGACCGCA TCGAGGGCAA GGCGGGCGAC
GCCAGCTGGG CGGCGCGCTT GAAGACGGCG CTCGCAAAGT TTCACACGCC GCTTGCGCCA
GGCTTGCCGC CGTTTCAGGG CGGAGCCGCG GGGCTTTTTT CCTACGATCT CGGCCGCAGC
CTCGAGCGGC TGCCGGAGCC CGCCGCCGAC GATCTCGCTT TTCCCGATCT GTCGCTCGGC
CTTTACGACG TCGTCGTCGC GTTCGACCTG ATCCAGCGGC GCGCCTGGAT CATCTCGACC
GGCCTGCCCG AAACAGAGCC CGCGGCGCGG CGCGAACGCG CGATCGCGCG GGCGCAAGAA
TTCGAGGCGC ACATCGCCAA AGGCGCGCCG CTCTCCAGCG GAAAAATCTC GCTCGCCGGC
TGGACGAGCA ATTTTACGCG CGCCTCCTAT GAACGAGCGG TCGCCGAGGT GATCGAACGC
ATCCTCGCAG GCGATATTTT TCAGGCCAAT CTGTCGCAGC GCTTCGAGGC GCCGACGCCG
CCGGATTTCG ATCATTTCGG CTTCTACCGG CGCCTCCGCC GGGTCAATCC CGCGCCTTTC
GCAGCCTATC TCGATCATCC CGGCTTCAAG ATCGCCTCCG CTTCGCCCGA GCGATTCCTG
CGCGTCGACG GCGAGTTCGT CGAGACCCGC CCGATCAAGG GCACGCGGCC GCGTTTCGCC
GATCCGCTGG TCGATATGCT GCAGGGAAAG GCCTTGAGCG AAAGCCGCAA GGATCGCGCC
GAGAACGTCA TGATCGTCGA TCTCCTGCGC AATGATCTGT CGAAGGTCTG CGCGCCGGGG
TCGGTCAAGG CGCCGCAGCT CTGCGCGCTC GAATCCTATG CAACCGTGCA TCATCTCGTC
TCGACCGTGA TCGGGCGGCT GGCCGAAGGG TTCGGGCCAG TCGATCTCCT CGCCGCCTCC
TTTCCCGGCG GCTCGATCAC GGGGGCGCCG AAGCTGCGCG CGATGGAGAT CATCACCGAG
CTCGAAGGCC ATGCGCGCGG CCCCTATTGC GGCGCCATCG GCTATATCGG CTTCAATGGC
ATGATGGACC TGAATATCGT CATCCGGACC GCGAGCTTTC GCGCCGGCGT CTGCGTCGTC
CAGGCGGGCG GGGGCATCGT CACGGCGTCG GACCCGGCCT CCGAATATGT CGAGACGCTG
GACAAGGCGC GGCGCATCTT CGAGGCCTTC GGCGCGAGCG AATTCGCGCA ATGA
 
Protein sequence
MGAIMHVIEI DFTDPVETAA ALSRLPFLTF LDSAMPEDAL GRYSFVAADP FDRIEGKAGD 
ASWAARLKTA LAKFHTPLAP GLPPFQGGAA GLFSYDLGRS LERLPEPAAD DLAFPDLSLG
LYDVVVAFDL IQRRAWIIST GLPETEPAAR RERAIARAQE FEAHIAKGAP LSSGKISLAG
WTSNFTRASY ERAVAEVIER ILAGDIFQAN LSQRFEAPTP PDFDHFGFYR RLRRVNPAPF
AAYLDHPGFK IASASPERFL RVDGEFVETR PIKGTRPRFA DPLVDMLQGK ALSESRKDRA
ENVMIVDLLR NDLSKVCAPG SVKAPQLCAL ESYATVHHLV STVIGRLAEG FGPVDLLAAS
FPGGSITGAP KLRAMEIITE LEGHARGPYC GAIGYIGFNG MMDLNIVIRT ASFRAGVCVV
QAGGGIVTAS DPASEYVETL DKARRIFEAF GASEFAQ