Gene Msil_0506 details


Gene Information

Locus tag: Msil_0506
Symbol:
ID: 7091239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 562053
End bp: 563570
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 65%
IMG OID: 643463836
Product: anthranilate synthase component I
Protein accession: YP_002360840
Protein GI: 217976693
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
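
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 562053
end_bp = 563570
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one terminal stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # prints: 1518 505
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.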


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.640019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCTC CAGCTCATGC GGATTACGCC GCAGCCTATG CGGCCGGCCG CCCGTGCCTG 
GTTTCGGCGC GCCTGATCGC CGATCTCGAA ACGCCGGTTT CCGCCTTTCT GAAGCTGTCG
GCGGGGCGCG TCGGACGCAT CTTCCTTCTG GAATCCGTCG AGGGCGGAGC CGCGCGGGGC
CGATACTCCA TGATCGGCCT CGACCCCGAT ATCGTCTGGC GCGCTTTCGG CGACAGAGCC
GAAATCAACC GGTCCGCCTT GAGCGATCCC GACAGCTTTT CGCCTTGCGC GGAGCCCCCG
CTCGACTCGC TGCGCGCGCT GATCGCGGAA TCCCGCATCG ACGCCAGCGA GGAGCTGCCG
CCGATGGCGG CCGGCGTCTT TGGCTATCTC GGCTATGATA TGGCGCGGCA GATGGAGCAG
CTTGGCGCGC CAAAGCCAGA TCCCCTCGGC GCGCCAGACG CGATGATGAT GCGCCCGACC
GTGATGGTCG TGTTCGACTC CGTCCGCGAG GAGATTTTCG TGGTGACGCC GCTGCGCCCT
GCCCCCGGCG TTTCCTTCCT TGCCGCCTAT GACCATGCCC GCGAACGCAT CGACGCGGTG
AGCGTGACGC TGGAGGGGCC GCTTCAACAT GACTGGGTCG CGGCCGATCC CGCGCTTTCG
ACTGTCGCGC CGACCTCAAA TACCAGCGAG GCGCGGTTTC ACGAGATGGT CGCGCGCGCC
AAGGATTACG TCCGCGCCGG CGATATTTTT CAGGTCGTGC TGTCGCAACG CTTTTCGGCG
CCTTTTGGGC TGCATCCTTT CGCGCTCTAC CGCGCCCTGC GCCGGGTCAA TCCCTCGCCC
TTCCTCTGCT ACCTTGATTT CGGACCGTTC CAGATCGTCT GCTCGAGCCC TGAAATTCTG
GTCCGGCTGC GCGACGGCAA GGTCACGATC CGGCCCATCG CCGGCACCCG CTGGCGCGGC
AAGACCAAGG CCGAGGACGA TGCGTTGGCG CAGGACCTTC TTGGCGACGA GAAAGAATGC
GCCGAGCATC TGATGCTGCT CGATCTTGGC CGCAACGACG TCGGCCGCGT CGCCGAGATC
GGCTCCGTCA AGGTGACGGA GCAATTCGCC ATCGAACGCT ACAGCCATGT CATGCATATC
GTCTCGAACG TGGAGGGCCG TCTCTCGAAG ACGCATGACG CAATCGACGC CCTCAGCGCA
GGCTTTCCCG CGGGCACCGT TTCGGGGGCG CCGAAACTGC GCGCGATGGA GATCATCGAC
GAGCTCGAGA CGGACAAGCG TGGCGTTTAC GGCGGCTGCA TCGGCTATTT CGGCGCTTCG
GGCGAGATGG ACACCTGCAT CATCTTGCGC ACCGCCATGG TCAAGGATGG CGTCATGCAT
GTCCAGTCGG GCGCTGGCAT CGTCTATGAC AGCGATCCCG CCTATGAGCA GCGCGAATGC
GTCAACAAGG CGCAAGCTCT GTTCCGCGCC GCCGAGGAGG CCGTGCGTTT CGCGTCGCGG
GCCAAGCGCG GGCAATAG
 
Protein sequence
MDSPAHADYA AAYAAGRPCL VSARLIADLE TPVSAFLKLS AGRVGRIFLL ESVEGGAARG 
RYSMIGLDPD IVWRAFGDRA EINRSALSDP DSFSPCAEPP LDSLRALIAE SRIDASEELP
PMAAGVFGYL GYDMARQMEQ LGAPKPDPLG APDAMMMRPT VMVVFDSVRE EIFVVTPLRP
APGVSFLAAY DHARERIDAV SVTLEGPLQH DWVAADPALS TVAPTSNTSE ARFHEMVARA
KDYVRAGDIF QVVLSQRFSA PFGLHPFALY RALRRVNPSP FLCYLDFGPF QIVCSSPEIL
VRLRDGKVTI RPIAGTRWRG KTKAEDDALA QDLLGDEKEC AEHLMLLDLG RNDVGRVAEI
GSVKVTEQFA IERYSHVMHI VSNVEGRLSK THDAIDALSA GFPAGTVSGA PKLRAMEIID
ELETDKRGVY GGCIGYFGAS GEMDTCIILR TAMVKDGVMH VQSGAGIVYD SDPAYEQREC
VNKAQALFRA AEEAVRFASR AKRGQ
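
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code; it does not handle alternative start codons or ambiguous bases, and the function name `translate` is a hypothetical helper rather than anything provided by the database.

```python
from itertools import product

# Codon assignments in the conventional TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGATTCTC CAGCTCATGC GGATTACGCC GCAGCCTATG CGGCCGGCCG CCCGTGCCTG"
print(translate(first_line))  # prints: MDSPAHADYAAAYAAGRPCL
```

The printed residues match the first 20 residues of the protein sequence listed above.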