Gene Msil_1384 details

Gene Information

Locus tag: Msil_1384
Symbol:
ID: 7091722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1495648
End bp: 1496664
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 71%
IMG OID: 643464722
Product: urea amidolyase related protein
Protein accession: YP_002361711
Protein GI: 217977564
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
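
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `lengths_from_coords` is hypothetical and not part of the record.

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = lengths_from_coords(1495648, 1496664)
print(gene_length, protein_length)  # 1017 338
```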


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.669414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTAGAA CCCTGCGCAT CCTTGGCGCT GGCCCCGGCG CGACGATTCA GGACGCGGGC 
CGAACCGGCT ATATGCGCTA TGGCGTGACG CCGGCGGGGC CGATGGACCC GGCCGCCTTC
GCCACGGTCG CCGCGGCGCT GGAGAACGAG CCGCACGCGG CTGCAATCGA AATTTCGGTG
GGCGGTCTCA GCGTCGGTGC GGACGATGAA CCCCTCTGCG TCGCTTTCGC GGGCGGGGCT
TTCGATTGGC GGCGCAATGG CGACGCGCTG CCGGTCGCGG CGCGCATACG TCTGGCGCCG
GGGGAGACCC TGTCCGCGCG GGCGGGGGCG TTCGGGGCCT GGGCCTATCT CGCCGTCGCC
GGCGGCTTTG AGACGCCGCT CCGGCTTGGC TCCCGCGCGA CTCATTTGCG CTCGCAGATC
GGCGGGCTTG AGGGGCGCAT GCTGCGCGCG GGCGATGCGC TGCCTTGCGG CTCCGGCGCC
TTATTGACGG AGGCCGCGCT TGACGCGCCC TGGCTTGCCT CGTCGGTAGC GCCAATCCGC
GTGCTGCCTG GCCCGCAGGA CGATTATTTC GCGCCCGAGG CGCTCGCCGC CTTCTTTGGC
GAAGTCTTCA CCCTGACCCC GCGCGCCGAC CGGATGGCCT ATGCGTTCAG CGGTCCGCCG
ATCGACCATG CGCGCGGCTA TAATATCGTC TCCGATGGCG TCGCCCTTGG CGCCATTCAG
ATCGCCGGCG ATCGCGCGCC GCTGATCCTG ATGGCGGACC GCCAGCCGAC CGGCGGCTAT
CCCAAGCTAG GCCATGTCAT CGGAGCCGAT ATCGGCCGCC TCGCGCAATT GCGGCCGGGC
GAGCGCTGCC GCTTCAAAGC GGTCGGCCTG GGCGAGGCGC TGGCGGCGCA GGAGGAGTTG
CAGGCGCAGA TTTTGACGAC CGAGCAGCGC CTGCGTCCGC TTGTCCGGCG CGCGACAACC
GAGGCGCTGC TGCGCGCAAA TCTGATCGAT GGGGCGACCG ACGCGCTCGC CGATTGA
 
Protein sequence
MARTLRILGA GPGATIQDAG RTGYMRYGVT PAGPMDPAAF ATVAAALENE PHAAAIEISV 
GGLSVGADDE PLCVAFAGGA FDWRRNGDAL PVAARIRLAP GETLSARAGA FGAWAYLAVA
GGFETPLRLG SRATHLRSQI GGLEGRMLRA GDALPCGSGA LLTEAALDAP WLASSVAPIR
VLPGPQDDYF APEALAAFFG EVFTLTPRAD RMAYAFSGPP IDHARGYNIV SDGVALGAIQ
IAGDRAPLIL MADRQPTGGY PKLGHVIGAD IGRLAQLRPG ERCRFKAVGL GEALAAQEEL
QAQILTTEQR LRPLVRRATT EALLRANLID GATDALAD
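
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on short fragments of the sequences above. It is an illustration under stated assumptions rather than the database's own method: the helper names `translate` and `gc_content` are hypothetical, and `START_CODONS` lists only a subset of the initiators permitted by table 11.

```python
import itertools

# Codon table in the usual NCBI ordering (bases T, C, A, G).
# Table 11 assigns the same amino acids as the standard code;
# it differs in which codons may serve as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove the spacing used in the 10-base display blocks."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading an initiator codon at position 0 as M
    and stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence: GTG is read as M at the start.
print(translate("GTGGCTAGAA CCCTGCGCAT C"))  # MARTLRI
# Last 9 bases of the gene sequence: translation stops at TGA.
print(translate("GCCGATTGA"))  # AD
```

Passing the full gene sequence to `translate` and `gc_content` gives a way to compare against the Protein sequence and GC content fields of the record.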