Gene Msil_3786 details

Gene Information

Locus tag: Msil_3786
Symbol:
ID: 7090714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4145196
End bp: 4147202
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 65%
IMG OID: 643467071
Product: Carbamoyl-phosphate synthase L chain ATP-binding
Protein accession: YP_002364030
Protein GI: 217979883
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
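
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4145196, 4147202)
print(length)                  # 2007
print(protein_length(length))  # 668
```

Both values agree with the Gene Length and Protein Length fields listed in the record.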


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.34193
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTCGGAA AGATCCTGAT CGCGAACCGC GGCGAGATCG CCTGTCGCAT CATCAAGACG 
GCGCGCCGGC TTGGCATTGC GACTGTCGCC GTCTATTCCG ACGCCGATCG CGACGCAAGG
CATGTCGAGA TGGCCGACGA GGCGGTTCAT ATCGGTCCGG CGCCGGCGGC GCAAAGCTAT
CTCATGATCG ACAACATCCT TGAGGCCTGC CGCAAAACGG GGGCCGAGGC CGTGCATCCC
GGCTATGGCT TTTTGTCGGA GCGGGCGGCT TTCGCCGAGG CGCTGGCGAC GGAAAATATC
GCCTTCATCG GCCCGAATGT CGGCGCCATC GCGGCGATGG GCGACAAGAT CGAATCGAAG
CGCTTCGCGC GCGCCGCCGG CGTTTCGACG GTCCCGGGCA ATCTCGAAAT CATCAAGGAT
GGCGCGGACG CCGCGCGCAT CGCCGCCGAC ATCGGCTTTC CGGTCATGAT CAAGGCGTCG
GCCGGCGGCG GCGGCAAAGG CATGCGCATC GCAAGATCGG CAGGCGAGGT CGAGGAAGGA
TTTGCGCGCG CCAAATCGGA AGCCAAATCC TCCTTCGGCG ACGACCGCAT CTTCATCGAG
AAATTCATCG AAAATCCACG GCACGTCGAG ATTCAGATCA TTGGGGACAA GCATGGCCAT
GTGATACATC TCGGCGAACG CGAATGTTCG ATCCAGCGGC GCAATCAGAA GATCATCGAG
GAAGCGCCCT CCCCGCTGCT TGACGCAGCG ACGCGCGAAC TCATGGGCGC GCAGGCGGTG
GCTCTCGCGC AAGCGGTCGG CTATGATTCG GCCGGCACCG TCGAATTCGT CGCGGGCCAG
GACCGGAGCT TCTATTTTCT CGAGATGAAC ACCAGGCTGC AGGTCGAGCA TCCCGTCACC
GAACTGATCA CCGGCCTCGA CCTCGTCGAG CTGATGATCC GCAGCGCCGC CGGCGAGCCG
CTGCCCCTCG CGCAGGAAGA TGTGCGCCTC TCTGGCTGGG CCGTCGAAAG CCGCGTCTAT
GCCGAAGACC CGACCCGCGG CTTCCTGCCC TCGATCGGGC GGCTCACAAC CTACCGTCCC
CCGGCCGAAG GCAAATTCGG CGAGCTCACC ATCCGCAACG ATACGGGCGT CGCCGAAGGC
GGCGAGATCG CGATCCATTA TGATCCGATG ATCGCCAAGC TCGTGACGCA CGCGCCGACG
CGGAGCGAGG CGATCCACGG CCACAGCGCC GCGCTCGACG CCTTCGCCCT CGACGGCATC
CGCCACAACA TCCCTTTTCT TTCGAGCCTC ATGTCCCATC CGCGCTGGCG CGAGGGGCGG
CTCTCGACGG GGTTCATCGC CGAGGAATAT CCGGAAGGCT TCGCCAATCC GGCGCCTGCC
GGCGCGATCG CCCTGCGCCT CGCCGCAATT GCGGGGGCGA TCGACCACCA GCTCAATCAA
CGCAAGCGCC GGATTTCCGG GCAGATGCCG GTCGCCAAAG CGGTCACTTT CGAGCGCCGC
CGCCACGTCG TCGTCGGCGC GGAAGACTTC GCCTTCGAGA TCGACGAAAC GCCAAAAGGC
CTCAACCTTG CCTTTGAGGA CGGACGGCTG GTTTCGGTCC TCTCCTTGTG GAAACCGGGC
GAACCGGTGT GGCGCGGCGT GGTCGACGGC GAACGCATCG CCGCGCAGGT CCGCCCCATT
TTGAACGGCG TCCTGCTCGC ACATGGCGGA TTTTTTGCCG AAGCGCGCGT CTATACCCAG
CGTGAAGCCG AACTCGTGCG GCTCATGCCG GAAAAACGCG CGGCGGACAG CGGCAAGCAT
CTGCTCTGCC CGATGCCCGG CCTCATCCGC GAGGTTCTGG TGAGCGAAGG ACAGGCCGTA
AAAGCCGGCG AAGCGCTGGC GATCGTCGAG GCGATGAAGA TGGAGAACAT CCTGCGCGCG
GAACGCGACG CGACAATCGG CAAGGTCTAT GCGGCGGCCG GTCAGAGCCT TGCGGTCGAC
GCCGTCATCA TGGATTTTGC GGCGTGA
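
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence, used only as sample input.
first_row = "ATGTTCGGAA AGATCCTGAT CGCGAACCGC GGCGAGATCG CCTGTCGCAT CATCAAGACG"
seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of a single row gives a value that can be compared with the reported GC content.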
 
Protein sequence
MFGKILIANR GEIACRIIKT ARRLGIATVA VYSDADRDAR HVEMADEAVH IGPAPAAQSY 
LMIDNILEAC RKTGAEAVHP GYGFLSERAA FAEALATENI AFIGPNVGAI AAMGDKIESK
RFARAAGVST VPGNLEIIKD GADAARIAAD IGFPVMIKAS AGGGGKGMRI ARSAGEVEEG
FARAKSEAKS SFGDDRIFIE KFIENPRHVE IQIIGDKHGH VIHLGERECS IQRRNQKIIE
EAPSPLLDAA TRELMGAQAV ALAQAVGYDS AGTVEFVAGQ DRSFYFLEMN TRLQVEHPVT
ELITGLDLVE LMIRSAAGEP LPLAQEDVRL SGWAVESRVY AEDPTRGFLP SIGRLTTYRP
PAEGKFGELT IRNDTGVAEG GEIAIHYDPM IAKLVTHAPT RSEAIHGHSA ALDAFALDGI
RHNIPFLSSL MSHPRWREGR LSTGFIAEEY PEGFANPAPA GAIALRLAAI AGAIDHQLNQ
RKRRISGQMP VAKAVTFERR RHVVVGAEDF AFEIDETPKG LNLAFEDGRL VSVLSLWKPG
EPVWRGVVDG ERIAAQVRPI LNGVLLAHGG FFAEARVYTQ REAELVRLMP EKRAADSGKH
LLCPMPGLIR EVLVSEGQAV KAGEALAIVE AMKMENILRA ERDATIGKVY AAAGQSLAVD
AVIMDFAA
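
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. Alternative start codons are not handled, which is acceptable here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTTCGGAA AGATCCTGAT CGCGAACCGC"))  # MFGKILIANR
```

The output matches the first ten residues of the protein sequence listed above.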