Gene Msil_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3858 
Symbol 
ID7092554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4226139 
End bp4227497 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content62% 
IMG OID643467143 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_002364102 
Protein GI217979955 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0717744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAA AAATCCTCAT CGCCAATCGC GGCGAGATCG CGTTGCGGAT TCTGCGCGCG 
GCCAAGGAGC TTGGCATTGC GACGGTCGCG GTCCATTCGA CCGCCGATTC CGAAGCAATG
CATGTCAAGC TCGCCGACGA ATCCGTCTGC GTCGGGCCGC CGCCCGCTCG CGAATCCTAT
CTCAACATTC CGGCTCTCCT CGCCGCCTGC GAGATCACCG GGGCCGAGGC GCTGCATCCC
GGCTATGGAT TTTTGTCGGA AAACGCCCGC TTCGCGGAAA TCCTCGCCGA GCATCACATC
GTATTCGTCG GGCCAAAGCC GGAGCATATC CGCCTGATGG GCGACAAGAT CGAGGCGAAG
CGCACGGCGC TGCGGCTCGG CATCCCATGT GTGCCAGGCT CGGCCGGCGC CATCACGGAT
GAGGCCGAGG CGAAGGCGGC GGCAAGAGAA CTCGGCTATC CTGTACTCGT CAAGGCGGCG
GCGGGCGGCG GCGGCCGCGG CATGAAGGTT TCATTCAGCG AGGAGGACAT CGCCTCGACG
CTGGAGACGG CGCGCATGGA GGCGAAGTCC GCCTTTGGCG ATGATTCCGT GTACCTTGAA
AAATATCTCG AAAAACCCCG CCACATCGAA GTGCAGATTC TCGGCGACGG ACGCGGCGGC
GCGATCCATC TTGGCGAGCG CGACTGCTCG CTGCAGCGCC GGCACCAGAA AGTCTGGGAG
GAAGGCCCGT CCCCCGCGCT CAATGAGTCG CAGCGCAAGG AAATCGGCGA GATCTGCGCG
GCGGCCATGC GCGAACTGCA GTATGCCGGC GCCGGCACGA TCGAATTCCT CTATGAGGAC
GGCAAATTCT ATTTCATCGA GATGAACACC CGCATCCAGG TCGAGCATCC GGTGACCGAG
ATGATCACCG GCGTCGATCT CGTCAATGAG CAGATCAAGA TCGCCGCCGG ATCGGCGCTG
ACCTTGACGC AGGAAGACGT TTCCTTCAAC GGACACGCCA TCGAATGCCG CATCAACGCC
GAACATCCGG CCACCTTCCG CCCCTCGCCG GGGATGATCA ATTATTACCA TCCGCCGGGC
GGCCTCGGCG TCCGCGTCGA TAGCGCCGTC TACGCCGGCT ATACGATCCC GCCGACCTAT
GATTCACTTG TCGGCAAGCT GATCGTGCAT GGCCGCAATC GCAATGAAGC GCTGATGCGC
CTGCGCCGCT CGCTCGATGA GTTCATTATC GACGGCATCG ACACGACCAT CCCGCTGTTC
CAGACGCTGG TGCGCAACGC CGACATCCAG AACGGGCTTT ACGATATCCA TTGGCTCGAA
AAATTTCTGG CCGACGGCGG CATGGACGGC ACGGAGTAA
 
Protein sequence
MFEKILIANR GEIALRILRA AKELGIATVA VHSTADSEAM HVKLADESVC VGPPPARESY 
LNIPALLAAC EITGAEALHP GYGFLSENAR FAEILAEHHI VFVGPKPEHI RLMGDKIEAK
RTALRLGIPC VPGSAGAITD EAEAKAAARE LGYPVLVKAA AGGGGRGMKV SFSEEDIAST
LETARMEAKS AFGDDSVYLE KYLEKPRHIE VQILGDGRGG AIHLGERDCS LQRRHQKVWE
EGPSPALNES QRKEIGEICA AAMRELQYAG AGTIEFLYED GKFYFIEMNT RIQVEHPVTE
MITGVDLVNE QIKIAAGSAL TLTQEDVSFN GHAIECRINA EHPATFRPSP GMINYYHPPG
GLGVRVDSAV YAGYTIPPTY DSLVGKLIVH GRNRNEALMR LRRSLDEFII DGIDTTIPLF
QTLVRNADIQ NGLYDIHWLE KFLADGGMDG TE