Gene Msil_0672 details


Gene Information

Locus tag: Msil_0672
Symbol:
ID: 7093753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 729297
End bp: 732620
Gene Length: 3324 bp
Protein Length: 1107 aa
Translation table: 11
GC content: 64%
IMG OID: 643464007
Product: carbamoyl-phosphate synthase, large subunit
Protein accession: YP_002361006
Protein GI: 217976859
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
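
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length_bp = gene_length(729297, 732620)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3324 1107
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.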


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.715776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGC GGACCGACAT CTCGACGATC CTTATCATCG GCGCGGGACC GATCATCATC 
GGACAGGCCT GCGAATTCGA CTATTCCGGC ACGCAGGCAT GCAAGGCGCT GCGCGCAGAA
GGTTACCGGA TCGTCCTCGT CAACTCCAAT CCGGCGACGA TCATGACGGA CCCGGACATG
GCCGACCGCA CCTATATCGA GCCGATCGTT CCCGAGGCCG TCGCCAAGAT CATCGCCAAG
GAAAGATATG CCGCGCCGGG GGGCTTCGCC CTGCTGCCGA CCATGGGCGG CCAGACGGCG
TTGAATTGCG CGCTCTCGCT CAAGAAAATG GGCATTCTCG AGCAATATGA CGTCGAGATG
ATCGGCGCCA GCGCCGAGGC GATCGACATG GCCGAGGATC GCGAATTGTT CCGCGAGGCG
ATGTCGCGCA TCGGCCTGTC GACGCCGCGT TCGCACCAGA TCAAGACGCT CTCCCAGGCG
CTCGCCATCA TCGACGATAT CGGCCTGCCG GCGATCATCC GTCCCTCCTT CACCATGGGC
GGGACCGGCG GCGGCATCGC CTATAACAAG GCGGAGTTCA TCGACATCAT CGAGCGGGGC
ATTGACGCCT CGCCGACAAG CGAAGTGCTG GTCGAGGAAA GCGTTCTCGG CTGGAAGGAA
TATGAAATGG AGGTCGTCCG CGACAAGGCG GACAATTGCA TCATCGTCTG CTCCATCGAG
AACATCGATC CGATGGGCGT TCACACAGGC GATTCCATCA CGGTCGCGCC GGCGCTGACG
CTGACCGACA AGGAATATCA GATCATGCGC GACGCTTCTC TCGCGGTGCT GCGCGAGATC
GGCGTCGAAA CGGGCGGCTC CAACGTCCAG TTCGCGGTCA ATCCAGCCGA CGGACGCATG
ATCGTCATTG AAATGAACCC TCGCGTGTCG CGCTCCTCGG CCCTCGCCTC GAAGGCGACC
GGCTTCCCGA TCGCCAAGGT AGCGGCCAAG CTCGCGGTCG GCTTCACCCT CGACGAGATC
GAAAACGACA TCACCGGCGG CGCGACGCCG GCGTCCTTCG AGCCGAGCAT TGATTATGTC
GTCACCAAAA TCCCGCGCTT CGCTTTCGAG AAATTTCCCG GCGCCGACAA TATTTTGACC
ACCGCGATGA AGTCGGTCGG CGAATCCATG GCGATCGGGC GCACCTTCGC CGAAAGCCTG
CAGAAGGCGC TGCGTTCTCT GGACATCGGG CTCGACGGGC TCGACGAGAT CGAGATCGAA
GGGCTCGGCC TTGGCGACGA CATGAATGCG CTGCGCGCCG CGCTCGGCCG GCCGACGCCG
GACCGGCTGC TCGTCGCGGC GCAGGCGTTG CGCTATGGCA TGACGCCGGC CGAGATCCAC
GAGGCCTGCC GTATCGACGT CTGGTTTCTC GAGCGGCTGC AGGAAATTCT CGACCTTGAG
GCGCTGGTGC GCGCGCGCGG CCTGCCCGGC GACAGCGCCA ATCTGCGCAT GCTGAAGGCG
GCCGGCTTTT CCGACGCCCG CCTCGCCGCT CTGACCGGGT CCTCCGCAGC CGCCGTCGGG
GCGCTCCGCC GCAGCCTTGG CGTGCGGCCG GTCTATAAGC GCATCGACAC CTGCGCCGCC
GAATTCGCCT CGCCGACGGC CTATATGTAT TCGACCTATG CGCCGCCCTT CGCCGGGATC
TCGGCCGACG AGGCGCGCCC CTCGGATCGC GAAAAAGTCG TCATCCTCGG CGGCGGCCCG
AACCGCATCG GCCAGGGCAT CGAATTCGAC TATTGCTGCT GTCACGCCTC CTTCGCCCTG
CGCGAGCGGG GGATCGAGAC GATCATGGTC AATTGCAACC CGGAAACGGT TTCGACCGAC
TACGACACCT CCGACCGGCT CTATTTCGAG CCGCTGACGA TGGAGGACGT ACTCGAAATC
CTCGCCAAGG AAAGCGAGGC CGGCAAGCTG AAGGGCGTCA TCGTGCAGTT CGGCGGCCAG
ACGCCGCTCA AACTGGCGGA GGGACTGCAG AGGGCGGGAA TCCCGATTCT CGGCACCTCG
GTTGATTCGA TCGACCTCGC CGAGGACCGC GACCGCTTCA AGCGCCTGCT CGACAAGATC
GGGCTGAAGC AGCCGAAGAA CGGCATCGCC TATTCGGTCG AGCAATCGCG CATCGTCGCC
TCGGATCTTG GCCTGCCGCT CGTGGTGCGC CCGTCCTATG TGCTGGGCGG GCGGGCCATG
GCGATCATCC GCGATCATGC CGAATTCGAC GATTATCTGC TCGGAGTTCT GCCCGGCCTC
GTGCCCTCAG ACGTCAAGGC GCGCTATCCG AACGACAAGA CCGGGCAGAT CAATACGGTG
CTCGGCAAGA ATCCCCTGCT GTTCGACCGC TATCTCAGCG ACGCCGTCGA AGTCGACGTC
GACGCTTTGT CGGACGGGAC CGACGTCTAT ATCTGCGGCG TCATGGAGCA TATCGAGGAG
GCCGGTATTC ATTCGGGCGA TAGCGCCTGC TCGCTGCCGC CGCGCTCGCT GTCGCCGGAC
ATGATCGCAA AGCTCGAACG CCAGACCCGC GACCTCGCGC TCGCGCTTGG CGTCGGCGGG
TTGATGAATG TGCAATATGC GCTGAAGGAC GGGGAAATCT ACGTGCTCGA GGTGAACCCC
CGCGCCGCCC GCACCGTGCC CTTCGTCGCC AAGGTGATCG GGGTTCCGAT CGCCAAGATC
GCCGCCCGCA TCATGGCCGG CGAGAGCCTC GCCAGCTTCG GCCTGACGCC GCCCCGCTTC
GATCACGTCG GCGTCAAGGA GAGCGTGTTT CCCTTCGCGC GCTTTCCCGG CGTCGACACG
GTGCTCGGCC CGGAAATGCG CTCGACCGGC GAGGTCATGG GGCTCGACCG CTCCTTCGCT
GTCGCCTTCG CCAAGAGCCA GCTCGGCGGC GGCACCAATG TGCCGACCTC GGGCGCGGTG
TTCGTATCCG TCCGCGACGC CGACAAGCAG CGCGTGCTCG GCACGATCAA GCTATTGGCC
GGGCTTGGTT TCCGCATCCT TGCGACCGGG GGCACCGCCC GCTTCCTTCA CGCCGAAGGA
GTCGCGGCGC AGCGCATCAA CAAAGTCTCG GAAGGCCGGC CGCATGTGGT CGATCTCATC
AAGAACGGCA GCGTGCAGCT CGTCCTCAAT ACGACCGAAG GCAAGCAGGC CCTCGCCGAC
TCGCGTTCGT TGCGCCGCGC GGCGCTTTTG CACAAAGTGC CCTATTACAC GACGCTCGCC
GGGGCGATCG CGGCGTCCGA AGGAATCAAG GCCTATATCG CCGGCGATCT CGAAGTCCGC
GCGCTGCAGG ATTATTTCAG TTAA
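
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any computation. The sketch below is one possible way to do that and to estimate a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first sequence line is used as sample input, so the printed fraction describes that line rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCGAAGC GGACCGACAT CTCGACGATC CTTATCATCG GCGCGGGACC GATCATCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to clean_sequence should give a 3324-base string whose GC fraction can be compared with the reported value.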
 
Protein sequence
MPKRTDISTI LIIGAGPIII GQACEFDYSG TQACKALRAE GYRIVLVNSN PATIMTDPDM 
ADRTYIEPIV PEAVAKIIAK ERYAAPGGFA LLPTMGGQTA LNCALSLKKM GILEQYDVEM
IGASAEAIDM AEDRELFREA MSRIGLSTPR SHQIKTLSQA LAIIDDIGLP AIIRPSFTMG
GTGGGIAYNK AEFIDIIERG IDASPTSEVL VEESVLGWKE YEMEVVRDKA DNCIIVCSIE
NIDPMGVHTG DSITVAPALT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAVNPADGRM
IVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGFTLDEI ENDITGGATP ASFEPSIDYV
VTKIPRFAFE KFPGADNILT TAMKSVGESM AIGRTFAESL QKALRSLDIG LDGLDEIEIE
GLGLGDDMNA LRAALGRPTP DRLLVAAQAL RYGMTPAEIH EACRIDVWFL ERLQEILDLE
ALVRARGLPG DSANLRMLKA AGFSDARLAA LTGSSAAAVG ALRRSLGVRP VYKRIDTCAA
EFASPTAYMY STYAPPFAGI SADEARPSDR EKVVILGGGP NRIGQGIEFD YCCCHASFAL
RERGIETIMV NCNPETVSTD YDTSDRLYFE PLTMEDVLEI LAKESEAGKL KGVIVQFGGQ
TPLKLAEGLQ RAGIPILGTS VDSIDLAEDR DRFKRLLDKI GLKQPKNGIA YSVEQSRIVA
SDLGLPLVVR PSYVLGGRAM AIIRDHAEFD DYLLGVLPGL VPSDVKARYP NDKTGQINTV
LGKNPLLFDR YLSDAVEVDV DALSDGTDVY ICGVMEHIEE AGIHSGDSAC SLPPRSLSPD
MIAKLERQTR DLALALGVGG LMNVQYALKD GEIYVLEVNP RAARTVPFVA KVIGVPIAKI
AARIMAGESL ASFGLTPPRF DHVGVKESVF PFARFPGVDT VLGPEMRSTG EVMGLDRSFA
VAFAKSQLGG GTNVPTSGAV FVSVRDADKQ RVLGTIKLLA GLGFRILATG GTARFLHAEG
VAAQRINKVS EGRPHVVDLI KNGSVQLVLN TTEGKQALAD SRSLRRAALL HKVPYYTTLA
GAIAASEGIK AYIAGDLEVR ALQDYFS
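
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand instead of relying on an external library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), which is sufficient here because this gene starts with ATG. The names are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGCCGAAGC GGACCGACAT CTCGACGATC"))  # MPKRTDISTI
```

The result matches the first ten residues of the protein sequence listed above. A gene that starts with an alternative start codon such as GTG or TTG would need its first residue set to M separately.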