Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0672 |
Symbol | |
ID | 7093753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 729297 |
End bp | 732620 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643464007 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_002361006 |
Protein GI | 217976859 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.715776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGC GGACCGACAT CTCGACGATC CTTATCATCG GCGCGGGACC GATCATCATC GGACAGGCCT GCGAATTCGA CTATTCCGGC ACGCAGGCAT GCAAGGCGCT GCGCGCAGAA GGTTACCGGA TCGTCCTCGT CAACTCCAAT CCGGCGACGA TCATGACGGA CCCGGACATG GCCGACCGCA CCTATATCGA GCCGATCGTT CCCGAGGCCG TCGCCAAGAT CATCGCCAAG GAAAGATATG CCGCGCCGGG GGGCTTCGCC CTGCTGCCGA CCATGGGCGG CCAGACGGCG TTGAATTGCG CGCTCTCGCT CAAGAAAATG GGCATTCTCG AGCAATATGA CGTCGAGATG ATCGGCGCCA GCGCCGAGGC GATCGACATG GCCGAGGATC GCGAATTGTT CCGCGAGGCG ATGTCGCGCA TCGGCCTGTC GACGCCGCGT TCGCACCAGA TCAAGACGCT CTCCCAGGCG CTCGCCATCA TCGACGATAT CGGCCTGCCG GCGATCATCC GTCCCTCCTT CACCATGGGC GGGACCGGCG GCGGCATCGC CTATAACAAG GCGGAGTTCA TCGACATCAT CGAGCGGGGC ATTGACGCCT CGCCGACAAG CGAAGTGCTG GTCGAGGAAA GCGTTCTCGG CTGGAAGGAA TATGAAATGG AGGTCGTCCG CGACAAGGCG GACAATTGCA TCATCGTCTG CTCCATCGAG AACATCGATC CGATGGGCGT TCACACAGGC GATTCCATCA CGGTCGCGCC GGCGCTGACG CTGACCGACA AGGAATATCA GATCATGCGC GACGCTTCTC TCGCGGTGCT GCGCGAGATC GGCGTCGAAA CGGGCGGCTC CAACGTCCAG TTCGCGGTCA ATCCAGCCGA CGGACGCATG ATCGTCATTG AAATGAACCC TCGCGTGTCG CGCTCCTCGG CCCTCGCCTC GAAGGCGACC GGCTTCCCGA TCGCCAAGGT AGCGGCCAAG CTCGCGGTCG GCTTCACCCT CGACGAGATC GAAAACGACA TCACCGGCGG CGCGACGCCG GCGTCCTTCG AGCCGAGCAT TGATTATGTC GTCACCAAAA TCCCGCGCTT CGCTTTCGAG AAATTTCCCG GCGCCGACAA TATTTTGACC ACCGCGATGA AGTCGGTCGG CGAATCCATG GCGATCGGGC GCACCTTCGC CGAAAGCCTG CAGAAGGCGC TGCGTTCTCT GGACATCGGG CTCGACGGGC TCGACGAGAT CGAGATCGAA GGGCTCGGCC TTGGCGACGA CATGAATGCG CTGCGCGCCG CGCTCGGCCG GCCGACGCCG GACCGGCTGC TCGTCGCGGC GCAGGCGTTG CGCTATGGCA TGACGCCGGC CGAGATCCAC GAGGCCTGCC GTATCGACGT CTGGTTTCTC GAGCGGCTGC AGGAAATTCT CGACCTTGAG GCGCTGGTGC GCGCGCGCGG CCTGCCCGGC GACAGCGCCA ATCTGCGCAT GCTGAAGGCG GCCGGCTTTT CCGACGCCCG CCTCGCCGCT CTGACCGGGT CCTCCGCAGC CGCCGTCGGG GCGCTCCGCC GCAGCCTTGG CGTGCGGCCG GTCTATAAGC GCATCGACAC CTGCGCCGCC GAATTCGCCT CGCCGACGGC CTATATGTAT TCGACCTATG CGCCGCCCTT CGCCGGGATC TCGGCCGACG AGGCGCGCCC CTCGGATCGC GAAAAAGTCG TCATCCTCGG CGGCGGCCCG AACCGCATCG GCCAGGGCAT CGAATTCGAC TATTGCTGCT GTCACGCCTC CTTCGCCCTG CGCGAGCGGG GGATCGAGAC GATCATGGTC AATTGCAACC CGGAAACGGT TTCGACCGAC TACGACACCT CCGACCGGCT CTATTTCGAG CCGCTGACGA TGGAGGACGT ACTCGAAATC CTCGCCAAGG AAAGCGAGGC CGGCAAGCTG AAGGGCGTCA TCGTGCAGTT CGGCGGCCAG ACGCCGCTCA AACTGGCGGA GGGACTGCAG AGGGCGGGAA TCCCGATTCT CGGCACCTCG GTTGATTCGA TCGACCTCGC CGAGGACCGC GACCGCTTCA AGCGCCTGCT CGACAAGATC GGGCTGAAGC AGCCGAAGAA CGGCATCGCC TATTCGGTCG AGCAATCGCG CATCGTCGCC TCGGATCTTG GCCTGCCGCT CGTGGTGCGC CCGTCCTATG TGCTGGGCGG GCGGGCCATG GCGATCATCC GCGATCATGC CGAATTCGAC GATTATCTGC TCGGAGTTCT GCCCGGCCTC GTGCCCTCAG ACGTCAAGGC GCGCTATCCG AACGACAAGA CCGGGCAGAT CAATACGGTG CTCGGCAAGA ATCCCCTGCT GTTCGACCGC TATCTCAGCG ACGCCGTCGA AGTCGACGTC GACGCTTTGT CGGACGGGAC CGACGTCTAT ATCTGCGGCG TCATGGAGCA TATCGAGGAG GCCGGTATTC ATTCGGGCGA TAGCGCCTGC TCGCTGCCGC CGCGCTCGCT GTCGCCGGAC ATGATCGCAA AGCTCGAACG CCAGACCCGC GACCTCGCGC TCGCGCTTGG CGTCGGCGGG TTGATGAATG TGCAATATGC GCTGAAGGAC GGGGAAATCT ACGTGCTCGA GGTGAACCCC CGCGCCGCCC GCACCGTGCC CTTCGTCGCC AAGGTGATCG GGGTTCCGAT CGCCAAGATC GCCGCCCGCA TCATGGCCGG CGAGAGCCTC GCCAGCTTCG GCCTGACGCC GCCCCGCTTC GATCACGTCG GCGTCAAGGA GAGCGTGTTT CCCTTCGCGC GCTTTCCCGG CGTCGACACG GTGCTCGGCC CGGAAATGCG CTCGACCGGC GAGGTCATGG GGCTCGACCG CTCCTTCGCT GTCGCCTTCG CCAAGAGCCA GCTCGGCGGC GGCACCAATG TGCCGACCTC GGGCGCGGTG TTCGTATCCG TCCGCGACGC CGACAAGCAG CGCGTGCTCG GCACGATCAA GCTATTGGCC GGGCTTGGTT TCCGCATCCT TGCGACCGGG GGCACCGCCC GCTTCCTTCA CGCCGAAGGA GTCGCGGCGC AGCGCATCAA CAAAGTCTCG GAAGGCCGGC CGCATGTGGT CGATCTCATC AAGAACGGCA GCGTGCAGCT CGTCCTCAAT ACGACCGAAG GCAAGCAGGC CCTCGCCGAC TCGCGTTCGT TGCGCCGCGC GGCGCTTTTG CACAAAGTGC CCTATTACAC GACGCTCGCC GGGGCGATCG CGGCGTCCGA AGGAATCAAG GCCTATATCG CCGGCGATCT CGAAGTCCGC GCGCTGCAGG ATTATTTCAG TTAA
|
Protein sequence | MPKRTDISTI LIIGAGPIII GQACEFDYSG TQACKALRAE GYRIVLVNSN PATIMTDPDM ADRTYIEPIV PEAVAKIIAK ERYAAPGGFA LLPTMGGQTA LNCALSLKKM GILEQYDVEM IGASAEAIDM AEDRELFREA MSRIGLSTPR SHQIKTLSQA LAIIDDIGLP AIIRPSFTMG GTGGGIAYNK AEFIDIIERG IDASPTSEVL VEESVLGWKE YEMEVVRDKA DNCIIVCSIE NIDPMGVHTG DSITVAPALT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAVNPADGRM IVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGFTLDEI ENDITGGATP ASFEPSIDYV VTKIPRFAFE KFPGADNILT TAMKSVGESM AIGRTFAESL QKALRSLDIG LDGLDEIEIE GLGLGDDMNA LRAALGRPTP DRLLVAAQAL RYGMTPAEIH EACRIDVWFL ERLQEILDLE ALVRARGLPG DSANLRMLKA AGFSDARLAA LTGSSAAAVG ALRRSLGVRP VYKRIDTCAA EFASPTAYMY STYAPPFAGI SADEARPSDR EKVVILGGGP NRIGQGIEFD YCCCHASFAL RERGIETIMV NCNPETVSTD YDTSDRLYFE PLTMEDVLEI LAKESEAGKL KGVIVQFGGQ TPLKLAEGLQ RAGIPILGTS VDSIDLAEDR DRFKRLLDKI GLKQPKNGIA YSVEQSRIVA SDLGLPLVVR PSYVLGGRAM AIIRDHAEFD DYLLGVLPGL VPSDVKARYP NDKTGQINTV LGKNPLLFDR YLSDAVEVDV DALSDGTDVY ICGVMEHIEE AGIHSGDSAC SLPPRSLSPD MIAKLERQTR DLALALGVGG LMNVQYALKD GEIYVLEVNP RAARTVPFVA KVIGVPIAKI AARIMAGESL ASFGLTPPRF DHVGVKESVF PFARFPGVDT VLGPEMRSTG EVMGLDRSFA VAFAKSQLGG GTNVPTSGAV FVSVRDADKQ RVLGTIKLLA GLGFRILATG GTARFLHAEG VAAQRINKVS EGRPHVVDLI KNGSVQLVLN TTEGKQALAD SRSLRRAALL HKVPYYTTLA GAIAASEGIK AYIAGDLEVR ALQDYFS
|
| |