Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_1598 |
Symbol | carB |
ID | 8525461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 1623588 |
End bp | 1626716 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_003252713 |
Protein GI | 261419031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAG ATTCTTCGCT TCAGTCGATT CTCCTGATCG GGTCGGGGCC GATTGTCATC GGCCAGGCCG CCGAGTTTGA TTATTCCGGC ACACAAGCGT GCATCGCCTT AAAGGAAGAA GGATATCGCG TCATTTTAGT GAACAACAAT CCGGCGACGA TCATGACCGA TGAAGTTCAT GCCGATGCCG TGTATTTTGA ACCGCTCACC GTCGATGTGC TCGAAGCGAT TATTGCCAAA GAACGCCCGG ACGGGCTGCT CGCCACGTTC GGCGGCCAGA CAGGGCTCAA CTTGGCGTTT CAGCTGCATG AAGCCGGCGT GCTTGAAAAG TATGGGGTGA GACTGCTCGG AACACCGATT GAAGCGATCA AGCGCGGGGA AGACCGCGAA GCGTTCCGCG CGTTAATGCA TGAGCTTGGC GAACCGGTGC CGGAGAGCGA AATTGTCACA AGCGTCGAGG AAGCGGTCGC GTTTGCCGAA CAAATCGGTT TTCCGATCAT TATTCGTCCC GCGTATACGC TCGGCGGGAC GGGCGGCGGC ATTGCCGAAA ACATGGAGCA GTTTCTCGCG CTCGTGGAAA AAGGGCTGGC CGAAAGCCCG ATCCGCCAAT GTTTGATCGA ACGGAGCGTC GCTGGGTTTA AAGAAATTGA ATACGAAGTG ATGCGCGACC AGTCGAATAC GTGCATCACG GTTTGCAATA TGGAAAACGT CGATCCAGTC GGCATCCATA CGGGCGATTC GATCGTCGTC GCACCGTCGC AGACGTTGAC CGATGAGGAG TACCAAATGC TCCGTTCGTC GGCGGTGAAG ATCATTTCCG CATTAGGGAT CATCGGCGGC TGCAACATTC AGTTCGCCCT TGACCCGAAC AGCAAACAAT ATTACTTAAT CGAAGTCAAC CCGCGCGTCA GCCGCTCGTC AGCGCTCGCC TCGAAAGCGA CCGGCTACCC AATCGCCCGC ATTGCGGCGA AGCTGGCGGT TGGCTATACG CTCGCGGAAC TCGTCAATCC GGTGACGAAA ACGACGTACG CCAGCTTCGA GCCGGCCTTG GACTACGTCG TCGTCAAGTT TCCGCGCTTG CCGTTTGACA AGTTCCCGCA CGCGGATCGG AAGCTCGGTA CGCAAATGAA AGCGACCGGG GAAGTGATGG CGATCGATCG CAATATGGAG CGGGCGTTCC AAAAAGCGGT GCAGTCGCTC GAAGGAAAAA ACAACGGACT GTTTTTGCCG GAGCTTTCGG CGAAAACGGA TGACGAGCTG AAACAACTGC TTGTTGACAA AGACGACCGC CGCTTTTTCG CCATTTTGGA ACTGCTCCGC CGCGGGGTGG CGGTCGAAGC CATTCACGAA TGGACGAAAA TCGACCGCTT TTTCCTTGCT TCGTTCGAAC GGCTCGTGGC GCTCGAAAAA CAGGCGGCAG CCACCACGAT CGATACGATC GATGAGCCGA CGTTGCGCTT CTTGAAAGAA AAAGGGGCAA GCGACGCCTT TTTGGCCGAA ACGTGGGGCG TGACGGAGCT TGATGTGCGC AACAAGCGGA AGGAACTTGG CATCGTGCCG TCATACAAAA TGGTCGACAC GTGCGCGGCC GAATTTCATT CGGAAACGGA TTATTACTAC TCGACCTATT TCGGCGAAGA CGAGCGGAAG AAAGCGAGCG GCAAGGAGAA AGTGCTGATC ATTGGAGCCG GACCTATTCG CATCGGCCAA GGCATTGAGT TTGACTACAG CTCCGTTCAT AGCGTGTTCG CCTTGCAAAA AGAAGGCTAT GAAACGGTGA TGATCAACAA CAATCCGGAA ACAGTGAGCA CCGACTTTGC CGTCGCCGAC CGTCTGTACT TTGAACCGCT GACGCTTGAG AGCGTCCTCG ATGTTATCGA AGCGGAACAA ATCAAGCATG TCATCGTTCA ATTTGGCGGA CAGACGGCGA TCAATTTGGT CAAAGGGCTT GAAGAAGCCG GCGTGTCGCT GTTGGGCGTC ACGTACGATA TGATTGACCA GCTCGAGGAC CGCGATCGTT TTTACCAGTT GCTTGAGGAG CTTGACATCC CGCACGTACC GGGCTTGATG GCGAACAACG CCGAGGAGCT CGCCGCCAAA GCGGCTGAGA TCGGCTGCCC GGTGCTGCTT CGTCCGTCGT ATGTCATCGG GGGCCGCGGC ATGTTTATCG TCCATAGCGA GGCGCAGCTC GCTGCTTTGA TCGAGCAAGG CGAGTTGACC TATCCGATTT TGATTGATGC GTATTTGGAC GGGAAAGAGG CAGAAGCGGA CATCGTGACG GATGGAACAG ACATCGTGCT GCCGGTCATC ATTGAACACG TCGAAAAAGC CGGCGTCCAC TCCGGTGACA GCTATGCTTG GCTGCCGGCG CAGACGCTCA CAGACGAAGA AAAAGCGAAA ATCATCGACT ATGCGAGCCG GATTGCGAAA AAACTCGGCT TTAAAGGAAT TATGAACATT CAATATGTCA TTGCGGACGG CAATGTATAT GTCCTGGAAG TGAACCCGCG CGCGAGCCGG ACGGTGCCGA TCGTCAGCAA AACGACCGGC GTGCCATTGG CGCAAATTGC GACAAAGTTA TTGCTTGGGA AATCACTCGT CGACATCGTT GACGAAAAAG CTCGCGGATT GGAGGTCATG CCGTACGCCG TGTTAAAGTA TCCGGTCTTT TCGACCCATA AGCTGCCGGG GGTTGACCCG ATGGTTGGAC CGGAGATGAA ATCGACCGGT GAAGGCATCA GCATCGCCGC GACGAAGGAA GAAGCGGCGT ACAAAGCGTT TTACGCGTAC TTGCGAAAGA AAGCAAACGC CAATGAAATG TATGTGATTG GCGGCATCGA TGAGACGATG GCCGCGGAAA TCGAAGCGAA ACAGCTTGTG ATCGTGTCAG ATGTCCCGTT TGCCGATTGG GTGAAGCGCG ACACGGCGCT CGCGGTGATC AACTTGGGCA AAGAGGAAGA CGAGGCAAAC AAACGAATGA CTGCGTTGTC CCGACAATTG CTCGTCTTCA CGGAACGTGA GACATTGAAG CTCTTCTTGC AGGCGCTCGA TGTGGATCAT CTTGATGTGC AGCCGATCCA CGGCTGGTTG GAAAAGAAAA AACAGGCAGA ACAGGCGGTG ATCGCATGA
|
Protein sequence | MPKDSSLQSI LLIGSGPIVI GQAAEFDYSG TQACIALKEE GYRVILVNNN PATIMTDEVH ADAVYFEPLT VDVLEAIIAK ERPDGLLATF GGQTGLNLAF QLHEAGVLEK YGVRLLGTPI EAIKRGEDRE AFRALMHELG EPVPESEIVT SVEEAVAFAE QIGFPIIIRP AYTLGGTGGG IAENMEQFLA LVEKGLAESP IRQCLIERSV AGFKEIEYEV MRDQSNTCIT VCNMENVDPV GIHTGDSIVV APSQTLTDEE YQMLRSSAVK IISALGIIGG CNIQFALDPN SKQYYLIEVN PRVSRSSALA SKATGYPIAR IAAKLAVGYT LAELVNPVTK TTYASFEPAL DYVVVKFPRL PFDKFPHADR KLGTQMKATG EVMAIDRNME RAFQKAVQSL EGKNNGLFLP ELSAKTDDEL KQLLVDKDDR RFFAILELLR RGVAVEAIHE WTKIDRFFLA SFERLVALEK QAAATTIDTI DEPTLRFLKE KGASDAFLAE TWGVTELDVR NKRKELGIVP SYKMVDTCAA EFHSETDYYY STYFGEDERK KASGKEKVLI IGAGPIRIGQ GIEFDYSSVH SVFALQKEGY ETVMINNNPE TVSTDFAVAD RLYFEPLTLE SVLDVIEAEQ IKHVIVQFGG QTAINLVKGL EEAGVSLLGV TYDMIDQLED RDRFYQLLEE LDIPHVPGLM ANNAEELAAK AAEIGCPVLL RPSYVIGGRG MFIVHSEAQL AALIEQGELT YPILIDAYLD GKEAEADIVT DGTDIVLPVI IEHVEKAGVH SGDSYAWLPA QTLTDEEKAK IIDYASRIAK KLGFKGIMNI QYVIADGNVY VLEVNPRASR TVPIVSKTTG VPLAQIATKL LLGKSLVDIV DEKARGLEVM PYAVLKYPVF STHKLPGVDP MVGPEMKSTG EGISIAATKE EAAYKAFYAY LRKKANANEM YVIGGIDETM AAEIEAKQLV IVSDVPFADW VKRDTALAVI NLGKEEDEAN KRMTALSRQL LVFTERETLK LFLQALDVDH LDVQPIHGWL EKKKQAEQAV IA
|
| |