Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1047 |
Symbol | carB |
ID | 7976827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1097549 |
End bp | 1100746 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644798000 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002949173 |
Protein GI | 239826549 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAC GCCAAGACAT TGAAACGATT TTAGTGATCG GTTCCGGCCC GATTGTCATC GGCCAGGCGG CGGAGTTTGA TTATGCGGGC ACACAGGCAT GTTTGGCGCT AAAGGAAGAA GGATACAAAG TTATTTTAGT TAACTCCAAT CCAGCAACGA TTATGACCGA TACGGAAATT GCCGACAAAG TATATATGGA GCCGCTCACG CTCGAATTCG TTTCTCGCAT TATCCGCAAA GAACGTCCGG ACGCGATTTT GCCGACACTT GGCGGACAGA CTGGGCTAAA CTTGGCGGTC GAGCTTGCTA GAACGGGGGT GCTCGCAGAA TGCGGCGTTG AAATTTTAGG AACAAAATTA GAAGCAATTG AAAAAGCGGA AGACCGCGAA CAATTTCGCG CGCTCATGAA CGAACTTGGC GAACCGGTTC CGGAAAGCGA AATTATTCAT AGCTTGGAAG AAGCATACGC TTTCGTGGAA AAGGTCGGCT ATCCGGTTAT CGTCCGCCCG GCGTTTACGC TCGGCGGCAC TGGCGGCGGC ATTTGCAAAA ACGAGGAAGA ATTGATCGAT ATCGTTTCTA CTGGGTTAAA ACTAAGCCCT GTTCATCAAT GCTTGCTAGA AAAAAGCATC GCGGGCTATA AAGAAATCGA GTATGAAGTG ATGCGCGACG CCAACGATAA CGCGATCGTC GTCTGCAATA TGGAAAATAT CGATCCGGTC GGCATTCACA CCGGCGATTC GATCGTCGTC GCTCCAAGCC AAACGCTAAG CGACCGCGAA TATCAATTGC TGCGCAACGC GTCGCTAAGA ATCATTCGCG CTCTTGGCAT CGAAGGCGGC TGCAATGTGC AGCTGGCGCT CGATCCGCAT AGCTTCCATT ATTACGTCAT TGAAGTCAAC CCGCGCGTCA GCCGTTCATC GGCGCTGGCG TCAAAAGCTA CCGGCTACCC GATTGCCAAG CTCGCCGCGA AAATCGCCGT CGGCTTAACG TTAGATGAAA TCATTAATCC GGTGACAGGA AAAACATACG CTTGCTTTGA ACCAACGCTC GATTACGTTG TGACGAAAAT TCCGCGCTTT CCGTTCGATA AATTCGAATC GGCCAACCGC CGTCTTGGCA CGCAAATGAA AGCTACAGGC GAAGTGATGG CGATCGGACG GACGCTAGAA GAGTCACTGT TAAAAGCGGT TCGCTCCCTT GAGACGAACG TCTACCATCT CGAACTTAAA GATGCCGAAA ATGTATCAGA TGAGTTAATC GAAAAGCGGA TTCGCAAAGC GGGAGATGAA CGCCTCTTCT ATATTGCCGA GGCGCTGCGC CGCGGATTTA CTGTCGAGCA AATTCACGAG TGGAGCCAAA TTGACCGATT TTTCTTAACC AAAATCGAAA ACATCGTCCG CTTTGAAAAC GTTGTTCGTG ACTATAAGGG GGATATCGAA GTACTGCGAA AAGCGAAAGA AATGGGCTTT TCCGATGTAG CCATCGCCAA GCTTTGGAAC AAGAGTGAGC GCGATGTGTA TGAGATGCGC AAACAAGCAG GAATCATTCC TGTATATAAA ATGGTGGATA CATGCGCGGC GGAATTTGAA TCAGAAACGC CGTATTACTA CAGCACGTAC GAAGACGAAA ACGAATCGGT CGTCACCGAC CGAGAAAGCG TCGTTGTGCT CGGCTCAGGA CCAATTCGCA TCGGGCAAGG GATTGAATTC GATTATGCGA CCGTTCATTC GGTCTGGGCG ATTAAAGAAG CGGGCTATGA GGCGATTATT ATCAACAACA ATCCGGAAAC GGTGTCGACC GACTTCAGCA TATCGGACAA ATTATATTTC GAGCCGTTAA CCATTGAAGA TGTGATGCAT GTCATTGATT TAGAAAAGCC GATCGGGGTT ATCGTGCAAT TCGGCGGCCA GACGGCAATC AACTTGGCGG CCGAATTAGC GGCGCGAGGC GTCCGCATTT TAGGAACGTC GCTTGAGGAC TTAGACCGCG CCGAAGACCG TGACAAATTT GAACAAACGT TATCGGAGCT AGGCATTCCG CAGCCGCAAG GAAAAACGGC ATTTTCCGTC GAGGAAGCGG TGCGGATTGC TGAGGAAATC GGCTATCCGG TGCTTGTTCG TCCATCGTAT GTTCTTGGCG GCCGCGCAAT GGAAATCGTG TATCAAGAAG AAGAGCTATT GCACTACATG GAGCACGCCG TCAAAGTGAA CCCGCAGCAC CCGGTGCTCA TCGACCGCTA TTTAATCGGA AAAGAAATCG AAGTCGATGC GATTTCTGAT GGAGAAACGG TGTTTATTCC GGGAATTATG GAACATATCG AACGGGCGGG CGTGCATTCC GGCGACTCGA TTGCCGTTTA TCCGCCGCAA ACGTTAACGA AGGACATCCA GCAAAAAATC GTCGATTACA CGATTAAATT GGCAAGAGGA TTGCGAATTG TCGGACTGCT GAACATCCAA TTTGTCATGT ACCAAGGCGA AGTGTACGTG CTGGAGGTGA ATCCGCGCTC AAGCCGCACC GTTCCGTTTT TAAGCAAAAT TACCGGCGTG CCGATGGCGA ATATTGCGAC GAAAGTCATT TTAGGCGCGA AGCTCGCGGA ACTTGGCTAT GAAACAGGCT TGAGGCAAGA AAGCGAAGGA GTATACGTGA AAGCGCCGGT CTTCTCATTC GCAAAACTGC GCAACGTCGA TATTTCGCTC GGCCCGGAAA TGAAGTCGAC TGGCGAAGTG ATCGGCAAAG ACGTGACATT TGAAAAAGCG TTATATAAAG GGTTAGTTGC TTCGGGAATC CATATTCGTC CATACGGAGC TGTCCTATTA ACGGTTGCCG ATAAAGATAA AGAAGATGCG ATTGAACTTG CAAGACGTTT CTATCAAATT GGCTATCAGC TGCTCGCGAC AAACGGCACG GCGGAAGCGT TAAAAGCGGC GGACATTCCG GTAACCGTCG TCAATAAAAT CCATTCCGCA TCGCCGAACA TTTTAGATGT GATTCGTCAA GGGAAAGCGC AAGTTGTCAT CAACACGCTT ACAAAAGGAA AACAGCCGGA AAGCGATGGA TTCCGCATTC GCCGTGAAGC GGTCGAAAAC GGCATTCCAT GCTTAACCTC GCTGGATACG GCGAAGGCGA TGCTTCAAGT CATCGAATCG ATGACGTTTT CGACAACGGC GATGACACAA GGGATGGTGC GCGTATGA
|
Protein sequence | MPKRQDIETI LVIGSGPIVI GQAAEFDYAG TQACLALKEE GYKVILVNSN PATIMTDTEI ADKVYMEPLT LEFVSRIIRK ERPDAILPTL GGQTGLNLAV ELARTGVLAE CGVEILGTKL EAIEKAEDRE QFRALMNELG EPVPESEIIH SLEEAYAFVE KVGYPVIVRP AFTLGGTGGG ICKNEEELID IVSTGLKLSP VHQCLLEKSI AGYKEIEYEV MRDANDNAIV VCNMENIDPV GIHTGDSIVV APSQTLSDRE YQLLRNASLR IIRALGIEGG CNVQLALDPH SFHYYVIEVN PRVSRSSALA SKATGYPIAK LAAKIAVGLT LDEIINPVTG KTYACFEPTL DYVVTKIPRF PFDKFESANR RLGTQMKATG EVMAIGRTLE ESLLKAVRSL ETNVYHLELK DAENVSDELI EKRIRKAGDE RLFYIAEALR RGFTVEQIHE WSQIDRFFLT KIENIVRFEN VVRDYKGDIE VLRKAKEMGF SDVAIAKLWN KSERDVYEMR KQAGIIPVYK MVDTCAAEFE SETPYYYSTY EDENESVVTD RESVVVLGSG PIRIGQGIEF DYATVHSVWA IKEAGYEAII INNNPETVST DFSISDKLYF EPLTIEDVMH VIDLEKPIGV IVQFGGQTAI NLAAELAARG VRILGTSLED LDRAEDRDKF EQTLSELGIP QPQGKTAFSV EEAVRIAEEI GYPVLVRPSY VLGGRAMEIV YQEEELLHYM EHAVKVNPQH PVLIDRYLIG KEIEVDAISD GETVFIPGIM EHIERAGVHS GDSIAVYPPQ TLTKDIQQKI VDYTIKLARG LRIVGLLNIQ FVMYQGEVYV LEVNPRSSRT VPFLSKITGV PMANIATKVI LGAKLAELGY ETGLRQESEG VYVKAPVFSF AKLRNVDISL GPEMKSTGEV IGKDVTFEKA LYKGLVASGI HIRPYGAVLL TVADKDKEDA IELARRFYQI GYQLLATNGT AEALKAADIP VTVVNKIHSA SPNILDVIRQ GKAQVVINTL TKGKQPESDG FRIRREAVEN GIPCLTSLDT AKAMLQVIES MTFSTTAMTQ GMVRV
|
| |