Gene GWCH70_1047 details

Gene Information

Locus tag: GWCH70_1047
Symbol: carB
ID: 7976827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1097549
End bp: 1100746
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 50%
IMG OID: 644798000
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_002949173
Protein GI: 239826549
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
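
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1097549
END_BP = 1100746
GENE_LENGTH_BP = 3198
PROTEIN_LENGTH_AA = 1065


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the computed values with the ones listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons agree with the listed values: 3198 bp corresponds to 1066 codons, of which the last is the stop codon.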


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAAC GCCAAGACAT TGAAACGATT TTAGTGATCG GTTCCGGCCC GATTGTCATC 
GGCCAGGCGG CGGAGTTTGA TTATGCGGGC ACACAGGCAT GTTTGGCGCT AAAGGAAGAA
GGATACAAAG TTATTTTAGT TAACTCCAAT CCAGCAACGA TTATGACCGA TACGGAAATT
GCCGACAAAG TATATATGGA GCCGCTCACG CTCGAATTCG TTTCTCGCAT TATCCGCAAA
GAACGTCCGG ACGCGATTTT GCCGACACTT GGCGGACAGA CTGGGCTAAA CTTGGCGGTC
GAGCTTGCTA GAACGGGGGT GCTCGCAGAA TGCGGCGTTG AAATTTTAGG AACAAAATTA
GAAGCAATTG AAAAAGCGGA AGACCGCGAA CAATTTCGCG CGCTCATGAA CGAACTTGGC
GAACCGGTTC CGGAAAGCGA AATTATTCAT AGCTTGGAAG AAGCATACGC TTTCGTGGAA
AAGGTCGGCT ATCCGGTTAT CGTCCGCCCG GCGTTTACGC TCGGCGGCAC TGGCGGCGGC
ATTTGCAAAA ACGAGGAAGA ATTGATCGAT ATCGTTTCTA CTGGGTTAAA ACTAAGCCCT
GTTCATCAAT GCTTGCTAGA AAAAAGCATC GCGGGCTATA AAGAAATCGA GTATGAAGTG
ATGCGCGACG CCAACGATAA CGCGATCGTC GTCTGCAATA TGGAAAATAT CGATCCGGTC
GGCATTCACA CCGGCGATTC GATCGTCGTC GCTCCAAGCC AAACGCTAAG CGACCGCGAA
TATCAATTGC TGCGCAACGC GTCGCTAAGA ATCATTCGCG CTCTTGGCAT CGAAGGCGGC
TGCAATGTGC AGCTGGCGCT CGATCCGCAT AGCTTCCATT ATTACGTCAT TGAAGTCAAC
CCGCGCGTCA GCCGTTCATC GGCGCTGGCG TCAAAAGCTA CCGGCTACCC GATTGCCAAG
CTCGCCGCGA AAATCGCCGT CGGCTTAACG TTAGATGAAA TCATTAATCC GGTGACAGGA
AAAACATACG CTTGCTTTGA ACCAACGCTC GATTACGTTG TGACGAAAAT TCCGCGCTTT
CCGTTCGATA AATTCGAATC GGCCAACCGC CGTCTTGGCA CGCAAATGAA AGCTACAGGC
GAAGTGATGG CGATCGGACG GACGCTAGAA GAGTCACTGT TAAAAGCGGT TCGCTCCCTT
GAGACGAACG TCTACCATCT CGAACTTAAA GATGCCGAAA ATGTATCAGA TGAGTTAATC
GAAAAGCGGA TTCGCAAAGC GGGAGATGAA CGCCTCTTCT ATATTGCCGA GGCGCTGCGC
CGCGGATTTA CTGTCGAGCA AATTCACGAG TGGAGCCAAA TTGACCGATT TTTCTTAACC
AAAATCGAAA ACATCGTCCG CTTTGAAAAC GTTGTTCGTG ACTATAAGGG GGATATCGAA
GTACTGCGAA AAGCGAAAGA AATGGGCTTT TCCGATGTAG CCATCGCCAA GCTTTGGAAC
AAGAGTGAGC GCGATGTGTA TGAGATGCGC AAACAAGCAG GAATCATTCC TGTATATAAA
ATGGTGGATA CATGCGCGGC GGAATTTGAA TCAGAAACGC CGTATTACTA CAGCACGTAC
GAAGACGAAA ACGAATCGGT CGTCACCGAC CGAGAAAGCG TCGTTGTGCT CGGCTCAGGA
CCAATTCGCA TCGGGCAAGG GATTGAATTC GATTATGCGA CCGTTCATTC GGTCTGGGCG
ATTAAAGAAG CGGGCTATGA GGCGATTATT ATCAACAACA ATCCGGAAAC GGTGTCGACC
GACTTCAGCA TATCGGACAA ATTATATTTC GAGCCGTTAA CCATTGAAGA TGTGATGCAT
GTCATTGATT TAGAAAAGCC GATCGGGGTT ATCGTGCAAT TCGGCGGCCA GACGGCAATC
AACTTGGCGG CCGAATTAGC GGCGCGAGGC GTCCGCATTT TAGGAACGTC GCTTGAGGAC
TTAGACCGCG CCGAAGACCG TGACAAATTT GAACAAACGT TATCGGAGCT AGGCATTCCG
CAGCCGCAAG GAAAAACGGC ATTTTCCGTC GAGGAAGCGG TGCGGATTGC TGAGGAAATC
GGCTATCCGG TGCTTGTTCG TCCATCGTAT GTTCTTGGCG GCCGCGCAAT GGAAATCGTG
TATCAAGAAG AAGAGCTATT GCACTACATG GAGCACGCCG TCAAAGTGAA CCCGCAGCAC
CCGGTGCTCA TCGACCGCTA TTTAATCGGA AAAGAAATCG AAGTCGATGC GATTTCTGAT
GGAGAAACGG TGTTTATTCC GGGAATTATG GAACATATCG AACGGGCGGG CGTGCATTCC
GGCGACTCGA TTGCCGTTTA TCCGCCGCAA ACGTTAACGA AGGACATCCA GCAAAAAATC
GTCGATTACA CGATTAAATT GGCAAGAGGA TTGCGAATTG TCGGACTGCT GAACATCCAA
TTTGTCATGT ACCAAGGCGA AGTGTACGTG CTGGAGGTGA ATCCGCGCTC AAGCCGCACC
GTTCCGTTTT TAAGCAAAAT TACCGGCGTG CCGATGGCGA ATATTGCGAC GAAAGTCATT
TTAGGCGCGA AGCTCGCGGA ACTTGGCTAT GAAACAGGCT TGAGGCAAGA AAGCGAAGGA
GTATACGTGA AAGCGCCGGT CTTCTCATTC GCAAAACTGC GCAACGTCGA TATTTCGCTC
GGCCCGGAAA TGAAGTCGAC TGGCGAAGTG ATCGGCAAAG ACGTGACATT TGAAAAAGCG
TTATATAAAG GGTTAGTTGC TTCGGGAATC CATATTCGTC CATACGGAGC TGTCCTATTA
ACGGTTGCCG ATAAAGATAA AGAAGATGCG ATTGAACTTG CAAGACGTTT CTATCAAATT
GGCTATCAGC TGCTCGCGAC AAACGGCACG GCGGAAGCGT TAAAAGCGGC GGACATTCCG
GTAACCGTCG TCAATAAAAT CCATTCCGCA TCGCCGAACA TTTTAGATGT GATTCGTCAA
GGGAAAGCGC AAGTTGTCAT CAACACGCTT ACAAAAGGAA AACAGCCGGA AAGCGATGGA
TTCCGCATTC GCCGTGAAGC GGTCGAAAAC GGCATTCCAT GCTTAACCTC GCTGGATACG
GCGAAGGCGA TGCTTCAAGT CATCGAATCG ATGACGTTTT CGACAACGGC GATGACACAA
GGGATGGTGC GCGTATGA
 
Protein sequence
MPKRQDIETI LVIGSGPIVI GQAAEFDYAG TQACLALKEE GYKVILVNSN PATIMTDTEI 
ADKVYMEPLT LEFVSRIIRK ERPDAILPTL GGQTGLNLAV ELARTGVLAE CGVEILGTKL
EAIEKAEDRE QFRALMNELG EPVPESEIIH SLEEAYAFVE KVGYPVIVRP AFTLGGTGGG
ICKNEEELID IVSTGLKLSP VHQCLLEKSI AGYKEIEYEV MRDANDNAIV VCNMENIDPV
GIHTGDSIVV APSQTLSDRE YQLLRNASLR IIRALGIEGG CNVQLALDPH SFHYYVIEVN
PRVSRSSALA SKATGYPIAK LAAKIAVGLT LDEIINPVTG KTYACFEPTL DYVVTKIPRF
PFDKFESANR RLGTQMKATG EVMAIGRTLE ESLLKAVRSL ETNVYHLELK DAENVSDELI
EKRIRKAGDE RLFYIAEALR RGFTVEQIHE WSQIDRFFLT KIENIVRFEN VVRDYKGDIE
VLRKAKEMGF SDVAIAKLWN KSERDVYEMR KQAGIIPVYK MVDTCAAEFE SETPYYYSTY
EDENESVVTD RESVVVLGSG PIRIGQGIEF DYATVHSVWA IKEAGYEAII INNNPETVST
DFSISDKLYF EPLTIEDVMH VIDLEKPIGV IVQFGGQTAI NLAAELAARG VRILGTSLED
LDRAEDRDKF EQTLSELGIP QPQGKTAFSV EEAVRIAEEI GYPVLVRPSY VLGGRAMEIV
YQEEELLHYM EHAVKVNPQH PVLIDRYLIG KEIEVDAISD GETVFIPGIM EHIERAGVHS
GDSIAVYPPQ TLTKDIQQKI VDYTIKLARG LRIVGLLNIQ FVMYQGEVYV LEVNPRSSRT
VPFLSKITGV PMANIATKVI LGAKLAELGY ETGLRQESEG VYVKAPVFSF AKLRNVDISL
GPEMKSTGEV IGKDVTFEKA LYKGLVASGI HIRPYGAVLL TVADKDKEDA IELARRFYQI
GYQLLATNGT AEALKAADIP VTVVNKIHSA SPNILDVIRQ GKAQVVINTL TKGKQPESDG
FRIRREAVEN GIPCLTSLDT AKAMLQVIES MTFSTTAMTQ GMVRV