Gene GWCH70_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0741 
SymbolcarB 
ID7979484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp816428 
End bp819550 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content43% 
IMG OID644797719 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002948893 
Protein GI239826269 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAG ATACATCGCT GCAATCGATT TTAATCATCG GATCCGGTCC GATTGTCATC 
GGACAGGCGG CGGAGTTTGA CTATTCCGGC ACGCAAGCGT GTATCGCCTT AAAAGAAGAA
GGGTACCGCG TTATTTTAGT GAATAACAAT CCGGCAACGA TTATGACAGA CGAAGTTCAT
GCCGATGCCG TGTACTTTGA GCCGTTGACG GTGGATAGCG TGGAGGCGAT TATCGCGAAA
GAGCGACCAG ACGGGCTATT GGCGACATTT GGGGGACAGA CGGGACTAAA CTTAGCATTT
CAGCTTCATG AAGAAGGAAT TTTAGAAAAA TATGGGGTGA AGCTGCTCGG AACTCCGATT
GAAGCGATTA AAAGAGGAGA GGACCGCGAA GCGTTTCGCG CGCTCATGTA TGAATTAGGC
GAACCAGTTC CGGAGAGTGA AATTATTACG AGTGTCGATG AAGCGGTCGC TTTTGCCGAA
AAAATCGGAT TTCCGATCAT TATTCGCCCC GCCTATACAC TCGGAGGAAC GGGCGGGGGA
ATCGCAGAAA CGATGGAACA ATTTATTGAT CTAGTCGAAA AAGGACTCGC GGAAAGTCCA
ATTACGCAAT GTCTTGTTGA ACGAAGCGTA GCTGGATATA AGGAAATTGA ATATGAAGTA
ATGCGCGACC ATACGAATAC GTGCATTACT GTTTGCAATA TGGAAAACGT CGATCCCGTC
GGCATCCATA CCGGTGACTC TATCGTTGTT GCTCCATCGC AAACGTTGAC GGACGAGGAA
TACCAAATGT TGCGATCTTC AGCCATCAAA ATTATTTCGG CGCTCGGTAT CATCGGAGGA
TGCAACATTC AGTTCGCGCT AGATCCGTTT AGCAAACGAT ACTATTTAAT TGAAGTAAAT
CCGCGTGTCA GCCGTTCATC GGCGCTTGCA TCGAAAGCGA CGGGATACCC GATCGCACGC
ATAGCCGCGA AATTAGCAGT CGGCTATACA TTGGCGGAAC TTCTTAATCC AGTAACGAAA
ACGACGTACG CAAGCTTTGA ACCAGCACTC GATTATGTCG TTGTAAAATT TCCGCGCTTG
CCGTTTGATA AATTTCCTTT CGGCGATCGC CAGCTTGGCA CGCAAATGAA AGCGACGGGA
GAAGTGATGG CGATTGACCG CAACATGGAA CGAGCGTTTC AAAAAGCGGT TTACTCATTA
GAAGGTGCGA ATAACGGATT GTATTTACCG GAACTTGCTT CCCATACAGA TGATGAGCTC
AAACAACTGC TTGTTCGAAA AGATGATCGC CGCTTTTTTG CGATTCTCGA ATTGTTCCGT
CGCGGAGAAA GGATCGATAC CGTATATGAA TTAACAAAAA TCGACCGTTT CTTTTTGCAT
TCGTTTTATC AGCTGATCGA ATTAGAAAAG AAAGCGAAAG AAACGAGCTT AGAACATATT
GATGAATCGA CGTTCCGTCT GTTGAAAGAA AAAGGGTTTT CTGATGCGTT TTTAGCGGAA
GTATGGAATG TGAAGGAAAA AGACGTAAGA GAAAAGCGAA AACAACTTGG CATCGTTCCG
GCATATAAAA AGGTCGATAC GTGCGCGGCA GAATTCCATT CAGAAACGGA TTATTACTAT
TCGACATATT TCGGCGAAGA TGAACGGAAA AAAAGCGATA AAGAGAAAGT ATTGATTATT
GGCGCAGGAC CGATTCGAAT CGGGCAAGGA ATCGAGTTTG ATTACAGCTC TGTCCACAGT
GTGTACGCAC TTCAAGAAGA AGGATATGAA ACGGTATTGA TGAACAATAA TCCGGAAACC
GTCAGCACTG ATTTTGCTGT GGCTGACCGT CTCTATTTCG AGCCGCTTAC TTTAGAAGAA
GTGTTGAACG TGATAGAAGC GGAACAGATT AAAAAAGTCA TTGTTCAGTT CGGAGGACAG
ACGGCAATTA ATCTCGTGAA AGGGCTCGAA GAAGCGAATA TCACTCTTCT CGGCGTCACT
TATGATGTGA TTGACCAGCT TGAAGATCGT GATCGTTTTT ATCAGCTATT AGAAGAATTG
GATATTCCAC ACGTACCGGG AATGATGGCA AATAACGAAG AAGAACTTAT TTCGAACGCC
AAAAAAATCG GTTATCCGAT ATTGCTTCGT CCTTCTTATG TCATTGGCGG CAGAGGAATG
TTCATTGTGA AAAATGAACA ACAATTATGC GCGCTGCTAG AGCAGCGAAT AATTACGTAC
CCTGTGCTGA TCGATGCATA TTTAGACGGA AAAGAAGCAG AAGTCGATGT TGTTGCGGAT
GGAAACGATA TTTTACTTCC GACGATTATC GAACATGTGG AAAAGGCTGG AGTACATTCA
GGAGACAGCT ATGCAATGCT GCCAGCGCAA ACGATTACGG ATGAAGAACA ATCCAAAATC
ATTACGTACG CCGAAAAAAT TGTAAAGAAA TTACAATTTA AAGGCATTAT GAATATTCAA
TATGTCATTG CCAACGATCA GGTATACGTA TTGGAAGTGA ACCCGCGCGC AAGCCGGACC
GTTCCGATCG TCAGCAAGGC AACAGGCATA CCGCTCGCGC AAATTGCGAC AAAATTATTG
CTTGGGAAAT GTTTAAGCGA TGTCGTTGAC GAAAAACAGC GGCGTTTGGC GAGCTTGCCA
TATATTGTTT TAAAATATCC GGTATTTTCG ACATATAAAT TGCCGGGAGT CGATCCGCTT
GTTGGACCAG AAATGAAATC GACGGGAGAA GGAATCAGCA TTGCAAAAAC AATGGAAGAA
GCGGCGGCAA AGGCGTTTTA TTCGTATTTA GCAAAAAAAC ATAAAGCACG AGAAATTTAT
GTGAACGGTG AAATAAGCGA TGAACTTCTT CAAATCATAA AGGAAAAACA ATTAATCATT
GTTTCCGACA TACCGTTTTC CGAATGGATA AAACGAAAGG AAGCGTTAGC GTTTTTAGAT
TTGCAAAAAG ACGGAAACAG CCAATATAGA ATGCTTGCGT TGTCCCGGCA AATTATGACG
TTTACAGAAA TCGAAACGTT CTGTCTCTTT TTACAAGCAG TCGGTGTGAA AAAGTTTTCT
GTTTCATCCA TTCAAGAATG GCTTGAAAAG AAAAAACAAA TCGAGAAAGC GGTGATTATA
TGA
 
Protein sequence
MPKDTSLQSI LIIGSGPIVI GQAAEFDYSG TQACIALKEE GYRVILVNNN PATIMTDEVH 
ADAVYFEPLT VDSVEAIIAK ERPDGLLATF GGQTGLNLAF QLHEEGILEK YGVKLLGTPI
EAIKRGEDRE AFRALMYELG EPVPESEIIT SVDEAVAFAE KIGFPIIIRP AYTLGGTGGG
IAETMEQFID LVEKGLAESP ITQCLVERSV AGYKEIEYEV MRDHTNTCIT VCNMENVDPV
GIHTGDSIVV APSQTLTDEE YQMLRSSAIK IISALGIIGG CNIQFALDPF SKRYYLIEVN
PRVSRSSALA SKATGYPIAR IAAKLAVGYT LAELLNPVTK TTYASFEPAL DYVVVKFPRL
PFDKFPFGDR QLGTQMKATG EVMAIDRNME RAFQKAVYSL EGANNGLYLP ELASHTDDEL
KQLLVRKDDR RFFAILELFR RGERIDTVYE LTKIDRFFLH SFYQLIELEK KAKETSLEHI
DESTFRLLKE KGFSDAFLAE VWNVKEKDVR EKRKQLGIVP AYKKVDTCAA EFHSETDYYY
STYFGEDERK KSDKEKVLII GAGPIRIGQG IEFDYSSVHS VYALQEEGYE TVLMNNNPET
VSTDFAVADR LYFEPLTLEE VLNVIEAEQI KKVIVQFGGQ TAINLVKGLE EANITLLGVT
YDVIDQLEDR DRFYQLLEEL DIPHVPGMMA NNEEELISNA KKIGYPILLR PSYVIGGRGM
FIVKNEQQLC ALLEQRIITY PVLIDAYLDG KEAEVDVVAD GNDILLPTII EHVEKAGVHS
GDSYAMLPAQ TITDEEQSKI ITYAEKIVKK LQFKGIMNIQ YVIANDQVYV LEVNPRASRT
VPIVSKATGI PLAQIATKLL LGKCLSDVVD EKQRRLASLP YIVLKYPVFS TYKLPGVDPL
VGPEMKSTGE GISIAKTMEE AAAKAFYSYL AKKHKAREIY VNGEISDELL QIIKEKQLII
VSDIPFSEWI KRKEALAFLD LQKDGNSQYR MLALSRQIMT FTEIETFCLF LQAVGVKKFS
VSSIQEWLEK KKQIEKAVII