Gene GYMC61_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1917 
SymbolcarB 
ID8525781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1936885 
End bp1940082 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content58% 
IMG OID 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_003253023 
Protein GI261419341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAC GCCGCGACAT TGAAACGATT TTAGTCATCG GCTCGGGGCC GATCGTCATC 
GGCCAGGCGG CCGAGTTCGA CTACGCAGGG ACGCAAGCGT GCTTAGCCCT AAAAGAAGAA
GGATACAAAG TCATTCTGGT GAACTCAAAC CCGGCGACGA TCATGACCGA TACGGAAATC
GCCGATAAAG TCTATATGGA GCCGCTGACG CTCGATTTTG TCGCCCGCAT CATCCGCAAA
GAGCGGCCGG ACGCGATTTT GCCGACGCTT GGCGGCCAGA CCGGCCTCAA CTTGGCGGTT
GAGCTCGCCA AAGCCGGCGT GCTCGAGGAG TGCGGCGTGG AAATTCTCGG CACGAAGCTC
GAGGCGATCG AAAAGGCGGA AGACCGCGAA CAGTTTCGCG CCCTCATGAA CGAGCTTGGC
GAGCCGGTGC CGGAAAGCGC GATTATTCAC AGCTTGGAAG AGGCATACGC CTTTGTCGAA
CAAATCGGCT ATCCGGTCAT CGTCCGCCCG GCGTTCACGC TCGGCGGCAC GGGCGGCGGC
ATTTGCACGA ACGAAGAAGA GCTTATTGAA ATCGTCTCAA CCGGATTGAA GATGAGCCCG
GTGCACCAAT GCTTGCTCGA GCGGAGCATC GCCGGCTACA AGGAGATTGA ATACGAAGTG
ATGCGCGACG CCAATGACAA CGCGATTGTC GTCTGCAACA TGGAAAACAT CGACCCGGTC
GGCATTCATA CGGGCGATTC GATCGTCGTC GCCCCGAGCC AGACGTTAAG CGACCGCGAA
TATCAGCTTT TGCGGAACGC GTCCTTGAAA ATCATCCGCG CCCTTGGCAT AGAAGGCGGC
TGCAACGTCC AGCTGGCGCT TGACCCAGAC AGCTTCCGCT ATTACGTCAT TGAAGTGAAC
CCGCGCGTCA GCCGCTCGTC GGCGCTGGCG TCAAAAGCGA CGGGGTATCC GATCGCGAAA
CTGGCGGCGA AAATCGCCGT CGGGTTGACG CTTGACGAAA TGATCAACCC GGTCACCGGG
AAAACGTACG CCTGCTTCGA GCCGGCGCTT GATTATGTGG TGACGAAAAT TCCGCGCTTT
CCGTTTGACA AATTTGAATC GGCGAACCGC CGCCTTGGCA CGCAAATGAA AGCGACCGGC
GAAGTGATGG CGATCGGCCG GACGTTCGAA GAATCGCTGT TAAAAGCCGT TCGTTCGCTC
GAGATCGGAG TGCATCATCT GGAATTGAAC GAGGCGAAAA CGGCCGCCGA CGACGTGATC
GAAAAACGGA TCCGCAAGGC GGGCGATGAG CGGCTCTTTT ACATCGCCGA GGCGCTCCGC
CGCGGGGTGA CGGTGGAGAC GCTGCATGAA TGGAGCCAAA TCGACCGCTT TTTCCTGCAT
AAAATCCAGA ACATCATCGA GATGGAAACG GTCTTGAAAA ACCACCCGGG CGACCTTGAC
GTGTTAAAGA AAGCGAAAGC GCTCGGCTTC TCCGATGCGG CGATAGCGGC GTTATGGAAC
AAAACGGAGC GCGACATTTA CGCGCTGCGC CGCCAACACG GCATCATGCC GGTGTACAAA
ATGGTCGATA CGTGCGCAGC CGAATTCACG TCGGAAACGC CGTACTACTA CAGCACGTAC
GAGGAAGAAA ACGAATCGAT CGTGACGGAA AAACCGAGCG TCGTCGTGCT TGGCTCCGGC
CCGATCCGCA TCGGCCAAGG GATCGAATTT GACTATGCGA CCGTCCATTG CGTCTGGGCC
ATTAAGCAGG CCGGTTATGA GGCGATCATC ATCAACAACA ACCCGGAAAC GGTCTCGACC
GACTTCAGCA CATCGGACAA ACTGTATTTC GAACCGCTGA CGGCCGAAGA CGTGATGCAT
GTCATCGACC TCGAGCAGCC GGTCGGCGTC ATCGTCCAAT TCGGCGGTCA GACGGCCATC
AACTTGGCGG CGGAACTCGA GGCGCGCGGC GTCCGCTTGC TTGGGACGAC GCTTGAAGAT
TTGGACCGCG CCGAAGACCG CGACAAATTT GAACAGGCGC TCTCGGAACT CGGCATTCCC
AAACCGGCCG GCAAAACCGC CGTTTCGGTC GAAGAAGCGG TCGCCATCGC CGAGGAGATC
GGCTACCCGG TGCTCGTCCG TCCTTCGTAC GTGCTCGGCG GCCGGGCGAT GGAAATCGTG
TACAATCGCG GCGAGCTGCT TCATTACATG GAACACGCCG TGCGCGTCAA TCCACAGCAC
CCAGTGCTTG TTGACCGCTA CATCACTGGC AAGGAAGTCG AAGTCGATGC CATCGCCGAC
GGCGAGACGG TCGTCATCCC CGGGATCATG GAACATATCG AGCGGGCCGG CGTCCATTCC
GGCGACTCGA TCGCCGTCTA CCCGCCCCAG ACATTAAGCG CGGAAGTGAT CGACAAGATC
GCGGATTACA CGATCCGACT GGCGCGCGGG CTGCATATTG TCGGGCTGTT GAACATCCAG
TTTGTCGTCT CGGGAAGCGA CGTCTATGTC TTGGAAGTGA ACCCGCGCTC AAGCCGCACG
GTGCCGTTTT TAAGCAAAAT CACCGGCGTG CCGATGGCGA ATCTCGCCAC GAAAGCCATT
TTAGGGGCGA AGCTCGCCGA CATGGGCTAC ACAACCGGCG TCTGCCCGGT GCGCTCCGGC
GTGTACGTGA AAGTGCCGGT GTTCTCGTTT GCGAAATTGC GCAACGTCGA CATTTCGCTC
GGCCCGGAAA TGAAGTCGAC CGGCGAAGTG ATCGGCAAGG ACGTGACGTT TGAAAAAGCG
CTCTATAAGG GGCTTGTCGC CTCGGGCATC CATATCCAGC CGCATGGAGC AGTGCTGTTG
ACGGTGGCCG ACAAAGATAA AGAAGAAGCG GTCGACCTGG CGCGCCGCTT TGCCGACATC
GGCTACCAGC TGTTGGCGAC GAACGGCACG GCGGAAACAT TGAAAGCGGC CGGCATTCCG
GTGACGGTCG TCAATAAAAT CCACTCGGCG TCGCCGAACA TTTTGGATGT CATCCGCCAA
GGGAAGGCGC AAGTCGTCAT CAATACGCTG ACGAAAGGAA AGCAGCCGGA AAGCGACGGC
TTCCGCATCC GCCGCGAAGC GGTCGAAAAC GGCATCCCGT GCTTGACGTC GCTCGATACG
GCCCGGGCGA TGCTGCAAGT GCTCGAATCG ATGACATTTT CAACGACGGC GATGACGGAA
GGGCTGGTGC GGTCATGA
 
Protein sequence
MPKRRDIETI LVIGSGPIVI GQAAEFDYAG TQACLALKEE GYKVILVNSN PATIMTDTEI 
ADKVYMEPLT LDFVARIIRK ERPDAILPTL GGQTGLNLAV ELAKAGVLEE CGVEILGTKL
EAIEKAEDRE QFRALMNELG EPVPESAIIH SLEEAYAFVE QIGYPVIVRP AFTLGGTGGG
ICTNEEELIE IVSTGLKMSP VHQCLLERSI AGYKEIEYEV MRDANDNAIV VCNMENIDPV
GIHTGDSIVV APSQTLSDRE YQLLRNASLK IIRALGIEGG CNVQLALDPD SFRYYVIEVN
PRVSRSSALA SKATGYPIAK LAAKIAVGLT LDEMINPVTG KTYACFEPAL DYVVTKIPRF
PFDKFESANR RLGTQMKATG EVMAIGRTFE ESLLKAVRSL EIGVHHLELN EAKTAADDVI
EKRIRKAGDE RLFYIAEALR RGVTVETLHE WSQIDRFFLH KIQNIIEMET VLKNHPGDLD
VLKKAKALGF SDAAIAALWN KTERDIYALR RQHGIMPVYK MVDTCAAEFT SETPYYYSTY
EEENESIVTE KPSVVVLGSG PIRIGQGIEF DYATVHCVWA IKQAGYEAII INNNPETVST
DFSTSDKLYF EPLTAEDVMH VIDLEQPVGV IVQFGGQTAI NLAAELEARG VRLLGTTLED
LDRAEDRDKF EQALSELGIP KPAGKTAVSV EEAVAIAEEI GYPVLVRPSY VLGGRAMEIV
YNRGELLHYM EHAVRVNPQH PVLVDRYITG KEVEVDAIAD GETVVIPGIM EHIERAGVHS
GDSIAVYPPQ TLSAEVIDKI ADYTIRLARG LHIVGLLNIQ FVVSGSDVYV LEVNPRSSRT
VPFLSKITGV PMANLATKAI LGAKLADMGY TTGVCPVRSG VYVKVPVFSF AKLRNVDISL
GPEMKSTGEV IGKDVTFEKA LYKGLVASGI HIQPHGAVLL TVADKDKEEA VDLARRFADI
GYQLLATNGT AETLKAAGIP VTVVNKIHSA SPNILDVIRQ GKAQVVINTL TKGKQPESDG
FRIRREAVEN GIPCLTSLDT ARAMLQVLES MTFSTTAMTE GLVRS