Gene GYMC61_1598 details


Gene Information

Locus tag: GYMC61_1598
Symbol: carB
ID: 8525461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1623588
End bp: 1626716
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_003252713
Protein GI: 261419031
COG category:
COG ID:
TIGRFAM ID:
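
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1623588
end_bp = 1626716
gene_length_bp = 3129
protein_length_aa = 1042

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is the stop and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions, the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.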


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAAG ATTCTTCGCT TCAGTCGATT CTCCTGATCG GGTCGGGGCC GATTGTCATC 
GGCCAGGCCG CCGAGTTTGA TTATTCCGGC ACACAAGCGT GCATCGCCTT AAAGGAAGAA
GGATATCGCG TCATTTTAGT GAACAACAAT CCGGCGACGA TCATGACCGA TGAAGTTCAT
GCCGATGCCG TGTATTTTGA ACCGCTCACC GTCGATGTGC TCGAAGCGAT TATTGCCAAA
GAACGCCCGG ACGGGCTGCT CGCCACGTTC GGCGGCCAGA CAGGGCTCAA CTTGGCGTTT
CAGCTGCATG AAGCCGGCGT GCTTGAAAAG TATGGGGTGA GACTGCTCGG AACACCGATT
GAAGCGATCA AGCGCGGGGA AGACCGCGAA GCGTTCCGCG CGTTAATGCA TGAGCTTGGC
GAACCGGTGC CGGAGAGCGA AATTGTCACA AGCGTCGAGG AAGCGGTCGC GTTTGCCGAA
CAAATCGGTT TTCCGATCAT TATTCGTCCC GCGTATACGC TCGGCGGGAC GGGCGGCGGC
ATTGCCGAAA ACATGGAGCA GTTTCTCGCG CTCGTGGAAA AAGGGCTGGC CGAAAGCCCG
ATCCGCCAAT GTTTGATCGA ACGGAGCGTC GCTGGGTTTA AAGAAATTGA ATACGAAGTG
ATGCGCGACC AGTCGAATAC GTGCATCACG GTTTGCAATA TGGAAAACGT CGATCCAGTC
GGCATCCATA CGGGCGATTC GATCGTCGTC GCACCGTCGC AGACGTTGAC CGATGAGGAG
TACCAAATGC TCCGTTCGTC GGCGGTGAAG ATCATTTCCG CATTAGGGAT CATCGGCGGC
TGCAACATTC AGTTCGCCCT TGACCCGAAC AGCAAACAAT ATTACTTAAT CGAAGTCAAC
CCGCGCGTCA GCCGCTCGTC AGCGCTCGCC TCGAAAGCGA CCGGCTACCC AATCGCCCGC
ATTGCGGCGA AGCTGGCGGT TGGCTATACG CTCGCGGAAC TCGTCAATCC GGTGACGAAA
ACGACGTACG CCAGCTTCGA GCCGGCCTTG GACTACGTCG TCGTCAAGTT TCCGCGCTTG
CCGTTTGACA AGTTCCCGCA CGCGGATCGG AAGCTCGGTA CGCAAATGAA AGCGACCGGG
GAAGTGATGG CGATCGATCG CAATATGGAG CGGGCGTTCC AAAAAGCGGT GCAGTCGCTC
GAAGGAAAAA ACAACGGACT GTTTTTGCCG GAGCTTTCGG CGAAAACGGA TGACGAGCTG
AAACAACTGC TTGTTGACAA AGACGACCGC CGCTTTTTCG CCATTTTGGA ACTGCTCCGC
CGCGGGGTGG CGGTCGAAGC CATTCACGAA TGGACGAAAA TCGACCGCTT TTTCCTTGCT
TCGTTCGAAC GGCTCGTGGC GCTCGAAAAA CAGGCGGCAG CCACCACGAT CGATACGATC
GATGAGCCGA CGTTGCGCTT CTTGAAAGAA AAAGGGGCAA GCGACGCCTT TTTGGCCGAA
ACGTGGGGCG TGACGGAGCT TGATGTGCGC AACAAGCGGA AGGAACTTGG CATCGTGCCG
TCATACAAAA TGGTCGACAC GTGCGCGGCC GAATTTCATT CGGAAACGGA TTATTACTAC
TCGACCTATT TCGGCGAAGA CGAGCGGAAG AAAGCGAGCG GCAAGGAGAA AGTGCTGATC
ATTGGAGCCG GACCTATTCG CATCGGCCAA GGCATTGAGT TTGACTACAG CTCCGTTCAT
AGCGTGTTCG CCTTGCAAAA AGAAGGCTAT GAAACGGTGA TGATCAACAA CAATCCGGAA
ACAGTGAGCA CCGACTTTGC CGTCGCCGAC CGTCTGTACT TTGAACCGCT GACGCTTGAG
AGCGTCCTCG ATGTTATCGA AGCGGAACAA ATCAAGCATG TCATCGTTCA ATTTGGCGGA
CAGACGGCGA TCAATTTGGT CAAAGGGCTT GAAGAAGCCG GCGTGTCGCT GTTGGGCGTC
ACGTACGATA TGATTGACCA GCTCGAGGAC CGCGATCGTT TTTACCAGTT GCTTGAGGAG
CTTGACATCC CGCACGTACC GGGCTTGATG GCGAACAACG CCGAGGAGCT CGCCGCCAAA
GCGGCTGAGA TCGGCTGCCC GGTGCTGCTT CGTCCGTCGT ATGTCATCGG GGGCCGCGGC
ATGTTTATCG TCCATAGCGA GGCGCAGCTC GCTGCTTTGA TCGAGCAAGG CGAGTTGACC
TATCCGATTT TGATTGATGC GTATTTGGAC GGGAAAGAGG CAGAAGCGGA CATCGTGACG
GATGGAACAG ACATCGTGCT GCCGGTCATC ATTGAACACG TCGAAAAAGC CGGCGTCCAC
TCCGGTGACA GCTATGCTTG GCTGCCGGCG CAGACGCTCA CAGACGAAGA AAAAGCGAAA
ATCATCGACT ATGCGAGCCG GATTGCGAAA AAACTCGGCT TTAAAGGAAT TATGAACATT
CAATATGTCA TTGCGGACGG CAATGTATAT GTCCTGGAAG TGAACCCGCG CGCGAGCCGG
ACGGTGCCGA TCGTCAGCAA AACGACCGGC GTGCCATTGG CGCAAATTGC GACAAAGTTA
TTGCTTGGGA AATCACTCGT CGACATCGTT GACGAAAAAG CTCGCGGATT GGAGGTCATG
CCGTACGCCG TGTTAAAGTA TCCGGTCTTT TCGACCCATA AGCTGCCGGG GGTTGACCCG
ATGGTTGGAC CGGAGATGAA ATCGACCGGT GAAGGCATCA GCATCGCCGC GACGAAGGAA
GAAGCGGCGT ACAAAGCGTT TTACGCGTAC TTGCGAAAGA AAGCAAACGC CAATGAAATG
TATGTGATTG GCGGCATCGA TGAGACGATG GCCGCGGAAA TCGAAGCGAA ACAGCTTGTG
ATCGTGTCAG ATGTCCCGTT TGCCGATTGG GTGAAGCGCG ACACGGCGCT CGCGGTGATC
AACTTGGGCA AAGAGGAAGA CGAGGCAAAC AAACGAATGA CTGCGTTGTC CCGACAATTG
CTCGTCTTCA CGGAACGTGA GACATTGAAG CTCTTCTTGC AGGCGCTCGA TGTGGATCAT
CTTGATGTGC AGCCGATCCA CGGCTGGTTG GAAAAGAAAA AACAGGCAGA ACAGGCGGTG
ATCGCATGA
 
Protein sequence
MPKDSSLQSI LLIGSGPIVI GQAAEFDYSG TQACIALKEE GYRVILVNNN PATIMTDEVH 
ADAVYFEPLT VDVLEAIIAK ERPDGLLATF GGQTGLNLAF QLHEAGVLEK YGVRLLGTPI
EAIKRGEDRE AFRALMHELG EPVPESEIVT SVEEAVAFAE QIGFPIIIRP AYTLGGTGGG
IAENMEQFLA LVEKGLAESP IRQCLIERSV AGFKEIEYEV MRDQSNTCIT VCNMENVDPV
GIHTGDSIVV APSQTLTDEE YQMLRSSAVK IISALGIIGG CNIQFALDPN SKQYYLIEVN
PRVSRSSALA SKATGYPIAR IAAKLAVGYT LAELVNPVTK TTYASFEPAL DYVVVKFPRL
PFDKFPHADR KLGTQMKATG EVMAIDRNME RAFQKAVQSL EGKNNGLFLP ELSAKTDDEL
KQLLVDKDDR RFFAILELLR RGVAVEAIHE WTKIDRFFLA SFERLVALEK QAAATTIDTI
DEPTLRFLKE KGASDAFLAE TWGVTELDVR NKRKELGIVP SYKMVDTCAA EFHSETDYYY
STYFGEDERK KASGKEKVLI IGAGPIRIGQ GIEFDYSSVH SVFALQKEGY ETVMINNNPE
TVSTDFAVAD RLYFEPLTLE SVLDVIEAEQ IKHVIVQFGG QTAINLVKGL EEAGVSLLGV
TYDMIDQLED RDRFYQLLEE LDIPHVPGLM ANNAEELAAK AAEIGCPVLL RPSYVIGGRG
MFIVHSEAQL AALIEQGELT YPILIDAYLD GKEAEADIVT DGTDIVLPVI IEHVEKAGVH
SGDSYAWLPA QTLTDEEKAK IIDYASRIAK KLGFKGIMNI QYVIADGNVY VLEVNPRASR
TVPIVSKTTG VPLAQIATKL LLGKSLVDIV DEKARGLEVM PYAVLKYPVF STHKLPGVDP
MVGPEMKSTG EGISIAATKE EAAYKAFYAY LRKKANANEM YVIGGIDETM AAEIEAKQLV
IVSDVPFADW VKRDTALAVI NLGKEEDEAN KRMTALSRQL LVFTERETLK LFLQALDVDH
LDVQPIHGWL EKKKQAEQAV IA
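
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last blocks of the gene sequence. This is a minimal sketch, not the database's own method. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The `translate` helper is a hypothetical name introduced for illustration.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 shares its codon-to-amino-acid assignments with the
# standard code, so this table is adequate for an internal translation check.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 nt) and its final 9 nt.
first_block = "ATGCCTAAAG ATTCTTCGCT TCAGTCGATT CTCCTGATCG GGTCGGGGCC GATTGTCATC"
last_block = "ATCGCATGA"

print(translate(first_block))  # MPKDSSLQSILLIGSGPIVI
print(translate(last_block))   # IA
```

The two results correspond to the first 20 and the last 2 residues of the protein sequence listed above.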