Gene SAG1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1042 
SymbolcarB 
ID1013846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1049346 
End bp1052528 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content37% 
IMG OID637316225 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionNP_688052 
Protein GI22537201 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGC GCACAGACAT ACGTAAAATT ATGGTTATCG GTTCTGGCCC GATTGTGATT 
GGACAAGCAG CAGAGTTTGA CTATTCTGGT ACACAAGCCT GTCTCTCTTT AAAAGAGGAG
GGATACCAAG TTGTTTTGGT CAATTCGAAT CCAGCTACTA TCATGACAGA TAAAGATATT
GCTGATAAAG TTTATATTGA ACCTATTACC TTGGAGTTTG TCACAAGAAT TCTCAGGAAG
GAAAGACCTG ATGCTCTCCT TCCAACTCTT GGAGGTCAAA CTGGATTGAA TATGGCTATG
GCTCTATCAA AAAATGGTAT CCTAGAAGAA TTGAATGTTG AACTATTAGG GACCAAATTA
TCAGCCATTG ATAAAGCTGA AGATCGTGAT TTATTTAAAC AGTTGATGGA AGAACTCAAT
CAACCTATTC CAGAATCTGA GATTGTTAAT TCGGTAGAAG AAGCTATTCA ATTTGCAGAG
CAAATCGGGT ATCCATTAAT TGTTCGTCCT GCCTTTACTT TAGGAGGTAC AGGTGGAGGC
ATGTGCGATA ATCAAGAACA ATTGGTTGAC ATCACGACGA AGGGGTTAAA GTTATCGCCT
GTGACACAGT GCCTTATTGA ACGTTCAATT GCTGGCTTTA AAGAGATTGA ATACGAAGTT
ATGCGTGATG CAGCAGATAA CGCTCTTGTT GTTTGTAATA TGGAGAATTT TGATCCTGTT
GGTATCCATA CAGGGGACTC TATAGTTTTT GCACCCGCAC AAACGTTATC TGATGTGGAA
AATCAGTTAT TACGAGATGC GAGTTTAGAT ATTATAAGAG CTTTGAAAAT TGAAGGAGGA
TGCAATGTTC AACTAGCCCT TGATCCGAAT AGTTTTAAAT ACTATGTCAT TGAAGTTAAT
CCAAGAGTAT CAAGATCTTC TGCTTTAGCT TCAAAAGCTA CTGGATATCC AATTGCTAAA
TTAGCAGCTA AGATAGCTGT TGGTTTGACA CTAGATGAGG TAATTAATCC TATTACAAAA
ACGACTTACG CTATGTTTGA ACCTGCTTTA GATTACGTAG TCGCAAAAAT GCCGAGATTT
CCTTTTGATA AATTTGAAAG TGGCGATAGA AAATTGGGAA CACAGATGAA AGCTACAGGC
GAAGTCATGG CAATAGGACG GAATATAGAA GAATCTTTGT TGAAAGCTTG TCGTTCACTA
GAAATAGGTG TCGATCACAT TAAAATAGCA GATTTAGATA ATGTATCAGA TGATGTTTTA
TTAGAAAAAA TAAGAAAGGC AGAAGATGAT CGCTTATTTT ATCTAGCAGA AGCTTTACGT
AGACATTATA GTATTGAAAA ATTAGCAAGT TTGACTAGCA TTGATTCTTT CTTCCTTGAT
AAGTTAAGAG TGATTGTAGA GTTAGAAGAT CTATTATCTA AAAACAGACT GGATATCAAT
ATTCTCAAAA AGGTTAAAAA TAAAGGTTTT TCAGATAAAG CTATAGCGAG TTTATGGCAA
ATAAACGAAG ATCAAGTTCG TAACATGCGA AAAGAAGCAG GAATTCTTCC AGTTTATAAG
ATGGTTGATA CATGTGCGTC TGAGTTTGAT TCAGCAACGC CCTACTTTTA TTCGACTTAT
GCCGTAGAAA ACGAGTCATT GATATCAGAT AAAGCCTCTA TTTTGGTTTT AGGATCGGGC
CCAATCCGAA TTGGACAGGG AGTTGAATTT GATTATGCAA CGGTTCATTC TGTTAAAGCC
ATCAGAGAGT CTGGTTTTGA AGCGATTATC ATGAACTCTA ATCCAGAGAC GGTCTCAACG
GATTTCTCTA TTTCAGATAA GCTTTACTTT GAACCTTTGA CTTTTGAAGA TGTTATGAAT
GTTATTGACT TAGAAAAACC TGAAGGGGTC ATCTTACAAT TCGGTGGTCA GACTGCTATT
AATCTAGCAA AAGATTTAAA CAAGGCAGGG GTTAAAATAC TAGGCACACA ATTAGAAGAT
TTGGATCGTG CTGAAAATAG AAAACAATTT GAAGCCACTT TACAAGCTCT AAATATCCCT
CAACCACCAG GTTTTACCGC CACTACGGAG GAAGAGGCAG TTAATGCAGC GCAAAAGATT
GGTTATCCCG TGTTGGTTAG ACCATCATAT GTTCTTGGTG GACGAGCTAT GAAGATAGTT
GAAAACGAAG AAGATTTACG TCATTATATG ACAACCGCGG TAAAGGCAAG CCCAGATCAT
CCAGTGCTTA TAGATGCATA CCTTATAGGT AAAGAGTGTG AGGTTGATGC TATATCCGAT
GGTCAGAATA TACTGATTCC TGGAATAATG GAACATATAG AGCGATCTGG TGTTCATTCA
GGAGATTCCA TGGCTGTATA TCCTCCTCAA ACATTGTCTG AGACCATTAT CGAAACTATT
GTAGATTATA CAAAACGCTT AGCAATTGGA TTAAACTGTA TCGGTATGAT GAATATTCAG
TTTGTAATCA AGGATCAGAA AGTATATGTT ATTGAAGTTA ATCCTCGTGC TAGTAGAACA
CTTCCATTCT TATCAAAAGT AACTCACATT CCTATGGCCC AAGTAGCAAC AAAAGTTATC
TTAGGAGATA AACTCTGCAA TTTTACATAT GGTTATGATC TCTATCCAGC TTCAGATATG
GTCCACATTA AAGCCCCTGT CTTTAGTTTT ACCAAACTTG CTAAAGTAGA TAGTTTATTA
GGTCCTGAAA TGAAATCAAC AGGCGAAGTG ATGGGATCAG ATATTAATCT TCAGAAAGCC
TTGTATAAAG CCTTTGAAGC AGCATATCTC CATATGCCAG ACTATGGTAA CATTGTTTTT
ACAGTAGATG ATACAGATAA AGAAGAAGCT TTAGAACTTG CAAAAGTCTA TCAAAGTATT
GGTTATCGTA TTTATGCTAC CCAAGGGACT GCAATTTATT TTGATGCTAA TGGTTTAGAG
ACAGTTTTAG TAGGTAAGTT AGGAGAGAAC GATCGAAATC ATATTCCTGA TTTAATTAAA
AATGGTAAGA TTCAAGCAGT TATCAACACA GTTGGACAAA ATAACATTGA TAACCATGAT
GCTCTCATCA TCAGACGTTC TGCAATTGAA CAAGGAGTTC CTCTATTTAC ATCTTTAGAT
ACTGCACATG CAATGTTCAA AGTTCTTGAA AGCAGAGCAT TTACATTAAA AGTACTAGAT
TAA
 
Protein sequence
MPKRTDIRKI MVIGSGPIVI GQAAEFDYSG TQACLSLKEE GYQVVLVNSN PATIMTDKDI 
ADKVYIEPIT LEFVTRILRK ERPDALLPTL GGQTGLNMAM ALSKNGILEE LNVELLGTKL
SAIDKAEDRD LFKQLMEELN QPIPESEIVN SVEEAIQFAE QIGYPLIVRP AFTLGGTGGG
MCDNQEQLVD ITTKGLKLSP VTQCLIERSI AGFKEIEYEV MRDAADNALV VCNMENFDPV
GIHTGDSIVF APAQTLSDVE NQLLRDASLD IIRALKIEGG CNVQLALDPN SFKYYVIEVN
PRVSRSSALA SKATGYPIAK LAAKIAVGLT LDEVINPITK TTYAMFEPAL DYVVAKMPRF
PFDKFESGDR KLGTQMKATG EVMAIGRNIE ESLLKACRSL EIGVDHIKIA DLDNVSDDVL
LEKIRKAEDD RLFYLAEALR RHYSIEKLAS LTSIDSFFLD KLRVIVELED LLSKNRLDIN
ILKKVKNKGF SDKAIASLWQ INEDQVRNMR KEAGILPVYK MVDTCASEFD SATPYFYSTY
AVENESLISD KASILVLGSG PIRIGQGVEF DYATVHSVKA IRESGFEAII MNSNPETVST
DFSISDKLYF EPLTFEDVMN VIDLEKPEGV ILQFGGQTAI NLAKDLNKAG VKILGTQLED
LDRAENRKQF EATLQALNIP QPPGFTATTE EEAVNAAQKI GYPVLVRPSY VLGGRAMKIV
ENEEDLRHYM TTAVKASPDH PVLIDAYLIG KECEVDAISD GQNILIPGIM EHIERSGVHS
GDSMAVYPPQ TLSETIIETI VDYTKRLAIG LNCIGMMNIQ FVIKDQKVYV IEVNPRASRT
LPFLSKVTHI PMAQVATKVI LGDKLCNFTY GYDLYPASDM VHIKAPVFSF TKLAKVDSLL
GPEMKSTGEV MGSDINLQKA LYKAFEAAYL HMPDYGNIVF TVDDTDKEEA LELAKVYQSI
GYRIYATQGT AIYFDANGLE TVLVGKLGEN DRNHIPDLIK NGKIQAVINT VGQNNIDNHD
ALIIRRSAIE QGVPLFTSLD TAHAMFKVLE SRAFTLKVLD