Gene SAG1043 details

Gene Information

Locus tag: SAG1043
Symbol: carA-1
ID: 1013847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1052559
End bp: 1053635
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 36%
IMG OID: 637316226
Product: carbamoyl phosphate synthase small subunit
Protein accession: NP_688053
Protein GI: 22537202
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
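
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1052559
END_BP = 1053635
GENE_LENGTH_BP = 1077
PROTEIN_LENGTH_AA = 358


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus the terminal stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1077
print(expected_protein_length(GENE_LENGTH_BP))  # 358
```

Both results agree with the listed Gene Length (1077 bp) and Protein Length (358 aa).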


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGTC TATTATTACT GGAAGATGGT AGTGTGTTTG AGGGTGAGGC TTTCGGAGCG 
GATGTAGAAA CAAGTGGTGA AATCGTTTTT AGTACAGGAA TGACAGGGTA TCAAGAATCT
ATAACAGATC AATCTTACAA CGGTCAAATT ATTACTTTTA CTTATCCTCT TATTGGAAAT
TATGGGATAA ATCGAGACGA TTACGAATCT ATTAGACCAA CATGTAAGGG AGTAGTTATT
TATGAATGGG CAGAGTATCC AAGTAACTGG CGTCAACAGA TGACACTTGA TGAATTTTTA
AAATTAAAAG GAATACCTGG AATTTCAGGT ATAGATACTC GAGCATTAAC AAAAATTATT
CGAAAACATG GAACAATGAA AGCTTGTTTA ATAAACGAGG GAAATTCTAT TCATGAAGCA
CTAGAAAACC TTCAAAAAAG TGTTCTTTTA AATGATCAAA TTGAACAAGT ATCAACCAAA
TTAGCTTATG CTTCGCCTGG AGTTGGCAAA AATATTGTAT TAGTTGACTT TGGTTTAAAA
CATTCTATTC TAAGAGAACT GTCACAACGT CAATGTCATA TTACAGTCGT TCCTCATACT
ACGACAGCGC AAGAAATCTT AAATCTCAAT CCAGATGGAG TACTCTTATC TAACGGCCCT
GGGAACCCAG AACAATTACC GAATGCTTTA CAAATGATCC AAGAAATTCA AGGTAAAATT
CCAATTTTTG GTATTTGTAT GGGACATCAA CTATTTGCTA AAGCTAATGG TGCAAAGACT
TATAAAATGA CTTTTGGTCA TCGAGGTTTT AATCATGCTG TTCGTCATTT GCAAACAGGA
CAGGTTGATT TTACAAGTCA AAATCATGGT TATGCTGTCT CGAGAGAAGA TTTCCCTGAG
GCTCTCTTCA TTACACATGA AGAGATTAAT GATAAAACTG TCGAAGGTGT TCGACATAAA
TACTATCCTG CCTTTTCAGT GCAGTTTCAC CCAGATGCAG CACCAGGGCC GCATGATACA
TCTTATCTTT TTGATGAATT TATTAATATG ATTGATGATT TCCAGCAAAA AAGCTAG
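
The GC content field describes the share of G and C bases in a nucleotide sequence like the one above. The following is a minimal sketch of how such a value could be computed from the space-grouped layout used here; `gc_fraction` and `FIRST_ROW` are hypothetical names, and only the first row of the gene sequence is included to keep the example short, so the printed value will not match the 36% reported for the whole gene.

```python
# First row of the gene sequence above, in its space-grouped layout.
FIRST_ROW = "ATGAAGCGTC TATTATTACT GGAAGATGGT AGTGTGTTTG AGGGTGAGGC TTTCGGAGCG"


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# Report the result as a percentage for this 60-base fragment only.
print(f"{gc_fraction(FIRST_ROW):.0%}")
```

To check the listed 36%, pass the full 1077-base gene sequence instead of the fragment.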
 
Protein sequence
MKRLLLLEDG SVFEGEAFGA DVETSGEIVF STGMTGYQES ITDQSYNGQI ITFTYPLIGN 
YGINRDDYES IRPTCKGVVI YEWAEYPSNW RQQMTLDEFL KLKGIPGISG IDTRALTKII
RKHGTMKACL INEGNSIHEA LENLQKSVLL NDQIEQVSTK LAYASPGVGK NIVLVDFGLK
HSILRELSQR QCHITVVPHT TTAQEILNLN PDGVLLSNGP GNPEQLPNAL QMIQEIQGKI
PIFGICMGHQ LFAKANGAKT YKMTFGHRGF NHAVRHLQTG QVDFTSQNHG YAVSREDFPE
ALFITHEEIN DKTVEGVRHK YYPAFSVQFH PDAAPGPHDT SYLFDEFINM IDDFQQKS