Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0559 |
Symbol | carB |
ID | 4438420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 504503 |
End bp | 507682 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 639676273 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_820030 |
Protein GI | 116627411 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAC GTTCTGATAT TAAAAAAATT ATGGTTATCG GGTCTGGTCC TATTATCATT GGTCAGGCTG CTGAGTTCGA CTACGCTGGT ACGCAAGCTT GCTTGGCTTT GAAAGAAGAA GGCTACAGTG TTGTTCTTGT TAACTCTAAC CCTGCGACTA TCATGACTGA TAAGGAAATC GCAGACAAGG TTTACATTGA ACCTATTACG CTTGAGTTTG TCACACGTAT TCTTCGTAAA GAGCGTCCTG ATGCCCTCTT GCCAACACTT GGTGGACAGA CTGGTTTGAA TATGGCCATG GAATTGTCAA AAGCTGGTAT TCTTGATGAG CTTGGAGTAG AGCTTTTGGG AACAAAATTG TCTGCCATTG ACCAAGCCGA AGACCGTGAC CTTTTCAAAC AATTGATGGA AGAGTTGGAA CAACCAATTC CAGAATCAGA AATTGTAAAT ACGGTTGAAG AAGCAGTTGC CTTTGCGACA GAGATTGGCT ACCCAGTTAT CGTTCGTCCA GCCTTTACCC TTGGTGGTAC TGGTGGTGGT ATGTGTGCTA ACGAAGAAGA ACTCCGTGAA ATCGCTGAAA ATGGATTGAA ATTGTCACCA GTAACCCAAT GTTTGATTGA GCGTTCGATC GCTGGTTTCA AAGAAATCGA ATACGAAGTT ATGCGTGATG CCGAAGACAA TGCTCTTGTG GTATGTAACA TGGAAAACTT TGACCCAGTT GGTATCCACA CAGGGGACTC AATCGTATTT GCCCCAACGC AAACCTTATC AGATATTGAA AACCAAATGC TTCGTGATGC CAGCTTGAAG ATTATCCGTG CCCTTAAAAT TGAGGGTGGC TGTAACGTCC AGTTGGCGCT TGACCCACAT AGCTTCAAGT ACTATGTTAT CGAAGTAAAC CCTCGTGTAT CACGTTCATC AGCCCTTGCC TCAAAGGCCA CTGGTTATCC AATCGCTAAA TTGGCGGCTA AGATTGCTGT TGGTTTGACA CTTGATGAAA TGATTAACCC AGTCACTGGT ACAACTTATG CCATGTTTGA ACCAGCCCTT GACTATGTGG TTGCTAAGAT TCCACGTTTC CCATTTGATA AATTTGAACA CGGTGAACGT CGTCTCGGTA CTCAGATGAA AGCGACAGGT GAAGTTATGG CTATCGGTCG TAACATTGAA GAGTCACTTC TTAAAGCATG CCGTTCACTT GAAATCGGAG TTTACCACAA TGAAATGTCT GAGCTTGCCG AAGTAACAGA CGATGCCTTG GTTGAAAAAG TTGTTAAAGC TCAAGACGAC CGCCTCTTCT ATATTTCTGA GGCCATTCGT CGTGGTTACA CTATTGAAGA ATTATCAGAG TTGACTAAGA TTGATATCTT CTTCCTTGAT AAATTGCTCC ACATCTTCGA ATTGGAACAA GAATTGGCTG CTCACGTAGG CGATGTTGAT GTTTTAAAAG AAGCTAAACG TAACGGTTTC TCTGATCGTA AGATTGCTGA TCTTTGGAAT CAAACAGCTA ACCAAGTACG TGCGACACGT TTGGAAAATA ACATTGTTCC TGTTTACAAG ATGGTTGATA CTTGTGCGGC TGAGTTTGAA TCATCAACAC CATATTTTTA CTCAACTTAC GAGTGGGAAA ATGAGTCAAT CAAATCTGAT AAAGAATCGG TCATCGTTCT TGGTTCAGGT CCTATCCGTA TCGGACAAGG GGTTGAGTTC GACTATGCGA CAGTTCACTC TGTAAAAGCT ATCCAAGCTG CTGGCTACGA AGCCATCATC ATGAACTCTA ACCCTGAGAC AGTTTCAACA GACTTCTCAG TGTCAGACAA ACTCTACTTT GAACCATTGA CCTTCGAAGA CGTGATGAAC GTTATTGAAT TGGAACAACC TAAGGGTGTG GTGGTTCAGT TTGGTGGACA AACAGCCATC AACTTGGCAG AGCCATTGTC TAAAGCAGGT GTGAAAATCT TGGGTACACA GGTTGCTGAC CTTGACCGTG CAGAAGACCG TGACCTCTTC GAACAGGCTC TTAAAGATCT TGACATTCCA CAACCACCAG GACAAACAGC GACAAATGAA GAAGAAGCAG TTGAAGCGGC TCGTAAGATT GGTTTCCCAG TTCTTGTTCG TCCATCATAC GTTTTGGGTG GACGTGCTAT GGAAATTGTT GAAAATGAAG ATGACCTTCG TTCTTACATG CGCACAGCCG TTAAGGCTAG TCCAGACCAC CCAGTCCTTG TTGATAGCTA TATCATAGGA CGTGAGTGTG AAGTGGATGC CATCTCTGAT GGTAAGGATG TCTTAATTCC AGGTATTATG GAACACATCG AACGTGCGGG GGTTCACTCA GGGGACTCAA TGGCGGTTTA CCCACCACAA ACTCTTTCTA AGAAAATCCA AGAGACTATC GCTGATTACA CGAAACGTTT GGCTATCGGT CTTAACTGTA TCGGTATGAT GAACATCCAG TTCGTTATTA AGGACGAAAC AGTCTATGTT ATCGAGGTTA ACCCACGTGC CAGCCGTACA GTACCATTCT TGTCTAAGGT GACTGATATC CCAATGGCTC AAGTTGCTAC AAACTTGATT CTTGGTAAAT CATTGGCTGA GCAAGGTTAC AAAGATGGTC TTTATCCAGA AAGTAACCAT GTCCATGTCA AAGCACCAGT CTTTTCATTC ACAAAATTGG CTCAGGTAGA TAGTCTCCTT GGACCTGAAA TGAAATCAAC TGGTGAAGTT ATGGGTACAG ATGTGACTCT TGAAAAAGCG CTCTATAAAG CCTTTGAAGC ATCTTACCTC CACTTGCCAA CCTTTGGTAA CGTCATCTTT ACTATTCATG ACGATACCAA AGAAGAAGCC CTTGACTTGG CTCGTCGTTT CGATGCTATC GGTTATGGTA TCTACGCAAC TGAAGGAACA GCTAAGTTCT TGAATGAACA CGGGGTTCAC GCAACGCTTG TTAACAAGTT GGGTGAAAAC GATGACAATG ACATTCCAGC CCTCGTTCGT ACAGGTAAAG CACAAGCTAT TATCAATACA GTTGGTAACA AACGTACTTA TGACGAAGAT GGAGCAGCTA TCCGTAGTTC AGCTATTGAA GCAGGAATTC CACTCTTCAC AGCCCTTGAT ACAGCAGATG CTATGGTGCG TGTGCTTGAA AGCCGCAGCT TTACAACAGA AGCTATCTAA
|
Protein sequence | MPKRSDIKKI MVIGSGPIII GQAAEFDYAG TQACLALKEE GYSVVLVNSN PATIMTDKEI ADKVYIEPIT LEFVTRILRK ERPDALLPTL GGQTGLNMAM ELSKAGILDE LGVELLGTKL SAIDQAEDRD LFKQLMEELE QPIPESEIVN TVEEAVAFAT EIGYPVIVRP AFTLGGTGGG MCANEEELRE IAENGLKLSP VTQCLIERSI AGFKEIEYEV MRDAEDNALV VCNMENFDPV GIHTGDSIVF APTQTLSDIE NQMLRDASLK IIRALKIEGG CNVQLALDPH SFKYYVIEVN PRVSRSSALA SKATGYPIAK LAAKIAVGLT LDEMINPVTG TTYAMFEPAL DYVVAKIPRF PFDKFEHGER RLGTQMKATG EVMAIGRNIE ESLLKACRSL EIGVYHNEMS ELAEVTDDAL VEKVVKAQDD RLFYISEAIR RGYTIEELSE LTKIDIFFLD KLLHIFELEQ ELAAHVGDVD VLKEAKRNGF SDRKIADLWN QTANQVRATR LENNIVPVYK MVDTCAAEFE SSTPYFYSTY EWENESIKSD KESVIVLGSG PIRIGQGVEF DYATVHSVKA IQAAGYEAII MNSNPETVST DFSVSDKLYF EPLTFEDVMN VIELEQPKGV VVQFGGQTAI NLAEPLSKAG VKILGTQVAD LDRAEDRDLF EQALKDLDIP QPPGQTATNE EEAVEAARKI GFPVLVRPSY VLGGRAMEIV ENEDDLRSYM RTAVKASPDH PVLVDSYIIG RECEVDAISD GKDVLIPGIM EHIERAGVHS GDSMAVYPPQ TLSKKIQETI ADYTKRLAIG LNCIGMMNIQ FVIKDETVYV IEVNPRASRT VPFLSKVTDI PMAQVATNLI LGKSLAEQGY KDGLYPESNH VHVKAPVFSF TKLAQVDSLL GPEMKSTGEV MGTDVTLEKA LYKAFEASYL HLPTFGNVIF TIHDDTKEEA LDLARRFDAI GYGIYATEGT AKFLNEHGVH ATLVNKLGEN DDNDIPALVR TGKAQAIINT VGNKRTYDED GAAIRSSAIE AGIPLFTALD TADAMVRVLE SRSFTTEAI
|
| |