Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0378 |
Symbol | carB |
ID | 3926971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 369666 |
End bp | 372896 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901502 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_507198 |
Protein GI | 88658245 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.546328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGACAT AGAATCAATA TTAGTTATTG GAGCAGGTCC TATAGTTATA GGACAAGCAT GTGAATTTGA TTATTCAGGT ACACAAGCTT GTAAAGTTTT AAAACAAGAA GGATATCGAA TTATCCTCAT TAACTCAAAC CCTGCAACAA TAATGACTGA TCCCGAAATT GCTGATTCAA CATACATTGA ACCAATTACA TCAAACATTA TTGAAAAAAT CATTATAAAA GAAAGACCAA GCGCAATACT ACCAACAGTA GGTGGGCAAA CTGCTCTAAA CGCTGCAATC AACCTATGTG AGTTAGGAAT CTTAGAAAAA TATAACGTAA AACTAATAGG TGCAACAAAA GAAGCAATTA AAAAAGCTGA AGATAGGAAT CTATTCCGAC AAGCAATGGA CAAAATAGGA GTAAAATATC CAAAAAGTAT AATAATAAGA AGTGTTGATG AAATAAATGA CGCATTAGAA TACATAGGAT TACCAGCAAT AATTAGGCCA TCATTTACTC TTGGTGGCAT AGGGGGTGGT ATATCCTATA ATAAAGAAGA GTTTCATAAA AATGTAAGCC ATGGTTTAAA CATTTCCCCA ATATCTGAGG TACAAGTAGA TGAATCCATC ATCGGGTGGA AAGAATTTGA AATGGAAGTT ATGCGAGATA ATAAAGATAA CTGCATTGTA GTTTGTTCTA TAGAAAATCT CGATCCAATG GGAGTACACA CTGGAGATAG TATAACAGTA GCACCTGCAT TGACTTTAAG AGATGTAGAA TATCAAAAAA TGCGAGACAT ATCATTTTCT ATACTTAGAG AAATAGGAGT AGATACTGGT GGATCAAACG TACAGTTTGC AGTAAACCCA AAAAATGACA ATGATTTACT CGTTATAGAG ATGAACCCAA GAGTCTCTAG ATCATCAGCA CTAGCATCAA AAGCAACAGG TTATCCTATA GCTAAGGTTG CCGCAAAATT AGCTATTGGT TATTCTCTAG ATGAGATACA AAATGACTGT ACAGAAGTCA TTCCAGCATC ATTTGAACCA AGCATAGATT ACATTGTTAC AAAAATTCCA CGATTCAATT TTGATAAGTT CCGAGGTACA GAAAAAGAAC TGTCAACCTC CATGAAATCT GTAGGAGAAG TAATGGCAAT TGGAAGATCT TTTCCAGAAT CATTACAAAA AGCACTGTGT TCCTTAGAAG CTGGATATTC TGGATTAAAC GAGTATTTCG ACCAAAACAT AGAAATAGAC CAACTATATA ACTCCCTTGT AAAATTATCT CCAGATAGAA TATTTATAAT TGCAGATGCT TTACGACACA ATATTAGCAT AGAAGAAATC AATAAAATTA CTGGGTACGA TCCTTGGTTT TTAACCCAAA TCAAAAATAT TATACAATGT GAACAACAAA TACAGAAAAA TGGATTACCA AAAGAAGCAG AAGGGTTGTT ATTACTAAAG AAAATGGGGT TTTCTGATAT AAGGTTAGCA TATTTAAGCA AGACTACAGT AGAATATATA GAAGATCTGA GGAAATCTTT ATCCATAAAA CCTGTATATA AACATATAGA TACTTGTGCA GGAGAATTTA AAACCAATGC ATCATATATG TATAGTTGCT ATGAAGGTAA TACTATCAAT GTCACTGAAT GTGAGTCTGT TGTAACAAAT AACAAAAAAG TGATAATTTT AGGCAGTGGT CCTAACCGTA TTGGTCAAGG TATAGAATTT GATTATACAT GTGTACATGC AGCACATACC ATAAGACAAA TGGGGTATGA GTCTATTATG ATAAATTGTA ATCCGGAAAC CGTTTCAACT GATTACGACA TTTCAGATAA ATTATATTTT TCTCCTCTAA CACGAGAAAG TGTTCTAGAC ATCATATATA AAGAACAAGA GACAAACTCT TTATTAGGAG TAATAGTACA ATTTGGTGGA CAAACACCTT TAAAGCTTGC AAAAGTATTA CAAGAGAAAA ACATTAATAT TCTTGGAACA TCCTTTGATT CCATAGATTT AGCTGAAGAT CGCATGAAAT TTCAACAATT GCTTACACAA TTAAATTTAA AGCAACCTTC TAATATCACT TGCAATTCAA TACATGAAGT ATATGAATGC ATCAAAGATT TAGTATTCCC TATACTAGCA AGACCATCGT ACGTACTAGG TGGACAATCA ATGTCAATTA TTCGTGATAG TAATGCTTTA TCAAATTACT TAACTACTTA TAAGAACATA TTCGATCATG GATCATTATT GTTAGATCAA TTTTTAACTC ATGCAATTGA AGTAGATGTA GACGCTATCT GTGATGGAGA AAAGGTATAT ATAGCAGGGA TAATGGAGCA CATAGAAGAA GCTGGTGTAC ACTCAGGAGA TTCTGCATGT TCTTTACCTT CATATAGTTT AACCTCAGAA ATAACAGATG CTATAGCAGA GCAGACAAAA AAAATAGCAC TTGCACTACA AGTCAGAGGA TTTATTAATA TACAATATGC AATACAAGAT GGCGAAATAT ACATATTAGA AGTTAACCCT AGAGCTAGCA GAACTGTACC TTTTATTGCA AAAGCTACAG GTATCCCTAT AGCAAAAATT GCAACAGAAG TATTACTAGG TAAAAAACTT CCTAACGCTC TTGAAGAGAA ACTCACACAT GTAGCTGTAA AGGAAGCAGT ATTTTCTTTC TCAAGATTTC CAAACATAGA TGTTTTATTA GGACCTGAAA TGAAGTCTAC AGGAGAAGTA ATGGGAATCG ATAAATCATT TGAAATAGCA TTCGCAAAAG CACAAATGGC TGCTGGTTAC GAATTACCAA CTAAAGGTAC AGCCTTTATA TCTGTTAAAA ATTCAGATAA ACCACTCATT GTGAAAACAG CACAAATATT AAAAGATATC GGTTTTACTA TATTCTCGAC TAAAGGCACA TCCACTTATT TAAATGAAGC AGGTATTACT ACAGAACATG TAAATAAAGT AAGAGAAGGT AGACCACATA TATTAGACTT ATTACAAGAT GATAAAATAA ATCTAGTAAT TAATACTTCA GAAGGTATCA AGTCCTTTTC AGAAAGTAGT AGCATAAGAA AAACTGCACT GATAAAAAAA ATTCCATATA GTACCACAAT ACCTGGTGCA AGAGCTTTAG CACTTGCAAT AAAAACATTA AAAAAGCAAG GTATTCACGT AGAATCTATG CAGGAATACA CAAAAAAATA A
|
Protein sequence | MPKRTDIESI LVIGAGPIVI GQACEFDYSG TQACKVLKQE GYRIILINSN PATIMTDPEI ADSTYIEPIT SNIIEKIIIK ERPSAILPTV GGQTALNAAI NLCELGILEK YNVKLIGATK EAIKKAEDRN LFRQAMDKIG VKYPKSIIIR SVDEINDALE YIGLPAIIRP SFTLGGIGGG ISYNKEEFHK NVSHGLNISP ISEVQVDESI IGWKEFEMEV MRDNKDNCIV VCSIENLDPM GVHTGDSITV APALTLRDVE YQKMRDISFS ILREIGVDTG GSNVQFAVNP KNDNDLLVIE MNPRVSRSSA LASKATGYPI AKVAAKLAIG YSLDEIQNDC TEVIPASFEP SIDYIVTKIP RFNFDKFRGT EKELSTSMKS VGEVMAIGRS FPESLQKALC SLEAGYSGLN EYFDQNIEID QLYNSLVKLS PDRIFIIADA LRHNISIEEI NKITGYDPWF LTQIKNIIQC EQQIQKNGLP KEAEGLLLLK KMGFSDIRLA YLSKTTVEYI EDLRKSLSIK PVYKHIDTCA GEFKTNASYM YSCYEGNTIN VTECESVVTN NKKVIILGSG PNRIGQGIEF DYTCVHAAHT IRQMGYESIM INCNPETVST DYDISDKLYF SPLTRESVLD IIYKEQETNS LLGVIVQFGG QTPLKLAKVL QEKNINILGT SFDSIDLAED RMKFQQLLTQ LNLKQPSNIT CNSIHEVYEC IKDLVFPILA RPSYVLGGQS MSIIRDSNAL SNYLTTYKNI FDHGSLLLDQ FLTHAIEVDV DAICDGEKVY IAGIMEHIEE AGVHSGDSAC SLPSYSLTSE ITDAIAEQTK KIALALQVRG FINIQYAIQD GEIYILEVNP RASRTVPFIA KATGIPIAKI ATEVLLGKKL PNALEEKLTH VAVKEAVFSF SRFPNIDVLL GPEMKSTGEV MGIDKSFEIA FAKAQMAAGY ELPTKGTAFI SVKNSDKPLI VKTAQILKDI GFTIFSTKGT STYLNEAGIT TEHVNKVREG RPHILDLLQD DKINLVINTS EGIKSFSESS SIRKTALIKK IPYSTTIPGA RALALAIKTL KKQGIHVESM QEYTKK
|
| |