Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1670 |
Symbol | |
ID | 8415969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1968664 |
End bp | 1971900 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024637 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003182025 |
Protein GI | 257791419 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00809225 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAAGC GCGAAGACAT CCAAACCATC CTCGTCATCG GAAGCGGCCC CATCGTCATC GGGCAGGCGT GCGAGTTCGA CTACTCGGGC GCGCAGGCCT GCAAGGTGTT GAAGGCCGAC GGCTATCGCG TGGTGCTCGT GAACAGCAAC CCCGCGACCA TCATGACCGA CCCGGGCCTG GCCGACCGCA CCTACGTCGA ACCCATCACC GTGGAGTTCG TCGAGCAGGT CATTGCGAAG GAGCGCCCCG ACGCCCTCTT GCCCACGCTG GGCGGCCAGA CCGGGCTCAA CACGGCCGTC GAGCTCGCGC GCGCCGGCAT CCTGGCCAAG TACGGCGTGG AGATGATCGG CTGCGACCTC GAGGCCATCG AGCGCGGCGA GGACCGCAAG CAGTTCAACG AGTGCATGGC GAAGCTTGGC ATCGAGACGT CGCGCTCCGG CTACGCCTAT TCCATCGCCG ACGCCGAGGA CATCGTGGCC GAGCTCGGCT ACCCCGTGGT GCTGCGCCCC TCGTTCACGC TGGGCGGCGC GGGCGGCGGC ATCGCGCACG ACGCGGCCGA GCTGCACGAG ATCGTGGGCC AGGGCCTGGA GCTGTCGCCG GCCGGCGAGG TGCTGGTGGA GGAGAGCATC GAGGGCTGGA AAGAGTACGA GATGGAGGTC ATGCGCGACC GCGCCGGCAA CGGCATCATC GTGTGCTCCA TCGAGAACTT CGACGCCATG GGCGTGCACA CGGGCGACTC CATCACGGTG GCGCCCGCGC AGACGCTGAC GGATGTGGAG TACCAGCGCA TGCGCGCGGC GTCGCTGGCC ATCCTCGAGA AGATCGGCGT GGAGACCGGC GGCTCCAACG TGCAGTTCGC CGTCAACCCC GAGAACGGCC GCATGATCGT CATAGAGATG AACCCTCGCG TGTCGCGCTC CTCGGCGCTC GCGTCGAAAG CCACGGGCTT CCCCATCGCG AAGGCGGCCG CGAAGCTGGC CGTGGGCTAC ACGCTCGACG AGATCGTGAA CGACATCACG AAGGCCACGC CCGCCTGCTT CGAGCCGTCC ATCGACTACT GCGTGGTGAA GGTGCCGCGC TTCGCGTTCG AGAAGTTCCA GGGCACCGAC GACACGCTGT CCACGCGCAT GAAGGCCGTC GGCGAGGTCA TGGCCATCGG CCGAACGTTC GAGGAATCGC TCGGCAAGGC CATGCGCTCG CTGGAGAACG GGCGCGCGGG CCTGGGAGCC GACGGCAAGG GCGGCGAGGC CGACGCGTCC GACGAGGCGC TGGAAGACCT CGTGGCGCGC CCCACGGCCG AGCGCATCTT CTACCTGGCC GAGGCGCTGC GCCGCGGCTG GACGGTCGAG CGCGCGAGCG CGGCGAGCCG CGTGGACCCC TTCTTCGTCG CGCGCATGGC CGACATCGTG CGCGTGCAGG AAAACCTGCG CGGCATGGCG CTGGACGAGC TCGATGCCGA CGCGTTCCGC CTGCTCAAGC GCATGGGGCT GGCCGACGCG CAGATCGCGT GCCTCACGGG CTCCGACGAG CTCACCGTGC GCACGTGCCG CAAGATGCTG GGCGTGAGGC CCGCGTTCAA AACCGTGGAC ACCTGCGCCG CCGAGTTCCC CAGCGCCACG GCCTACCATT ACAAGACCTA CGACGCCGAC GAGACCGAGA CGGCGCCGAA GACGCGGCGG CGCGCCATGA TCCTGGGCGC GGGGCCGAAC CGCATCGGCC AGGGCATCGA GTTCGACTAC TGCTGCGTGC ACGCCAGCTA CGCTCTGCAC GAGGTCGGGT TCGAGACGAT CATGGTGAAC TGCAACCCCG AGACGGTGTC CACCGACTAC GACACGTCCG ACAAGCTGTA CTTCGAGCCG CTTACGTTCG AGGACGTCAT GGACATCGTC GACGTCGAGC AGCCCGACGG CGTGGTGGTC ACGCTCGGCG GGCAGACGCC GCTCAAGCTG GCGAACGCGC TGGCGGCGGC CGGCGTGCCC ATCATGGGCA CCTCGCCGGA GGCCATCGAC CTGGCCGAGG ACCGCGACCG CTTCGCCGCC GTCCTCGACG AGCTGGGCAT CGTATACCCG GCCGCCGGCA TGGCCTCCAC GTATCAAGAG GCGTGCGTCG TGGCCGACCG CATCGGCTTC CCACTGCTCG TGCGCCCCAG CTACGTGCTG GGCGGGCGCG GCATGGGCAT CGTCTACGAC GGCGCCCAGC TGGAGAAATA CATGGCGGAG GCCGCGAAGA TCTCGCCCGA CCACCCGGTG TACCTCGACC GCTTCCTCGA AGGCGCGGTG GAGGTGGACC TCGACGCGCT CTGCGACGGC GAGCAGGTGT TCGTGGGCGG CGTGCTGGAG CACATCGAGA TGGCGGGCAT ACATTCGGGC GACTCGGCCT GCTGCACGCC GCCGTTCGCG CTGTCCGAGG CCGTGCAGGC GCAGCTGCGC GCCATCGCGC GCCGCCTGGC GCTGCGCCTG GGCGTGGTGG GGCTCATCAA CATCCAGTTC GCCATCAAAG ACCAGGTCAT CTACATCATC GAGGCGAACC CGCGCGCCAG CCGCACGGTG CCGTTCACCT CGAAGGCCAC GGGCGTTCCG CTGGCGAAGG TCGCCGCGCG CATCATGGCG GGGGAGAAGC TGGCCGATTT GGGGATGCCG CCCGACGACC GCCGGCTCGA GCATTTCAGC GTGAAGGAAG CCGTCATGCC CTTCGGCCGC TTCCCCGGCG CCGACACGGT GCTCGGCCCC GAGATGAAGT CCACCGGCGA GGTCATGGGC ATCGCGCGCA ACTTCCCGGC GGCGTTCGCG AAGACGCAGC TGGCCATCAG CTACGCGCTG CCCGAGGGCG GCACGGTGTT CGTCAGCGTG TGCGACCGCG ACAAGCGCGC CATCGTGCCC ATCGCTCGCG ACATCGCGCG TCTGGGCTTT CGCATCGTGG CCACAGGCGG CACGGCCCGC ACGTTGCGGG CGGCCGGCGT GGAGTGCGAG CAGGTGAGGA AGATCCACGA GGGCGAGGGC AACGTGCGCG ACATGATCGC CGCCGGGGAC ATCGCGCTCA TGATCAACAC GCCGTTCGGC CACGCCACGC GCGCCGACGG CTACGAGCTG CGCCTCGAGG CCGTGAAGCA CGGCGTGACC CACGTGACGA ACCTCGCCGG CGCCCAGGCC ATGGTGGCCG GCATGGAAGC CGCGCGCCTC GGCGGCCTCG CGGCCGTGGC CTTGCAGGAC CTGCCCCAGT GGGAGCTTTC GCGCTGA
|
Protein sequence | MPKREDIQTI LVIGSGPIVI GQACEFDYSG AQACKVLKAD GYRVVLVNSN PATIMTDPGL ADRTYVEPIT VEFVEQVIAK ERPDALLPTL GGQTGLNTAV ELARAGILAK YGVEMIGCDL EAIERGEDRK QFNECMAKLG IETSRSGYAY SIADAEDIVA ELGYPVVLRP SFTLGGAGGG IAHDAAELHE IVGQGLELSP AGEVLVEESI EGWKEYEMEV MRDRAGNGII VCSIENFDAM GVHTGDSITV APAQTLTDVE YQRMRAASLA ILEKIGVETG GSNVQFAVNP ENGRMIVIEM NPRVSRSSAL ASKATGFPIA KAAAKLAVGY TLDEIVNDIT KATPACFEPS IDYCVVKVPR FAFEKFQGTD DTLSTRMKAV GEVMAIGRTF EESLGKAMRS LENGRAGLGA DGKGGEADAS DEALEDLVAR PTAERIFYLA EALRRGWTVE RASAASRVDP FFVARMADIV RVQENLRGMA LDELDADAFR LLKRMGLADA QIACLTGSDE LTVRTCRKML GVRPAFKTVD TCAAEFPSAT AYHYKTYDAD ETETAPKTRR RAMILGAGPN RIGQGIEFDY CCVHASYALH EVGFETIMVN CNPETVSTDY DTSDKLYFEP LTFEDVMDIV DVEQPDGVVV TLGGQTPLKL ANALAAAGVP IMGTSPEAID LAEDRDRFAA VLDELGIVYP AAGMASTYQE ACVVADRIGF PLLVRPSYVL GGRGMGIVYD GAQLEKYMAE AAKISPDHPV YLDRFLEGAV EVDLDALCDG EQVFVGGVLE HIEMAGIHSG DSACCTPPFA LSEAVQAQLR AIARRLALRL GVVGLINIQF AIKDQVIYII EANPRASRTV PFTSKATGVP LAKVAARIMA GEKLADLGMP PDDRRLEHFS VKEAVMPFGR FPGADTVLGP EMKSTGEVMG IARNFPAAFA KTQLAISYAL PEGGTVFVSV CDRDKRAIVP IARDIARLGF RIVATGGTAR TLRAAGVECE QVRKIHEGEG NVRDMIAAGD IALMINTPFG HATRADGYEL RLEAVKHGVT HVTNLAGAQA MVAGMEAARL GGLAAVALQD LPQWELSR
|
| |