Gene Elen_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1670 
Symbol 
ID8415969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1968664 
End bp1971900 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content69% 
IMG OID645024637 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003182025 
Protein GI257791419 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00809225 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTAAGC GCGAAGACAT CCAAACCATC CTCGTCATCG GAAGCGGCCC CATCGTCATC 
GGGCAGGCGT GCGAGTTCGA CTACTCGGGC GCGCAGGCCT GCAAGGTGTT GAAGGCCGAC
GGCTATCGCG TGGTGCTCGT GAACAGCAAC CCCGCGACCA TCATGACCGA CCCGGGCCTG
GCCGACCGCA CCTACGTCGA ACCCATCACC GTGGAGTTCG TCGAGCAGGT CATTGCGAAG
GAGCGCCCCG ACGCCCTCTT GCCCACGCTG GGCGGCCAGA CCGGGCTCAA CACGGCCGTC
GAGCTCGCGC GCGCCGGCAT CCTGGCCAAG TACGGCGTGG AGATGATCGG CTGCGACCTC
GAGGCCATCG AGCGCGGCGA GGACCGCAAG CAGTTCAACG AGTGCATGGC GAAGCTTGGC
ATCGAGACGT CGCGCTCCGG CTACGCCTAT TCCATCGCCG ACGCCGAGGA CATCGTGGCC
GAGCTCGGCT ACCCCGTGGT GCTGCGCCCC TCGTTCACGC TGGGCGGCGC GGGCGGCGGC
ATCGCGCACG ACGCGGCCGA GCTGCACGAG ATCGTGGGCC AGGGCCTGGA GCTGTCGCCG
GCCGGCGAGG TGCTGGTGGA GGAGAGCATC GAGGGCTGGA AAGAGTACGA GATGGAGGTC
ATGCGCGACC GCGCCGGCAA CGGCATCATC GTGTGCTCCA TCGAGAACTT CGACGCCATG
GGCGTGCACA CGGGCGACTC CATCACGGTG GCGCCCGCGC AGACGCTGAC GGATGTGGAG
TACCAGCGCA TGCGCGCGGC GTCGCTGGCC ATCCTCGAGA AGATCGGCGT GGAGACCGGC
GGCTCCAACG TGCAGTTCGC CGTCAACCCC GAGAACGGCC GCATGATCGT CATAGAGATG
AACCCTCGCG TGTCGCGCTC CTCGGCGCTC GCGTCGAAAG CCACGGGCTT CCCCATCGCG
AAGGCGGCCG CGAAGCTGGC CGTGGGCTAC ACGCTCGACG AGATCGTGAA CGACATCACG
AAGGCCACGC CCGCCTGCTT CGAGCCGTCC ATCGACTACT GCGTGGTGAA GGTGCCGCGC
TTCGCGTTCG AGAAGTTCCA GGGCACCGAC GACACGCTGT CCACGCGCAT GAAGGCCGTC
GGCGAGGTCA TGGCCATCGG CCGAACGTTC GAGGAATCGC TCGGCAAGGC CATGCGCTCG
CTGGAGAACG GGCGCGCGGG CCTGGGAGCC GACGGCAAGG GCGGCGAGGC CGACGCGTCC
GACGAGGCGC TGGAAGACCT CGTGGCGCGC CCCACGGCCG AGCGCATCTT CTACCTGGCC
GAGGCGCTGC GCCGCGGCTG GACGGTCGAG CGCGCGAGCG CGGCGAGCCG CGTGGACCCC
TTCTTCGTCG CGCGCATGGC CGACATCGTG CGCGTGCAGG AAAACCTGCG CGGCATGGCG
CTGGACGAGC TCGATGCCGA CGCGTTCCGC CTGCTCAAGC GCATGGGGCT GGCCGACGCG
CAGATCGCGT GCCTCACGGG CTCCGACGAG CTCACCGTGC GCACGTGCCG CAAGATGCTG
GGCGTGAGGC CCGCGTTCAA AACCGTGGAC ACCTGCGCCG CCGAGTTCCC CAGCGCCACG
GCCTACCATT ACAAGACCTA CGACGCCGAC GAGACCGAGA CGGCGCCGAA GACGCGGCGG
CGCGCCATGA TCCTGGGCGC GGGGCCGAAC CGCATCGGCC AGGGCATCGA GTTCGACTAC
TGCTGCGTGC ACGCCAGCTA CGCTCTGCAC GAGGTCGGGT TCGAGACGAT CATGGTGAAC
TGCAACCCCG AGACGGTGTC CACCGACTAC GACACGTCCG ACAAGCTGTA CTTCGAGCCG
CTTACGTTCG AGGACGTCAT GGACATCGTC GACGTCGAGC AGCCCGACGG CGTGGTGGTC
ACGCTCGGCG GGCAGACGCC GCTCAAGCTG GCGAACGCGC TGGCGGCGGC CGGCGTGCCC
ATCATGGGCA CCTCGCCGGA GGCCATCGAC CTGGCCGAGG ACCGCGACCG CTTCGCCGCC
GTCCTCGACG AGCTGGGCAT CGTATACCCG GCCGCCGGCA TGGCCTCCAC GTATCAAGAG
GCGTGCGTCG TGGCCGACCG CATCGGCTTC CCACTGCTCG TGCGCCCCAG CTACGTGCTG
GGCGGGCGCG GCATGGGCAT CGTCTACGAC GGCGCCCAGC TGGAGAAATA CATGGCGGAG
GCCGCGAAGA TCTCGCCCGA CCACCCGGTG TACCTCGACC GCTTCCTCGA AGGCGCGGTG
GAGGTGGACC TCGACGCGCT CTGCGACGGC GAGCAGGTGT TCGTGGGCGG CGTGCTGGAG
CACATCGAGA TGGCGGGCAT ACATTCGGGC GACTCGGCCT GCTGCACGCC GCCGTTCGCG
CTGTCCGAGG CCGTGCAGGC GCAGCTGCGC GCCATCGCGC GCCGCCTGGC GCTGCGCCTG
GGCGTGGTGG GGCTCATCAA CATCCAGTTC GCCATCAAAG ACCAGGTCAT CTACATCATC
GAGGCGAACC CGCGCGCCAG CCGCACGGTG CCGTTCACCT CGAAGGCCAC GGGCGTTCCG
CTGGCGAAGG TCGCCGCGCG CATCATGGCG GGGGAGAAGC TGGCCGATTT GGGGATGCCG
CCCGACGACC GCCGGCTCGA GCATTTCAGC GTGAAGGAAG CCGTCATGCC CTTCGGCCGC
TTCCCCGGCG CCGACACGGT GCTCGGCCCC GAGATGAAGT CCACCGGCGA GGTCATGGGC
ATCGCGCGCA ACTTCCCGGC GGCGTTCGCG AAGACGCAGC TGGCCATCAG CTACGCGCTG
CCCGAGGGCG GCACGGTGTT CGTCAGCGTG TGCGACCGCG ACAAGCGCGC CATCGTGCCC
ATCGCTCGCG ACATCGCGCG TCTGGGCTTT CGCATCGTGG CCACAGGCGG CACGGCCCGC
ACGTTGCGGG CGGCCGGCGT GGAGTGCGAG CAGGTGAGGA AGATCCACGA GGGCGAGGGC
AACGTGCGCG ACATGATCGC CGCCGGGGAC ATCGCGCTCA TGATCAACAC GCCGTTCGGC
CACGCCACGC GCGCCGACGG CTACGAGCTG CGCCTCGAGG CCGTGAAGCA CGGCGTGACC
CACGTGACGA ACCTCGCCGG CGCCCAGGCC ATGGTGGCCG GCATGGAAGC CGCGCGCCTC
GGCGGCCTCG CGGCCGTGGC CTTGCAGGAC CTGCCCCAGT GGGAGCTTTC GCGCTGA
 
Protein sequence
MPKREDIQTI LVIGSGPIVI GQACEFDYSG AQACKVLKAD GYRVVLVNSN PATIMTDPGL 
ADRTYVEPIT VEFVEQVIAK ERPDALLPTL GGQTGLNTAV ELARAGILAK YGVEMIGCDL
EAIERGEDRK QFNECMAKLG IETSRSGYAY SIADAEDIVA ELGYPVVLRP SFTLGGAGGG
IAHDAAELHE IVGQGLELSP AGEVLVEESI EGWKEYEMEV MRDRAGNGII VCSIENFDAM
GVHTGDSITV APAQTLTDVE YQRMRAASLA ILEKIGVETG GSNVQFAVNP ENGRMIVIEM
NPRVSRSSAL ASKATGFPIA KAAAKLAVGY TLDEIVNDIT KATPACFEPS IDYCVVKVPR
FAFEKFQGTD DTLSTRMKAV GEVMAIGRTF EESLGKAMRS LENGRAGLGA DGKGGEADAS
DEALEDLVAR PTAERIFYLA EALRRGWTVE RASAASRVDP FFVARMADIV RVQENLRGMA
LDELDADAFR LLKRMGLADA QIACLTGSDE LTVRTCRKML GVRPAFKTVD TCAAEFPSAT
AYHYKTYDAD ETETAPKTRR RAMILGAGPN RIGQGIEFDY CCVHASYALH EVGFETIMVN
CNPETVSTDY DTSDKLYFEP LTFEDVMDIV DVEQPDGVVV TLGGQTPLKL ANALAAAGVP
IMGTSPEAID LAEDRDRFAA VLDELGIVYP AAGMASTYQE ACVVADRIGF PLLVRPSYVL
GGRGMGIVYD GAQLEKYMAE AAKISPDHPV YLDRFLEGAV EVDLDALCDG EQVFVGGVLE
HIEMAGIHSG DSACCTPPFA LSEAVQAQLR AIARRLALRL GVVGLINIQF AIKDQVIYII
EANPRASRTV PFTSKATGVP LAKVAARIMA GEKLADLGMP PDDRRLEHFS VKEAVMPFGR
FPGADTVLGP EMKSTGEVMG IARNFPAAFA KTQLAISYAL PEGGTVFVSV CDRDKRAIVP
IARDIARLGF RIVATGGTAR TLRAAGVECE QVRKIHEGEG NVRDMIAAGD IALMINTPFG
HATRADGYEL RLEAVKHGVT HVTNLAGAQA MVAGMEAARL GGLAAVALQD LPQWELSR