Gene Information |
Locus tag | Cfla_2234 |
Symbol | |
ID | 9146134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2491779 |
End bp | 2494727 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | leucyl-tRNA synthetase |
Protein accession | YP_003637324 |
Protein GI | 296130074 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
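The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention, but the listed values are consistent with it) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table above.
length = gene_length(2491779, 2494727)
print(length)                  # 2949
print(protein_length(length))  # 982
```

Both results match the Gene Length (2949 bp) and Protein Length (982 aa) fields.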
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.400597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0120653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCCG CCTCGCGTCG GTACGATTCC CCGGTGAGCG ACCAGTCCCC GACCCCGGCC CCCGACGACG TCCCCTTCCG CTACACGGCC GCCCTCGCCG AGCAGATCGA GCTCCGCTGG CAGGACGAGT GGGAGAAGCG CGGCACGTAC TTCACGCCCA ACCCGGTCGG CGAGCTCACC GACGGCGAGG GGCGGCACGC CGACCCCGCG GCACGCCCGT TCTTCGTCAT GGACATGTTC CCGTACCCGT CGGGCGCAGG GCTGCACGTC GGGCACCCGC TCGGCTACAT CGCCACGGAC GTCGTGGGCC GGTTCCGACG CATGTGCGGC GACAACGTGC TGCACGCCCT GGCGTTCGAC GCGTTCGGCC TGCCCGCCGA GCAGTACGCG GTCCAGACCG GCCGGCACCC GCGGGTGACC ACCGAGGCGA ACATCGAGAT CATGCAGCGC CAGCTGCGTC GTCTCGGCCT GGCCCACGAC CCGCGCCGCT CGTTCGCGAC GATCGACCCC GACTACGTCC GCTGGACGCA GTGGATCTTC CTGCAGATCT TCGAGTCCTG GTACGACGAG GACGCGGTGC GCCCCGACGG CGGCACCGGG CGGGCGCGGC CGGTGTCCGA GCTGGTCGCC GAGTACGAGG CCGGCACCCG CGCGCTGCCC ACGGACGTCG AGGGCGTCGA GCCCGGTGCG ACGTGGACCG ACCTGGATGC GGCCACCCGC CGCCGGGTCG TCGACTCGCG CCGGCTGGCG TACCTGTCGC AGACGCCCGT CAACTGGGCG CCCGGCCTGG GCACCGTGCT GGCCAACGAG GAGGTCACGG CCGACGGCCG CTCCGAGCGT GGCAACTTCC CGGTGTTCCA GCGCAGCCTG CGCCAGTGGA ACATGCGCAT CACGGCGTAC GCCGACCGCC TGACGGACGA CCTGGACCGC ATCGACTGGC CCGAGAAGGT CAAGGCGATG CAGCGGCACT GGATCGGCCG CTCGACCGGT GCACGCGTGC GGTTCGCCGT GCAGGGCGGG GAGCAGCTCG AGGTGTTCAC GACGCGCCCC GACACGCTGT TCGGCGCCAC GTTCCTCGTC GTCTCGCCCG AGCACCCGCT GCTGGACGAG GTGCCGGCGC AGTGGCCCGA CGGCACGTCG AGCGCCTGGA CGGGCGGGCA CTCCTCGCCG ACGGACGCCG TCGCCGACTA CCGCCGCGAG GCCGCCGCGA AGACCGCGCT CGAGCGGCAG CAGGACGCCG GCCGCAAGAC GGGTGTGTTC ACCGGCCACC TGGCGACCAA CCCGGTCAAC GGCGAGCTGC TGCCGGTCTT CACGGCGGAC TACGTGCTCA TGGGCTACGG CACGGGCGCG ATCATGGCCG TCCCGGGCGG TGACGAGCGC GACTTCCAGT TCGCCCAGGC CTTCGGGCTG CCGGTCGTGT ACACCGTGGA CGCGCCCGAG GGCACCGCCC CCGGCGCACG CACCGGCGAC GGCGCGATCA TCAACTCCGC CAACGACGAG GTCTCGCTCG ACGGCCTGGA CGTGCCGACG GCCAAGGAGC GCATCGTGGC CTGGCTCGAG GAGCACGGCG TCGGGGAGCG CACGATCACC TACCGCCTGC GCGACTGGCT GTTCAGCCGC CAGCGCTACT GGGGCGAGCC GTTCCCCGTC GTCTACGACG AGGACGACAC GCCCATCGCG CTGCCTGCCT CGGCGCTGCC CGTCGAGCTG CCCGAGGTGC CGGACTTCTC GCCGCGCACC TACGACCCGG ACGACGCGAC CTCCGAGCCC GAGCCGCCGC TGGGCCGCAA CACCGACTGG 
CTGTACGTCG AGCTCGACCT GGGCGACGGC CCGAGGCGCT ACCGCCGCGA CGCCAACACG ATGCCCAACT GGGCGGGCTC GTGCTGGTAC CACCTGCGCT ATCTCGACCC GCGCTCGGAC GACGCGCTGG TCGACCCCGC CCTCGAGGAC TACTGGATGG GCCCCGGTCA CGGGACGCAG GCCGAGGGCT CGACGGGCGG CGTCGACCTG TACGTCGGCG GGGTGGAGCA CGCCGTGCTG CACCTGCTGT ACGCGCGCTT CTGGCACAAG GTGCTGTACG ACCTGGGCCA CGTGCGCAGC GCCGAGCCGT TCCACAAGCT GTTCAACCAG GGCTACATCC AGGGGTACGC CTACACCGAC GAGCGCGGCG TGTACGTGCC CGCGGCCGAG GTCGTCGAGG ACGAGGCGTC GCCGACCGGC TTCCGGTGGA ACGGCGAGCC CGTCCACCGG GAGTACGGGA AGATCGGCAA GTCGCTGAAG AACGCCGTGT CGCCCGACGA GATGTACGAG GCCTACGGCG CCGACACGCT GCGCGTCTAC GAGATGTCGA TGGGTCCGCT GGACCTGTCG CGGCCGTGGG AGACGCGCGC CGTGGTCGGT GCGCAGCGGT TCCTGCAGCG GCTGTGGCGC AACGTCGTCG ACGAGACGAC CGGCGAGCTG GTCGTGACCG AGGACGCGCC GTCGACCGAG ACGCTGCGGG TCCTGCACCG CACGATCGAG GGTGTGCGCG AGGACATGGA GGGCATGCGG ATCAACACCG CGATCGCCAA GCTCATCGTC CTCAACAACC ACGTCACGAC GCTGGAGCGC GCGCCGCGTT CCGTGGTCGA GGCGCTGGTC GTCATGACGG CACCCGTCGC ACCGCACATC GCCGAGGAGC TCTGGGCGCG GCTGGGCCAC GAGCGGTCGG TCGTGCACGC CACGTTCCCG CAGGCGGACC CGCAGCATCT GGTCGAGGAG ACCGTGACCT GCGTGTTCCA GGTGCAGGGC AAGGTGCGCG GCCGCGCGGA GGTGGCGCCG TCGGCGGGCG AGGACGAGCT GCGCGAGCTG GCGCTCGCCG ACGCGGGCGT CCAGCGCGCG CTCGCGGGAC GTGACGTGCG GACCGTGATC GTCCGCGCGC CGCGGCTCGT CAACGTGGTG CCGGCCTGA
|
Protein sequence | MGAASRRYDS PVSDQSPTPA PDDVPFRYTA ALAEQIELRW QDEWEKRGTY FTPNPVGELT DGEGRHADPA ARPFFVMDMF PYPSGAGLHV GHPLGYIATD VVGRFRRMCG DNVLHALAFD AFGLPAEQYA VQTGRHPRVT TEANIEIMQR QLRRLGLAHD PRRSFATIDP DYVRWTQWIF LQIFESWYDE DAVRPDGGTG RARPVSELVA EYEAGTRALP TDVEGVEPGA TWTDLDAATR RRVVDSRRLA YLSQTPVNWA PGLGTVLANE EVTADGRSER GNFPVFQRSL RQWNMRITAY ADRLTDDLDR IDWPEKVKAM QRHWIGRSTG ARVRFAVQGG EQLEVFTTRP DTLFGATFLV VSPEHPLLDE VPAQWPDGTS SAWTGGHSSP TDAVADYRRE AAAKTALERQ QDAGRKTGVF TGHLATNPVN GELLPVFTAD YVLMGYGTGA IMAVPGGDER DFQFAQAFGL PVVYTVDAPE GTAPGARTGD GAIINSANDE VSLDGLDVPT AKERIVAWLE EHGVGERTIT YRLRDWLFSR QRYWGEPFPV VYDEDDTPIA LPASALPVEL PEVPDFSPRT YDPDDATSEP EPPLGRNTDW LYVELDLGDG PRRYRRDANT MPNWAGSCWY HLRYLDPRSD DALVDPALED YWMGPGHGTQ AEGSTGGVDL YVGGVEHAVL HLLYARFWHK VLYDLGHVRS AEPFHKLFNQ GYIQGYAYTD ERGVYVPAAE VVEDEASPTG FRWNGEPVHR EYGKIGKSLK NAVSPDEMYE AYGADTLRVY EMSMGPLDLS RPWETRAVVG AQRFLQRLWR NVVDETTGEL VVTEDAPSTE TLRVLHRTIE GVREDMEGMR INTAIAKLIV LNNHVTTLER APRSVVEALV VMTAPVAPHI AEELWARLGH ERSVVHATFP QADPQHLVEE TVTCVFQVQG KVRGRAEVAP SAGEDELREL ALADAGVQRA LAGRDVRTVI VRAPRLVNVV PA
|
| |
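The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the method used by the database: the helper names are hypothetical, and the stop codons checked (TAA, TAG, TGA) are those of translation table 11. The example input is only a short fragment of the gene sequence (its first two blocks and its last nine bases), so the GC fraction printed for it is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_like_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in stop_codons


# First two blocks of the gene sequence as displayed above (fragment only).
head = clean_sequence("ATGGGCGCCG CCTCGCGTCG")
print(len(head))          # 20
print(gc_fraction(head))  # 0.8

# Last nine bases of the gene sequence as displayed above (fragment only).
tail = clean_sequence("CCGGCCTGA")
print(ends_like_cds(tail))  # True
```

Applying `clean_sequence` to the full Gene sequence field and then `gc_fraction` to the result would give a value to compare with the GC content field; `len()` of the cleaned string can likewise be compared with the Gene Length field.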