Gene Cfla_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2234 
Symbol 
ID9146134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2491779 
End bp2494727 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content72% 
IMG OID 
Productleucyl-tRNA synthetase 
Protein accessionYP_003637324 
Protein GI296130074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.400597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0120653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCCG CCTCGCGTCG GTACGATTCC CCGGTGAGCG ACCAGTCCCC GACCCCGGCC 
CCCGACGACG TCCCCTTCCG CTACACGGCC GCCCTCGCCG AGCAGATCGA GCTCCGCTGG
CAGGACGAGT GGGAGAAGCG CGGCACGTAC TTCACGCCCA ACCCGGTCGG CGAGCTCACC
GACGGCGAGG GGCGGCACGC CGACCCCGCG GCACGCCCGT TCTTCGTCAT GGACATGTTC
CCGTACCCGT CGGGCGCAGG GCTGCACGTC GGGCACCCGC TCGGCTACAT CGCCACGGAC
GTCGTGGGCC GGTTCCGACG CATGTGCGGC GACAACGTGC TGCACGCCCT GGCGTTCGAC
GCGTTCGGCC TGCCCGCCGA GCAGTACGCG GTCCAGACCG GCCGGCACCC GCGGGTGACC
ACCGAGGCGA ACATCGAGAT CATGCAGCGC CAGCTGCGTC GTCTCGGCCT GGCCCACGAC
CCGCGCCGCT CGTTCGCGAC GATCGACCCC GACTACGTCC GCTGGACGCA GTGGATCTTC
CTGCAGATCT TCGAGTCCTG GTACGACGAG GACGCGGTGC GCCCCGACGG CGGCACCGGG
CGGGCGCGGC CGGTGTCCGA GCTGGTCGCC GAGTACGAGG CCGGCACCCG CGCGCTGCCC
ACGGACGTCG AGGGCGTCGA GCCCGGTGCG ACGTGGACCG ACCTGGATGC GGCCACCCGC
CGCCGGGTCG TCGACTCGCG CCGGCTGGCG TACCTGTCGC AGACGCCCGT CAACTGGGCG
CCCGGCCTGG GCACCGTGCT GGCCAACGAG GAGGTCACGG CCGACGGCCG CTCCGAGCGT
GGCAACTTCC CGGTGTTCCA GCGCAGCCTG CGCCAGTGGA ACATGCGCAT CACGGCGTAC
GCCGACCGCC TGACGGACGA CCTGGACCGC ATCGACTGGC CCGAGAAGGT CAAGGCGATG
CAGCGGCACT GGATCGGCCG CTCGACCGGT GCACGCGTGC GGTTCGCCGT GCAGGGCGGG
GAGCAGCTCG AGGTGTTCAC GACGCGCCCC GACACGCTGT TCGGCGCCAC GTTCCTCGTC
GTCTCGCCCG AGCACCCGCT GCTGGACGAG GTGCCGGCGC AGTGGCCCGA CGGCACGTCG
AGCGCCTGGA CGGGCGGGCA CTCCTCGCCG ACGGACGCCG TCGCCGACTA CCGCCGCGAG
GCCGCCGCGA AGACCGCGCT CGAGCGGCAG CAGGACGCCG GCCGCAAGAC GGGTGTGTTC
ACCGGCCACC TGGCGACCAA CCCGGTCAAC GGCGAGCTGC TGCCGGTCTT CACGGCGGAC
TACGTGCTCA TGGGCTACGG CACGGGCGCG ATCATGGCCG TCCCGGGCGG TGACGAGCGC
GACTTCCAGT TCGCCCAGGC CTTCGGGCTG CCGGTCGTGT ACACCGTGGA CGCGCCCGAG
GGCACCGCCC CCGGCGCACG CACCGGCGAC GGCGCGATCA TCAACTCCGC CAACGACGAG
GTCTCGCTCG ACGGCCTGGA CGTGCCGACG GCCAAGGAGC GCATCGTGGC CTGGCTCGAG
GAGCACGGCG TCGGGGAGCG CACGATCACC TACCGCCTGC GCGACTGGCT GTTCAGCCGC
CAGCGCTACT GGGGCGAGCC GTTCCCCGTC GTCTACGACG AGGACGACAC GCCCATCGCG
CTGCCTGCCT CGGCGCTGCC CGTCGAGCTG CCCGAGGTGC CGGACTTCTC GCCGCGCACC
TACGACCCGG ACGACGCGAC CTCCGAGCCC GAGCCGCCGC TGGGCCGCAA CACCGACTGG
CTGTACGTCG AGCTCGACCT GGGCGACGGC CCGAGGCGCT ACCGCCGCGA CGCCAACACG
ATGCCCAACT GGGCGGGCTC GTGCTGGTAC CACCTGCGCT ATCTCGACCC GCGCTCGGAC
GACGCGCTGG TCGACCCCGC CCTCGAGGAC TACTGGATGG GCCCCGGTCA CGGGACGCAG
GCCGAGGGCT CGACGGGCGG CGTCGACCTG TACGTCGGCG GGGTGGAGCA CGCCGTGCTG
CACCTGCTGT ACGCGCGCTT CTGGCACAAG GTGCTGTACG ACCTGGGCCA CGTGCGCAGC
GCCGAGCCGT TCCACAAGCT GTTCAACCAG GGCTACATCC AGGGGTACGC CTACACCGAC
GAGCGCGGCG TGTACGTGCC CGCGGCCGAG GTCGTCGAGG ACGAGGCGTC GCCGACCGGC
TTCCGGTGGA ACGGCGAGCC CGTCCACCGG GAGTACGGGA AGATCGGCAA GTCGCTGAAG
AACGCCGTGT CGCCCGACGA GATGTACGAG GCCTACGGCG CCGACACGCT GCGCGTCTAC
GAGATGTCGA TGGGTCCGCT GGACCTGTCG CGGCCGTGGG AGACGCGCGC CGTGGTCGGT
GCGCAGCGGT TCCTGCAGCG GCTGTGGCGC AACGTCGTCG ACGAGACGAC CGGCGAGCTG
GTCGTGACCG AGGACGCGCC GTCGACCGAG ACGCTGCGGG TCCTGCACCG CACGATCGAG
GGTGTGCGCG AGGACATGGA GGGCATGCGG ATCAACACCG CGATCGCCAA GCTCATCGTC
CTCAACAACC ACGTCACGAC GCTGGAGCGC GCGCCGCGTT CCGTGGTCGA GGCGCTGGTC
GTCATGACGG CACCCGTCGC ACCGCACATC GCCGAGGAGC TCTGGGCGCG GCTGGGCCAC
GAGCGGTCGG TCGTGCACGC CACGTTCCCG CAGGCGGACC CGCAGCATCT GGTCGAGGAG
ACCGTGACCT GCGTGTTCCA GGTGCAGGGC AAGGTGCGCG GCCGCGCGGA GGTGGCGCCG
TCGGCGGGCG AGGACGAGCT GCGCGAGCTG GCGCTCGCCG ACGCGGGCGT CCAGCGCGCG
CTCGCGGGAC GTGACGTGCG GACCGTGATC GTCCGCGCGC CGCGGCTCGT CAACGTGGTG
CCGGCCTGA
 
Protein sequence
MGAASRRYDS PVSDQSPTPA PDDVPFRYTA ALAEQIELRW QDEWEKRGTY FTPNPVGELT 
DGEGRHADPA ARPFFVMDMF PYPSGAGLHV GHPLGYIATD VVGRFRRMCG DNVLHALAFD
AFGLPAEQYA VQTGRHPRVT TEANIEIMQR QLRRLGLAHD PRRSFATIDP DYVRWTQWIF
LQIFESWYDE DAVRPDGGTG RARPVSELVA EYEAGTRALP TDVEGVEPGA TWTDLDAATR
RRVVDSRRLA YLSQTPVNWA PGLGTVLANE EVTADGRSER GNFPVFQRSL RQWNMRITAY
ADRLTDDLDR IDWPEKVKAM QRHWIGRSTG ARVRFAVQGG EQLEVFTTRP DTLFGATFLV
VSPEHPLLDE VPAQWPDGTS SAWTGGHSSP TDAVADYRRE AAAKTALERQ QDAGRKTGVF
TGHLATNPVN GELLPVFTAD YVLMGYGTGA IMAVPGGDER DFQFAQAFGL PVVYTVDAPE
GTAPGARTGD GAIINSANDE VSLDGLDVPT AKERIVAWLE EHGVGERTIT YRLRDWLFSR
QRYWGEPFPV VYDEDDTPIA LPASALPVEL PEVPDFSPRT YDPDDATSEP EPPLGRNTDW
LYVELDLGDG PRRYRRDANT MPNWAGSCWY HLRYLDPRSD DALVDPALED YWMGPGHGTQ
AEGSTGGVDL YVGGVEHAVL HLLYARFWHK VLYDLGHVRS AEPFHKLFNQ GYIQGYAYTD
ERGVYVPAAE VVEDEASPTG FRWNGEPVHR EYGKIGKSLK NAVSPDEMYE AYGADTLRVY
EMSMGPLDLS RPWETRAVVG AQRFLQRLWR NVVDETTGEL VVTEDAPSTE TLRVLHRTIE
GVREDMEGMR INTAIAKLIV LNNHVTTLER APRSVVEALV VMTAPVAPHI AEELWARLGH
ERSVVHATFP QADPQHLVEE TVTCVFQVQG KVRGRAEVAP SAGEDELREL ALADAGVQRA
LAGRDVRTVI VRAPRLVNVV PA