Gene Cfla_1052 details


Gene Information

Locus tag: Cfla_1052
Symbol:
ID: 9144928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1166621
End bp: 1168303
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: arginyl-tRNA synthetase
Protein accession: YP_003636156
Protein GI: 296128906
COG category:
COG ID:
TIGRFAM ID:
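
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length; the variable names are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1166621
end_bp = 1168303

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# The final codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under that assumption the computed values agree with the listed Gene Length and Protein Length.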


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.473063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCCGG AGCAGCTCGC CGACGCCCTG CGCGTCGCTC TCGTGCAGGC CGTCGACGAC 
GGCACGTTCG CCCTCGACCC CGCCGCCCTC CCCGAGCGCG TGCACATCGA GCGCCCCCGG
CAGCGCGAGC ACGGCGACTG GGCGACGAAC GTCGCGCTGC AGCTGGCCAA GAAGGCCGGG
AGCAACCCGC GGGCGTTCGC CGAGGAGCTG GCGCGCCGCC TCGCGGACGT GCCCGGGGTC
GCCGCGGTCG ACGTGGCCGG TCCGGGCTTC CTCAACATCC GCCTCGACGC CGCCGCCGCG
GGCGAGCTCG CGCGCGGCAT CGTCGAGCAG GGGGCGGGGT ACGGGCGCAA CGCGACGTTC
GCCGGGCTCA AGGTGAACCT CGAGTTCGTC TCGGCCAACC CCACCGGCCC GCTGCACATC
GGCGGCGTGC GCTGGGCGGC GGTCGGTGAC GCGCTCGCGC GCGTGCTGCA GGCGTCCGGG
GCCGAGGTGA CGCGCGAGTA CTACTTCAAC GACCACGGCG CGCAGATCGA CCGCTTCGCG
CGTTCGCTCG TCGCCCGCGC CCGCGGCGAG GAGGCTCCCG AGGACGGCTA CGGCGGCCAG
TACATCGCCG ACATCGCCGA CCGGGTGATC GCCGACGCGC AGGCCGCCGG TGAGCCCGAC
CCGCGCACGC TGCCCGACGC CGAGGCGACC GAGGCGTTCC GGGCGCGCGG CGTCGAGCTG
ATGTTCGCCG AGATCAAGAC GTCCCTGCAC GACTTCGGCG TCGACTTCGA CGTGTACTTC
CACGAGGACT CGCTGCACGA GTCCGGTGCC GTGGAGCGGG CGGTCGAGCG CCTGCGGGCG
TCGGGCCACA TGTTCGAGGC CGACGGCGCG CTGTGGCTGC GCACCACGGC CTTCGGCGAC
GACAAGGACC GCGTCGTCGT CAAGTCGGAC GGGCAGGCCG CCTACATCGC GGGCGACATC
GCCTACTACC TCGACAAGCG CGAGCGCGGC TTCGACCGCG TCGTCATCAT GCTCGGCTCC
GACCACCACG GGTACGTCGG CCGCATGATG GCGGTCTGCG CGGCGTTCGG CGACGAGCCG
CACGTCAACC TCGAGCTGCT CATCGGCCAG CTCGTCAACC TCGTCAAGGA CGGCGAGCCC
GTCCGGATGT CGAAGCGCGC GGGCACGATC ATCACGATCG ACGACCTCGT CGAGGCGGTC
GGCGTCGACG CCGCGCGCTA CGCGCTGTCG CGGTCGTCGT CGGACCAGCA GATCGACCTC
GACCTCGACC TGCTCGCGAA GGCCACCAAC GAGAACCCCG TCTTCTACGT GCAGTACGCC
CACGCCCGTA CCGCGGCGAT GGCCCGCAAC GCGGCGGAAG CCGGCGTGCG CCGCGAGGAC
GGCTTCGACG CCTCCCTGCT CGACCACGAG TCGGAGGCCA AGCTGCTCGG CCTGCTCGCG
GACTTCCCGC GCGTGGTCGC CCAGGCGGCG GACCTGCGCG AGCCGCACCG CGTCGCGCGC
TACGCCGAGG ACGTCGCCGG CGCCTACCAC AAGTGGTACG ACCAGAAGCG CCGCGTGGTG
CCGTGGGGCG ACGAGGAGCT CACGGACGCC CACCGCACGC GCCTGTGGCT GAACGACGCG
ACGCGTCAGG TGCTCGCGAA CGCCCTCGAC CTGCTCGGGG TGAGCGCGCC GGAGCGGATG
TGA
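
A GC content figure like the one listed in the gene information is the fraction of G and C bases in a sequence. The helper below is a minimal sketch of that calculation; the function name `gc_fraction` is hypothetical, and the database may compute or round its value differently. It ignores the spaces and line breaks used in the layout above.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters so that spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "GTGACCCCGG"
print(gc_fraction(first_block))
```

Passing the full gene sequence as one string would give the whole-gene value.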
 
Protein sequence
MTPEQLADAL RVALVQAVDD GTFALDPAAL PERVHIERPR QREHGDWATN VALQLAKKAG 
SNPRAFAEEL ARRLADVPGV AAVDVAGPGF LNIRLDAAAA GELARGIVEQ GAGYGRNATF
AGLKVNLEFV SANPTGPLHI GGVRWAAVGD ALARVLQASG AEVTREYYFN DHGAQIDRFA
RSLVARARGE EAPEDGYGGQ YIADIADRVI ADAQAAGEPD PRTLPDAEAT EAFRARGVEL
MFAEIKTSLH DFGVDFDVYF HEDSLHESGA VERAVERLRA SGHMFEADGA LWLRTTAFGD
DKDRVVVKSD GQAAYIAGDI AYYLDKRERG FDRVVIMLGS DHHGYVGRMM AVCAAFGDEP
HVNLELLIGQ LVNLVKDGEP VRMSKRAGTI ITIDDLVEAV GVDAARYALS RSSSDQQIDL
DLDLLAKATN ENPVFYVQYA HARTAAMARN AAEAGVRRED GFDASLLDHE SEAKLLGLLA
DFPRVVAQAA DLREPHRVAR YAEDVAGAYH KWYDQKRRVV PWGDEELTDA HRTRLWLNDA
TRQVLANALD LLGVSAPERM
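
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is illustrative and not the database's own method. It assumes the NCBI layout of the codon table (bases ordered T, C, A, G) and the initiation codons listed for table 11, and it treats an initiation codon in first position as methionine. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, dropping a trailing stop codon."""
    # Remove the spaces and line breaks used in the page layout.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiation codon in first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGACCCCGG AGCAGCTCGC CGACGCCCTG"))
```

The gene begins with GTG rather than ATG, which is why the first-position rule matters here: read as an ordinary codon, GTG would give valine instead of the methionine that opens the protein sequence.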