Gene Cfla_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1041 
Symbol 
ID9144916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1152932 
End bp1154560 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content72% 
IMG OID 
Productcarboxyl transferase 
Protein accessionYP_003636145 
Protein GI296128895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGGCG TCGCCACCGA CACCGCCGAG CCCGCCGCGG ACACCGCCCC AGCACCCGCC 
CGCACGACCG CCGGGCTGCT CGAGGACCTG CGCGCGCGTC GCGCCGCGGC CGTCGACGCG
GCCGAGGAGA ACGCGGTCGC CAAGCAGCAC GCGCGCGGCA AGAAGACCGC CCGCGAGCGC
ATCGAGTCGC TGCTCGACCC GGGCTCGTTC GTCGAGCTCG ACGCCTTCGT GCGGCACAGG
TCGACGAACT TCGGGCTCGA GAGGAAGCGG GTCGCGGGTG ACGGCGTCGT CGTCGGGCAC
GGCACGGTCG ACGGCCGCCC GGTCTGCGTG TACTCGCAGG ACTTCACGGT CTTCGGTGGC
TCGCTGGGCG AGGTGCACGG GCAGAAGATC ACCAAGGTGA TGGACCTCGC GCTGCGGACG
GGTGTGCCGC TGGTCGGCAT CAGCGACGGC GGTGGCGCGC GCATCCAGGA GGGCGTGGCG
GGCCTCACGC AGTTCGCGGA GATCTTCCGC CGCAACGTCG CCGCGTCGGG CGTGATCCCG
CAGATCAGCC TCATCCTCGG CCCGTCGGCC GGCGGCGCGG TGTACTCCCC GGCGCTCACC
GACTTCATCG TCATGGCCGA CGGCACGTCG AACATGTTCA TCACCGGCCC GGACGTGATC
CGCGCCGTGA CGGGCGAGGA CGTGGGCTTC GAGGAGCTCG GCGGCGCGAC GACGCACAGC
ACGCGCTCCG GCGTCGCGCA CTACATGGCC TCCGACGAGG ACGACGCGAT CGACTGGGTG
CGCACGCTGC TGGCCTACCT GCCGACCAAC AACCTCGCCG AGCCGCCGGC GTACGCGCAG
GAGGCCGACC TCGAGGTCAC CGAGGACGAC CTCGTGCTCG ACACCCTGGT GCCGGACTCG
GACAACCAGC CGTACGACAT GCACACGGTC CTGGAGACGG TCCTCGACGA CGGCAGCTTC
CTGGAGGTCC AGGCGCTGTA CGCGCAGAAC GTGGTCGTCG GGTTCGGGCA CGTCGAGGGG
CAGCCGGTCG GCATCGTCGC GAACCAGCCG ATGCAGATGG CGGGCACGCT CGACATCAAC
GCGGCCGAGA AGGCCGCACG GTTCGTGCGC ACGTGCGACG CGTTCGGCAT CCCGGTGCTG
ACGTTCGTCG ACGTCCCCGG CTTCCTGCCG GGCACGGACC AGGAGTGGAA CGGCATCATC
CGCCGCGGCG CCAAGCTCAT CTACGCGTAC GCCGAGGCGA CCGTCCCGCT GGTCACCGTC
ATCACACGCA AGGCGTACGG CGGCGCGTAC ATCGTCATGG GCTCCAAGCA GCTCGGCGCG
GACGTCAACC TCGCCTGGCC CACCGCGCAG ATCGCGGTGA TGGGCGCGGG TGGCGCCGTG
AACATCCTGC AGCGCGGCGC GCTCAAGGCC GTCGCGGACG CCGGCGGCGA CGTCGAGGCC
GAGCGCCGCC GCCTGACGGC GGAGTACGAG GAGGCGATCG TCAACCCGTG GGACGCGGCG
GACCGCGGCT ACGTGGACGA CGTGATCGAG CCCTCGGCGA CGCGCGCGCA GGTCGTCCGC
TCGCTGCGCC TGCTGCGCAC CAAGCGCGCG AGCCTGCCGC CCAAGAAGCA CGGGAACATC
CCGCTGTGA
 
Protein sequence
MRGVATDTAE PAADTAPAPA RTTAGLLEDL RARRAAAVDA AEENAVAKQH ARGKKTARER 
IESLLDPGSF VELDAFVRHR STNFGLERKR VAGDGVVVGH GTVDGRPVCV YSQDFTVFGG
SLGEVHGQKI TKVMDLALRT GVPLVGISDG GGARIQEGVA GLTQFAEIFR RNVAASGVIP
QISLILGPSA GGAVYSPALT DFIVMADGTS NMFITGPDVI RAVTGEDVGF EELGGATTHS
TRSGVAHYMA SDEDDAIDWV RTLLAYLPTN NLAEPPAYAQ EADLEVTEDD LVLDTLVPDS
DNQPYDMHTV LETVLDDGSF LEVQALYAQN VVVGFGHVEG QPVGIVANQP MQMAGTLDIN
AAEKAARFVR TCDAFGIPVL TFVDVPGFLP GTDQEWNGII RRGAKLIYAY AEATVPLVTV
ITRKAYGGAY IVMGSKQLGA DVNLAWPTAQ IAVMGAGGAV NILQRGALKA VADAGGDVEA
ERRRLTAEYE EAIVNPWDAA DRGYVDDVIE PSATRAQVVR SLRLLRTKRA SLPPKKHGNI
PL