Gene Cfla_2920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2920 
Symbol 
ID9146832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3233541 
End bp3235940 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content73% 
IMG OID 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_003638002 
Protein GI296130752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0505784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.534907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG CTCCTGCGCG CCCCGACGTC CCGTCCGCTC CGCACACGTC CGACACCGTC 
GAGCACGCCG CGGCGACACC CGACCTCGAG CAGCCCTACG CCGAGCTCGG CCTCAAGCCC
GACGAGTACC AGCGGATCCG CGACATCCTC GGGCGTCGCC CCACGGCCGC CGAGCTCGCG
ATGTACTCCG TGATGTGGTC CGAGCACTGC TCGTACAAGT CGTCGAAGAC CCACCTGCGG
CAGTTCGGCG ACAAGACCAC GCCCGCGATG AAGGAGCACC TGCTCGTCGG CATCGGGGAG
AACGCCGGCG TCGTCGACAT CGGCGACGGC TGGGCCGTGA CGTTCAAGGT CGAGTCGCAC
AACCACCCGT CGTTCGTCGA GCCCTACCAG GGTGCCGCGA CGGGCGTCGG CGGGATCGTG
CGCGACATCA TCTCGATGGG TGCGCGGCCC GTCGCCGTCA TGGACCAGCT GCGCTTCGGC
GCCGTGGACC ACCCGGACAC GGCGCGCGTC GTGCACGGCG TCGTCGCCGG CGTCGGGGGG
TACGGCAACA GCCTGGGGCT GCCGAACATC GGCGGCGAGC TCGTCTTCGA CGCCTGCTAC
CAGGGCAACC CGCTGGTCAA CGCGCTGTGC CTCGGTGTGC TGCGCCACGA GGACATCCAC
CTGGCCAACG CCTCGGGCGT CGGCAACAAG GTCGTGCTGT TCGGCGCCCG CACGGGCGGC
GACGGCATCG GCGGCGCGTC GATCCTCGCG TCGGAGACGT TCGACGACGC CAAGCCGTCC
AAGCGTCCGT CGGTGCAGGT CGGCGACCCG TTCATGGAGA AGGTGCTCAT CGAGTGCTGC
CTCGAGCTCT ACGCGGCCCA GGTCGTCGAG GGCATCCAGG ACCTCGGCGC CGCGGGCATC
TCCTGCGCGA CCAGCGAGCT CGCGTCCAAC GGTGACGGCG GTATGCACGT CGACCTCGAG
AACGTGCTGC TGCGCGACCC CACGCTGACG GCCGGCGAGA TCCTCATGTC GGAGTCGCAG
GAGCGCATGA TGGCGGTCGT CCGGCCCGAC AAGCTCGACG AGTTCCTCGA GATCACGCGC
AGGTGGGACG TCGAGACGGC CGTGATCGGC GAGGTCACCG GCACCGGGCG CCTGACGATC
GACCACCACG GCGACCGGAT CGTCGACGTC GACCCGAGGA CGGTCGCGCA CGAGGGCCCG
GTGTACGACC GCCCCTACGC GCGCCCGGCC TGGCAGGACG GCCTCGTCGC GGACTCCGTG
AGCACGCCTG AGGGTGCGGC GCGCTACGCG CGCCCGGAGA GCGCCGACGA GCTGCGCGCG
ACACTCCTGC AGCTCCTGGG CTCGCCGAAC CTCGCGTCAC CGGCATGGGT CACCGACCAG
TACGACCGCT TCGTGCAGGG CAACACGGCC CTCGCGCAGC CCGACGACTC GGGCGTGGTG
CGCGTCGACG AGACCACGGG CCTGGGCGTC GCGCTGGCCA CCGACGCCAA CGGCCGCTAC
GCCAAGCTCG ACCCGTACAC GGGTGCGCAG CTCGCGCTCG CCGAGGCGTA CCGCAACGTC
GCCACGGTCG GTGCGCGGCC CGTGGCCGTC ACCGACTGCC TGAACTTCGG CACGCCCGAG
CACCCGGACA CGATGTGGCA GCTCGTCGAG GCGATCCGCG GCCTGGCCGA CGCGTGCCAG
ACCCTCGAGG TCCCCGTGAC CGGCGGCAAC GTCTCGCTCT ACAACGGCAC GGGCGAGCCC
GGGCAGATCG ACTCGGCGAT CCACCCGACC CCCGTCGTCG GCGTGCTCGG GGTCCTCGAC
GACGTCGCGC GCGCGGTGCC GTCGGGCTGG ACCGCGCCCG GCCAGGCCGT CTACCTGCTG
GGCACCACGC GTCCGGAGCT CGACGGGTCC GCCTGGGCGG ACGTCGTCCA CCGGCACCTG
GGCGGCGTGC CGCCGCAGGT CGACCTCGAC GCCGAGCGCC GCCTCGCGCA GGTGCTGGTC
GCCGCGGCGC GCGACGAGCT CGTCGACGCC GCGCACGACC TGTCCGAGGG TGGCCTGGCG
CTCGCGCTCG TCGAGTCGAG CCTCCGGTAC GGCGTCGGTG TCCAGGTCGA CCTCGGCGCA
CTGTGCGCGC GCGACGGCCT CACGGCCTTC GAGGCGCTGT TCTCGGAGTC GCAGGCCCGT
GCGATCGTCG CCGTGCCCCG CTCCGAGGAG GTCCGCCTCC TCGACCTGTG CACCGCTCGC
GGCGTCCCGG CGCTGCGCCT GGGCGAGACG GCCGAGACGT GCACGCTCGG TGCGGGCACC
CCGGTCGACG ACGACGAGCA CGTGCACGCC GCCGCCGTCG AGGTCCGCGG CCTGTTCACG
CTCCCGCTGA CCGAGGCGCG CGAGACCTGG GAGCGCACGC TCCCGGCGCT CTTCGCCTGA
 
Protein sequence
MTTAPARPDV PSAPHTSDTV EHAAATPDLE QPYAELGLKP DEYQRIRDIL GRRPTAAELA 
MYSVMWSEHC SYKSSKTHLR QFGDKTTPAM KEHLLVGIGE NAGVVDIGDG WAVTFKVESH
NHPSFVEPYQ GAATGVGGIV RDIISMGARP VAVMDQLRFG AVDHPDTARV VHGVVAGVGG
YGNSLGLPNI GGELVFDACY QGNPLVNALC LGVLRHEDIH LANASGVGNK VVLFGARTGG
DGIGGASILA SETFDDAKPS KRPSVQVGDP FMEKVLIECC LELYAAQVVE GIQDLGAAGI
SCATSELASN GDGGMHVDLE NVLLRDPTLT AGEILMSESQ ERMMAVVRPD KLDEFLEITR
RWDVETAVIG EVTGTGRLTI DHHGDRIVDV DPRTVAHEGP VYDRPYARPA WQDGLVADSV
STPEGAARYA RPESADELRA TLLQLLGSPN LASPAWVTDQ YDRFVQGNTA LAQPDDSGVV
RVDETTGLGV ALATDANGRY AKLDPYTGAQ LALAEAYRNV ATVGARPVAV TDCLNFGTPE
HPDTMWQLVE AIRGLADACQ TLEVPVTGGN VSLYNGTGEP GQIDSAIHPT PVVGVLGVLD
DVARAVPSGW TAPGQAVYLL GTTRPELDGS AWADVVHRHL GGVPPQVDLD AERRLAQVLV
AAARDELVDA AHDLSEGGLA LALVESSLRY GVGVQVDLGA LCARDGLTAF EALFSESQAR
AIVAVPRSEE VRLLDLCTAR GVPALRLGET AETCTLGAGT PVDDDEHVHA AAVEVRGLFT
LPLTEARETW ERTLPALFA