Gene Cfla_2024 details


Gene Information

Locus tag: Cfla_2024
Symbol:
ID: 9145919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2252741
End bp: 2255011
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: glycoside hydrolase family 10
Protein accession: YP_003637118
Protein GI: 296129868
COG category:
COG ID:
TIGRFAM ID:
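
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2252741
end_bp = 2255011
protein_length_aa = 756

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is three bases per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 2271
print(expected_cds_bp)  # 2271
```

Under these assumptions both values agree with the listed gene length of 2271 bp.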


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGCA CCATCGGACG GGCGGCGCTG GTCGCCGCGG TGTCGCTCGG CGCACTGGTC 
GGTGCCGCCG CTCTGACGCC TGCGGCCGCA GCGCCGAGCG GGTCGGGCCC GAACCCCGCC
GACTACTCCA CCACCGGCGC CCTGCCCAAC CACACGATCT ACAAGCCGGT GAACCTTCCG
TCGCAGCGCA TGCCCATCGT GGTGTGGTCC AACGGTGCGT GCTCGGCGGA CGGCACGTCG
GCGCAGAACT TCCTCAAGGA GATCGCCTCC TGGGGCTTCC TCGTCGTCTC CAACGGCCGC
CCCAACGGAA GTGGCAGCTC GAACTCCACG TGGCTGACGC AGGCCATGGA CTGGGCCGTG
GCCCAGAACT CGAACAGCAG CAGCGACCTG TACAACAGGC TCGACACCAG CAAGATCGGT
GTCGCCGGCT TCTCGTGCGG CGGCATCGAG GCCTACGCGG TCTCCGGTGA CCCGCGCGTC
ACGACGACCG GCATCTTCAG CAGCGGTCTG CTGAACGACG CGGACGACTA CCAGCTGCGC
CGTCTCGACC ACCCGATCGC CTACATCATC GGTGGCCAGA GCGACATCGC GTACCCCAAC
GCGATGGACG ACTGGGGCAA GCTGCCGCAG GGCCTGCCCG CCTTCATGGG CAACCTCAAC
GTCGGTCACG GCGGTACGTA CCACGAGACC AACGGCGGGG CCTTCGGCTT CGCGGCCCAG
CAGTGGTTCC GGTGGCAGCT CAAGGGCGAC ACGACGGCGG CGCAGACGTT CGTCGGTCAG
AACTGCGGCC TGTGCCGCAA CGGCTGGCAG GTTCAGCAGA AGAACCTCAC GGTGACCCAG
CCGACGCAGT CGCCGACGCC GTCGCCGACC CCCTCGCCGA CGCAGAGCAC CCCGCCGGTC
CAGACGCCCA CCCCGACGCC GACGCAGACG ATCCCGTCGT CCCAGCCCGG CGCGACGCTG
CAGGCCGCGG CGGCCCGCAC GGGCCGGTAC TTCGGTGTGG CCCTGGCGGC GGGCAAGCTC
AACGACTCGA CCTACACGAC CATCGCGAAC CGTGAGTTCA ACATGGTGAC GGCCGAGAAC
GAGATGAAGA TGGACGCCAC GGAGCCGAAC CAGAACCAGT TCAACTTCTC CCAGGGCGAC
CGGATCCTGA ACTGGGCGAC GCAGAACGGC AAGCAGGTGC GTGGGCACGC GCTGGCGTGG
CACTCCCAGC AGCCGGGCTG GATGCAGAAC ATGTCCGGCA CGCAGCTGCG CAACGCGATG
CTCAACCACG TCACGCGGGT CGCGACGTAC TACAAGGGCA AGATCCACAG CTGGGACGTG
GTGAACGAGG CGTTCGCCGA CGGCAACGGC GGTGCGCGTC GTGACTCGAA CCTGCAGCGC
ACCGGTGACG ACTGGATCGA GGCCGCGTTC CGCGCCGCGC GCGCCGCGGA CCCGGGCGCC
AAGCTCTGCT ACAACGACTA CAACACCGAC AACTGGACGT GGGACAAGAC CCAGGCCGTC
TACCGCATGG TCCGTGACTT CAAGTCGCGC GGCGTGCCGA TCGACTGCGT GGGTTTCCAG
TCGCACTTCA ACGCCCAGTC GGCGTACAAC AGCAACTACC GCACGACGCT GTCGAGCTTC
GCCGCCCTGG GTGTCGAGGT GCAGATCACC GAGCTGGACA TCGAGGGCTC GGGTCAGCAG
CAGGCGCAGA CGTACGCGAA CGTCGTGAAC GACTGCCTCG CCGTGCCGGC CTGCAAGGGC
ATCACGGTCT GGGGTGTGCG TGACTCCGAC TCGTGGCGCT CGTACGGCAC CCCGCTGCTG
TTCGACAACT CGGGCAACAA GAAGGAGGCC TACACGGCCA CCCTCAACGC CCTGAACGCC
GCGGCGCCGG TCCCGACGCA GGAGCCGACG CAGACGCCGA CGCAGGAGCC GACGCAGACG
CAGGAGCCGA CGCAGACGCC GACGGTCACC CCGACGCAGG ACCCGGGCCA GGGCTCGGGT
GCGTGCACCG CGACCTACAC GGTCGCGAAC CAGTGGGGTG AGGGCTTCGT CGCCGACGTC
ACGGTCACGG CCAACCAGGA CCTCACCGGA TGGAAGGTGA GCATCCAGCT CCCCGGCGGT
GCCGGGGTCT CGCAGACCTG GAACGGGACG CGAGGGAGCG CCAGCACCGG CCTCGTCACT
GTGGCGAACG CCGGCTGGAA CGGTCGTGTC GCAGCTGGTC AGAGCACCAG CTTCGGCTTC
CAGGGGACCG GCAACGGGGC GGGCGCGACC GTCTCGTGCG AGGCCGCCTG A
 
Protein sequence
MRRTIGRAAL VAAVSLGALV GAAALTPAAA APSGSGPNPA DYSTTGALPN HTIYKPVNLP 
SQRMPIVVWS NGACSADGTS AQNFLKEIAS WGFLVVSNGR PNGSGSSNST WLTQAMDWAV
AQNSNSSSDL YNRLDTSKIG VAGFSCGGIE AYAVSGDPRV TTTGIFSSGL LNDADDYQLR
RLDHPIAYII GGQSDIAYPN AMDDWGKLPQ GLPAFMGNLN VGHGGTYHET NGGAFGFAAQ
QWFRWQLKGD TTAAQTFVGQ NCGLCRNGWQ VQQKNLTVTQ PTQSPTPSPT PSPTQSTPPV
QTPTPTPTQT IPSSQPGATL QAAAARTGRY FGVALAAGKL NDSTYTTIAN REFNMVTAEN
EMKMDATEPN QNQFNFSQGD RILNWATQNG KQVRGHALAW HSQQPGWMQN MSGTQLRNAM
LNHVTRVATY YKGKIHSWDV VNEAFADGNG GARRDSNLQR TGDDWIEAAF RAARAADPGA
KLCYNDYNTD NWTWDKTQAV YRMVRDFKSR GVPIDCVGFQ SHFNAQSAYN SNYRTTLSSF
AALGVEVQIT ELDIEGSGQQ QAQTYANVVN DCLAVPACKG ITVWGVRDSD SWRSYGTPLL
FDNSGNKKEA YTATLNALNA AAPVPTQEPT QTPTQEPTQT QEPTQTPTVT PTQDPGQGSG
ACTATYTVAN QWGEGFVADV TVTANQDLTG WKVSIQLPGG AGVSQTWNGT RGSASTGLVT
VANAGWNGRV AAGQSTSFGF QGTGNGAGAT VSCEAA
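
The sequence blocks above are printed in groups of ten characters, so they need to be stripped of whitespace before any computation. The sketch below, in plain Python with no external libraries, shows one way to clean a block, compute its GC fraction, and translate it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for amino-acid assignments (the two tables differ only in permitted start codons). Only the first line of the gene sequence is used here as input.

```python
# Build the standard codon table (same amino-acid assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(block: str) -> str:
    """Remove spaces and line breaks from a printed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First printed line of the gene sequence above.
first_line = "ATGAGACGCA CCATCGGACG GGCGGCGCTG GTCGCCGCGG TGTCGCTCGG CGCACTGGTC"
dna = clean(first_line)

print(len(dna))        # 60
print(translate(dna))  # MRRTIGRAALVAAVSLGALV
```

The translated residues match the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.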