Gene Cfla_3694 details

Gene Information

Locus tag: Cfla_3694
Symbol:
ID: 9147610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4079168
End bp: 4083241
Gene Length: 4074 bp
Protein Length: 1357 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003638761
Protein GI: 296131511
COG category:
COG ID:
TIGRFAM ID:
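
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon; both are assumptions about this database's conventions rather than something the record states, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length_bp = gene_length(4079168, 4083241)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (4074 bp) and Protein Length (1357 aa).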


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGT CCTCGCGTGC CGCCGTGAGC GGTCCCGTCG CCCTTGCCCT GCTCGGGATG 
CTGCTGGTAC CCGTCGCCTC GTCGGCGTCC GGTGCGCCGG CGCCCACCGG GGACACCTCG
GCCGAGGCCG GTGGAGTGGT CGAGCTGGAC GAGTCCGGCC TGACCCCGGA GCAGGGTGAC
GAGGTCCTGC CCGACGACTG GCGCGAGGTC GACGACGAGG CGCACGTCGT GCTGAACGAC
GCGGACGGCA TCACCGTCCT CGGCGCCCGG GCCTCCGACG GGTACGCGTG GGAGACCGTG
GCGACCGTGC CGGTGCCGTG GGCGGACACC GACCTGTGGG TGGCGAACAG CTGCGTGACG
TCGAGCGGCG ACCGGATGGC CGTGGTGTAC GCGCCCCGTG CCGCCACCAA CGACGAGTCG
GCCTTCTCTG CCGGCGCGCG CGCCGCGATC GTCGACCTGC GCTCCGGTGA GGTCGCCGAG
CTCGGCAGCG GGTACACGAT CGCCTACTAC AACCCGGGCT GCGGCGAGGA CGACACGGTC
ACGATCACGC GGCTCGAACC CGACGAGGGC GTCACGCGGG TGGCCGTGGT CGACGCGCAG
ACGGCGGAGA TCACGACGAA GGTCGACCTG GAGGGCCAGG TCACGTCGGC CATCGCCCGG
GAGGACGGGA CGGTGCTGGC GGCCCGCGGT GCCGACCTCG TCGCCGTGGC CCCCGGTGCG
ACCGCACCCG AGGTGGTCGT GGCGGACACG GGCGCCGCGT ACGACCTCAC GCTCGACGGG
TCCGGGCGTC TGGCGTACGT CGCACGGGAC GGCGAGGGCG CGGACACGTC GACCGCGTAC
GTCACGACCC TCGACGACGA CACCCCCGCA CAGGCGATCG GCACCGGGCC CGTCACGCAG
ACGGGCGTCG CGCCGGCGGG AGAGGAGGGC TTCTACCTCA CCGGGAAGGA CGTCGAGCCG
ACCCCCGGCC TGGACGGGGT GGAGCTGCTG CCGGAGGCGA GCCCCGGCTC GGTGATCTCC
TCGAGCGGTG AGCTCGTCGT CGACGCGGTC ACCTCGACCG GCCTCGCCGA CGTGCCGGAG
ATCAAGAAGC CGGCAGCGAC CGCCGTGGTC CCCGTGGCGG AGGCCGAGAT CGCCGCGACC
GTCGTGGACT CCGGGACGAC GCTGACGTAC GCCATGGACG AGCCGGCCCC GGCTGCGCCC
CTCGAGGCGC GCGCCACCGG CACCGCCCGC CCGTCCGGCT TCGCGGCCCG GCAGGAGGCG
ACGGCATCGG CCTCGGTGAC CTCGACGGGC GGCGCGGGCG ACCCGAGCAG CCCGATCGAG
GCCGAGCGCT ACTGCGCCGT CCCGCGCAAC GACCCGGCCA ACCAGGCGTA CCAGCCCAAG
CCCCGGCAGG TCGAGTGGGC CGTCGACCGG GCCGTGAAGG GCCAGCTCAC CCAGACGCGG
CCCGCCAACT GGCGCAACCT CGGGATGTCG AGCTACTCCC CGCAGGGCAT GTTCCCGGCA
CCGCAGCTCA CCGGTGGCGG GACGATCCCG CCCCAGATCG TGCTGGGCGT CCTCGCGCAG
GAGTCGAACC TGTGGCAGGC GTCGCGCTTC ACCGCCCCCG GCAACACGGG CAACCCGCTG
ATCGGCGACT ACTACGGGAC CGACGACACG TGGTCCATCT GGCAGATCGA CTACACCGAG
GCCGACTGCG GTTACGGGGT CGGGCAGATC ACGGACGGCA TGCGGCTGGC CGGCCACGAG
CGCACCGACG ACTCCGGCCG CAAGATCGAG ACGGCCCTGC CGTACGCCCA GCAGCGCGCG
ATCGCGCTGG ACTACACCGC GAACGTCGCG AAGTCCGTGC AGATGCTCAG CCAGAAGTGG
AACCAGCTGG CGGCCGCCGG GATGCTCGTC AACGGCGGTG ACCCCAAGTA CATCGAGAAC
TGGTTCCTCA CGACGTGGGC CTACAACACC GGGTTCTACC CCTACGTGAA CGACAGCACA
CCCTGGGGTG TCGGCTGGTT CAACAACCCG GTGAACCCCA ACTACGACCC GCGACGGACA
CCGTTCCTCA GGTCGCAGGG CGACGCGGCC AAGCCGCAGC AGTGGCCCTA CCCGGAGAAG
GTCATCGGGT GGGCCGCGTA CGGCACGTCC TTCGTGGAGA CGCAGTGGGC CGACCCTGCC
AAGCGCGAGT ACCCGCAGCG CCTGGTGTCG TCCTACACGA CGGCCTGGTG GAACGACCCC
ACGTACCGTG ACATGGCCCA GCCGAAGGTC GACACCTTCT GCAAGCCGGA CGTGAACGAC
TGCGACCCGT CGCTGGCGCA GCCGTGCAAG CTCGGCTCGT ACAAGTGCTG GTGGCACGGC
CCCGTCCGGT GGAAGTCGAG CTGCGACCTC TACTGCGGCC AGGGCTTCGA GCGCTTCCCC
GCCAGCTACG CCACCGAGGC CAGCGCCATG GCGTCCACGC TGCCGGCCAA GACGCTGGAA
TCGAGCTTCC TGCCCAACTG CAGCTACCCG CCGACCGGCG TCGTCGTGGT CGACAACACC
ACGCACCCGA CGGCCCGCAA CAGCAACGAG TGCACACGCA GAGCGACCGT CGGCTCGTTC
AGCCTGTCGT TCGGCAGCCC CGACTCGGAC GGCCGGTACC CGTCGAAGGT CGACTTCCAC
CAGCAGGGCG GCGGGTTCAA CGGCCACTTC TGGTTCGCGC ACATGCAGCA CCGTGTCAAG
ACGCAGCGCC ACATCACCGG GACCTGGGAC CGCGGCGCCA ACCTGAACGG GAAGTGGACG
CGCGTGTGGG TGCACCTCCC CGACTACGCC GGCTGGACCC AGCAGGCCGG CTACACGATC
GACCTGGGCA ACGGCACGAC CCAGAAGCGC TACCTGCCGC AGCGCCGCTA CAAGAACGAG
TGGGTCAGCC TCGGGGTCTT CGAGATGAAG GGCTCGCCCA AGGTCTCGCT GTCGAACGTC
CTCGTCGACG AGAGCACGAC GGGCCGGGTC TGGTACGACG GCGCGCACCA GGACATCGAG
GGGTACGACA ACGTCGCCTG GGACGCGGTC GGATTCCAGG TGCTGCCCGG CAAGCCCGAC
GACTTCGTCG TGTCGCTCGG CGACTCGTAC GCCTCGGGTG AGGGCGCCGG CGACTACGTG
CCGTGGAGCG ACAACAACGG CACCAACCCG CGCGCGAGGA ACGCCTGCCA CCAGTCGGTG
AACGCCTGGA TCCGCAGGAC CACGCTGCCG GGCGACGCGG AGCCGCTGGG CGCGCGCGCC
GACGCTGCCG ACACCGGGCT GGACCTCCAG CTCCTCGCCT GCTCAGGTGC GCAGACGGAG
GACCTGCTGC CCTACTACCA CATCGACACC CCGCAGGCGC CGGAGAACGC GGAGAGGCAG
ACCGGCAGGT ACGGCCAGTA CGGCACCGTG TCGCAGCTCG ACGCCGGCTA CCTCGACACG
AACACCACCC TCGTGACCCT GTCGATCGGT GGGAACGACA TGCAGTTCGG TCCGATCCTG
GCCGCGTGCA TCAAGGCCAA CTTCGCCGGC ACGCCCGAGG ACCGCTCGTT GGCCGACTGC
TCCCGGACCG TCCTCGAGGA CGACACCCTG CCGGCACAGG CCGCCAGCAA GGAGCGGGTG
GACACGAAGA TCGCCGACAG CCTGATCACG CTCGTGCAGC TCGTGCGCGA TCGTGCGCCG
AACGCGCGCA TCGCGATCTT CGGGTACCCC AAGCTGTTCG AGACCACGAC GTCCTGCGTG
CTGATCAACG AGGTCAACCA GGTCTGGCTG AACGAGCTCG CCGACGGCCT GAACGCCAAG
ATCGCCGAGA CCGCCGCCAT CCTGGAGACC TACGACGAGC CCGGGTCGCC CGAGATCTTC
TTCGTCGACA CCCAGGCGTA CTTCACGGGC AAGAACCTGT GCACGGGCGC CGACTCGGGT
CTCACCGGCC TCCAGTTCGG CGTGACCCCC GGCGAGGACC CGCAGTTCCC CCGGCCCTGG
CCCCGCATCG TGGTGGACAA CGACGTCGCG TCGCAGACGT CGGTGCACCC GAACACGAAC
GGGACCGGCT TCTACGCCCA GGCGCTGGAG GACGCCCTTG CGGACCTCCC GTGA
 
Protein sequence
MRLSSRAAVS GPVALALLGM LLVPVASSAS GAPAPTGDTS AEAGGVVELD ESGLTPEQGD 
EVLPDDWREV DDEAHVVLND ADGITVLGAR ASDGYAWETV ATVPVPWADT DLWVANSCVT
SSGDRMAVVY APRAATNDES AFSAGARAAI VDLRSGEVAE LGSGYTIAYY NPGCGEDDTV
TITRLEPDEG VTRVAVVDAQ TAEITTKVDL EGQVTSAIAR EDGTVLAARG ADLVAVAPGA
TAPEVVVADT GAAYDLTLDG SGRLAYVARD GEGADTSTAY VTTLDDDTPA QAIGTGPVTQ
TGVAPAGEEG FYLTGKDVEP TPGLDGVELL PEASPGSVIS SSGELVVDAV TSTGLADVPE
IKKPAATAVV PVAEAEIAAT VVDSGTTLTY AMDEPAPAAP LEARATGTAR PSGFAARQEA
TASASVTSTG GAGDPSSPIE AERYCAVPRN DPANQAYQPK PRQVEWAVDR AVKGQLTQTR
PANWRNLGMS SYSPQGMFPA PQLTGGGTIP PQIVLGVLAQ ESNLWQASRF TAPGNTGNPL
IGDYYGTDDT WSIWQIDYTE ADCGYGVGQI TDGMRLAGHE RTDDSGRKIE TALPYAQQRA
IALDYTANVA KSVQMLSQKW NQLAAAGMLV NGGDPKYIEN WFLTTWAYNT GFYPYVNDST
PWGVGWFNNP VNPNYDPRRT PFLRSQGDAA KPQQWPYPEK VIGWAAYGTS FVETQWADPA
KREYPQRLVS SYTTAWWNDP TYRDMAQPKV DTFCKPDVND CDPSLAQPCK LGSYKCWWHG
PVRWKSSCDL YCGQGFERFP ASYATEASAM ASTLPAKTLE SSFLPNCSYP PTGVVVVDNT
THPTARNSNE CTRRATVGSF SLSFGSPDSD GRYPSKVDFH QQGGGFNGHF WFAHMQHRVK
TQRHITGTWD RGANLNGKWT RVWVHLPDYA GWTQQAGYTI DLGNGTTQKR YLPQRRYKNE
WVSLGVFEMK GSPKVSLSNV LVDESTTGRV WYDGAHQDIE GYDNVAWDAV GFQVLPGKPD
DFVVSLGDSY ASGEGAGDYV PWSDNNGTNP RARNACHQSV NAWIRRTTLP GDAEPLGARA
DAADTGLDLQ LLACSGAQTE DLLPYYHIDT PQAPENAERQ TGRYGQYGTV SQLDAGYLDT
NTTLVTLSIG GNDMQFGPIL AACIKANFAG TPEDRSLADC SRTVLEDDTL PAQAASKERV
DTKIADSLIT LVQLVRDRAP NARIAIFGYP KLFETTTSCV LINEVNQVWL NELADGLNAK
IAETAAILET YDEPGSPEIF FVDTQAYFTG KNLCTGADSG LTGLQFGVTP GEDPQFPRPW
PRIVVDNDVA SQTSVHPNTN GTGFYAQALE DALADLP
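
The sequences in this section are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, that strips the block formatting and computes the G+C fraction of a nucleotide string. Only the first line of the gene sequence is used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGCGCCTGT CCTCGCGTGC CGCCGTGAGC GGTCCCGTCG CCCTTGCCCT GCTCGGGATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared against the GC content listed under Gene Information.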