Gene Cfla_2031 details

Gene Information

Locus tag: Cfla_2031
Symbol:
ID: 9145927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2262404
End bp: 2265781
Gene Length: 3378 bp
Protein Length: 1125 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003637125
Protein GI: 296129875
COG category:
COG ID:
TIGRFAM ID:
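
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is a minimal illustration, not part of the original record. It assumes the coordinates are 1-based and inclusive (which matches the reported 3378 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2262404, 2265781
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "3378 1125"
```

Both results agree with the Gene Length and Protein Length fields listed above.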


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATGG TCGACTCGCT GTTCGGGCTC ATCCCGGCGG CGTCGACGGG CCAGCAGTGG 
GTGGCCCTCG ACCTGCAGCT CGTCAACTGG GGCGGCTACG ACGGGCACCA TCGCGTGCGC
CTGGCGTCGA CCGCGACGCT GCTGTCGGGC GGGTCGGGCT CCGGCAAGTC CACGCTCATG
GACGCGTACA TCGCCCTGCT CATGCCGCAC ACGACGCCGT TCAACGGCGC GTCCAACGGT
GGCGTGGTCG GGCGCCCGCG TGGCAAGGAC CAGCGCAACA TCCTGTCGTA CGCGCGCGGC
AAGCTCGACG AGTCGCGCAC CGAGGAGGGC ACGCGCCAGC GCGTGCTGCG CGGCGACGGC
AAGGACACGT GGTCGGCGAT CGCGATGACG TGGGCGGACC AGTCGGGCAC GCGCTTCACC
GCGGTGCGCG CCTGGTACGT GCCGTCGGCC GCGCGGACGC TGGAGGACGT CACGGCGGTG
CGCGCGACGT GCGACGGGCC GTTCGACCTG CGGGACCTCG AACCCGCCGC CGGCCAGCGC
CTCGCCAGAC CCGCCGTCAC GGCCGCCGGG CTGGGCTGCT TCGACACCGA CCGCGAGTTC
ACCGCGCGGT TGCACTCGAC GCTCGGCATC GGCGCCGCCG GCGACGGCGG CAAGGCCGTC
GCGCTGCTGG GCCGGATCCA GGCCGGGCAG CAGATCACGA CCGTCGACGC GCTGTACAAG
GCGATGGTGC TCGAGGAGCC GGACACGCTC ACCACGGCGG ACGCGGTGGT GGAGCAGTTC
GACAAGCTCT CGGGCACGCG CGAGCAGATG ATCACCGCGC GCCAGCAGGT CAAGGCGCTC
GAGCCGATCC GCGAGCACCG CGCCGCGATC GAGCAGGCCG CGGCCCGGCT GCGCGTCGTC
GACGCCGTCG GCGGGTTCGA CGACGGCACG TCGCCGGCCG CGCTGTGGCG CCACGAGCGC
CGGCTCGGCC TGCTGCGGGC GGTCGAGTCC GACCTGCGGA CGCGCCACCG CGAGGCGCAG
CGCGTCGCGG CGGAGACGTC CGCCCGTGCG GCTGCGGCCC GGGCCGAGCG TGACGGCGTC
AAGCAGACGC TGTGGGCCTC CGGCGGCGAC CGCCTCGCGA CCGCGCAGCG CGAGCTGCAC
GGCGTGGCTG CGCGCGTCGA GGAGGTCGCC CGCGCGCGGG CCCGGCTCGA CGAGGTGCTG
ACCTCCACGC TCGGCACGTC CGTGACGTCG CTCGACGAGT TCACGGACCT CGTCGGCCGC
GCGCGCCACG CCTTGGCGGA CAGCGACGCG AAGGGCGCCG CGCGCCAGGC GCTGTTCGAC
GCGATGTCGG AGCGCAAGGA GGCCGCCGCG GACCTCGCGG TGCTGCGCCG CGACCACGCG
GACGCCAAGC ACCGGCACGA CAACATCCCG GGCGACCTGC ACGCCACGCG CGCCGCACTG
GCCGAGGCCG CTGGCCTGAC GCCCCAGGAC CTGCCGTTCG TGGCCGAGCT CGTCGAGGTC
CGCACCGAGC ACGAGCCGTG GCGCGAGGCG TTCAACCTCG CGCTCGGCGG CTTCGCGACC
CTAATGCTCA TCGACGTGGC GCACCTCCAG GCGTTCCGCC GGGCGATCGA CAGCGTGCGC
ACGGGCCGGC GCATCCGGTT CGAGGGCGTG CCGGCCGGCC TGCGCGACGA CATCGGCCTG
GACGGGCGCA CGCTCCCGGG GCGGCTGGAC TACCGGCAGT CGCCGTTCAC CGGCTGGTTG
AAGATCGAGC TGTCGACGCG GTTCGCGTAC GTCTGCGTCG ACACGCCCGG TGAGCTCGCG
CAGCACGAGA AGGCGCTGAC CCGCGGTGGT CAGCTCTCCG AGGGCCGGCG CGGCGCGCAC
GGCGGGCAGG GCGCGCGCAA CGTCCTGGGC TTCACCAACA CGCGTCGGCT GACCGACCTC
AACCGCCGGC TCGAGGCCGC CGAGGAGCGG CTGCGGGACG CCGAGGCGCG CGTCGGCGAG
GCCGAGGCCG CATGGGACCG GCACGACGCG ACGCTGCGCG CGTACGCCAC GGTCGTCGAG
CTCACGTGGG ACCAGGTCGA CGTCGCGGGC GTCGAGGCCG AGCGCGACCG CTGGCGGCGC
GTCGTCGACG AGGTGACGTC CGGGAACCCC GACGTCGTGC GCCTGCAGGA GCGGGCCGTC
GAGCTCGACG TGCTCATCCA GGACCTCACC GAGCAGCTGG GCCGCACCAA GGGCGCGGCC
ACGGAGCTGG GCGAGCAGTG GTCGCAGGTC ACCGACCAGG TCGACGTCGC GCAGGGCGCT
CTCGACGCCG CGCAGGACGC GGGCACGGTG CTGGACGACG AGCAGCGGGC CTACCTCGAG
CGCGTGCTCG GCGGCACGGA CGAGGATCTG CCACGTGTCG ACGACCGCAC CGACCCGGCC
GCCGCGCTCG CGGCGTTCGA CGGTGTCGTC GCGCGCGCCG CGGACCTCCT GAACGCCGAC
CGCACCGCCG CGCAGCAGAC CGTCGGTGCG TCGCGCGAGG CGCTGCGGCG CGCGTTCGAG
ACGTTCGTCG AGCGCTGGCC CGACCCGAAC CTCGGCACCG ACCCCGACGC CTCGTACGGC
GACTACGAGC GGATCCTCAC CGAGCTCGAG ACGCAGGGTC TGCACGAGCT CGAGGCCGAG
TGGCGCGCGA GCCTGCTGCG CCTGTCGGGC AACGACCTGG CGGACCTGCA CAGCGCCCTG
TCGCGGTCGG TCCGCGAGAT CAAGGAGCGC ATCCGGCCGG TCAACGACAT CCTCGCGGAC
CTGCCGTTCG CGGACGACGA CCACCGCCTG CGCATCGACG CCCGCGACAC CCAGTCGACG
GTCGTGGCAC GGTTCCGCAA GGAGCTGCGC GACCTGCGCG AGGTGCTGTC CACCGAGGCC
ACGGACGCCG AGCGCGAGCG CCGGTACCAC CGCATGGCGA AGGTCATCGA CCGGATCCGG
CGCACCGCTC CGGACTTCGC GGACCTCGTC GACGTGCGCC GTCACGTGCG GCTGTCCGCC
GAGAAGGTCG ACCTCGAGGG CAACCACGTC GCGCTGTACG ACCACATCGG CGAGAAGTCG
GGCGGTGAGT CGCAGGAGCT CGTCGCGTTC ATCGTGGGCG CCGCGCTGCG CTACCAGCTG
GGCGACGCCG GGGCATCACG GCCGCGGTAC GCGCCGGTGT TCCTCGACGA GGCGCTCATC
AAGGCCGACG CGCGGTTCAC CGGGCGGGCC ATCGGGGCGT GGCGCGGTCT GGGCTTCCAG
CTCGTCATCG GTGCGCCGAA CGACAAGTTC AGCGCGCTGG AGCCGCACGT GGACCTCAAG
TACGTGGTGC TCAAGGACAC GGCGGGCCGG TCGCGGACGA AGGCGGTCGC GGGAGTGGCG
GCGGACGCCG GAGCGTAG
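
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below is a hedged illustration of one way to do that and to compute GC content for comparison with the reported value. The function names are hypothetical, and the example string is only the first two blocks of the sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = join_blocks(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example = "ATGACGATGG TCGACTCGCT"
print(len(join_blocks(example)), gc_percent(example))
```

Passing the complete gene sequence to gc_percent gives a figure that can be compared with the GC content field in the Gene Information section.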
 
Protein sequence
MTMVDSLFGL IPAASTGQQW VALDLQLVNW GGYDGHHRVR LASTATLLSG GSGSGKSTLM 
DAYIALLMPH TTPFNGASNG GVVGRPRGKD QRNILSYARG KLDESRTEEG TRQRVLRGDG
KDTWSAIAMT WADQSGTRFT AVRAWYVPSA ARTLEDVTAV RATCDGPFDL RDLEPAAGQR
LARPAVTAAG LGCFDTDREF TARLHSTLGI GAAGDGGKAV ALLGRIQAGQ QITTVDALYK
AMVLEEPDTL TTADAVVEQF DKLSGTREQM ITARQQVKAL EPIREHRAAI EQAAARLRVV
DAVGGFDDGT SPAALWRHER RLGLLRAVES DLRTRHREAQ RVAAETSARA AAARAERDGV
KQTLWASGGD RLATAQRELH GVAARVEEVA RARARLDEVL TSTLGTSVTS LDEFTDLVGR
ARHALADSDA KGAARQALFD AMSERKEAAA DLAVLRRDHA DAKHRHDNIP GDLHATRAAL
AEAAGLTPQD LPFVAELVEV RTEHEPWREA FNLALGGFAT LMLIDVAHLQ AFRRAIDSVR
TGRRIRFEGV PAGLRDDIGL DGRTLPGRLD YRQSPFTGWL KIELSTRFAY VCVDTPGELA
QHEKALTRGG QLSEGRRGAH GGQGARNVLG FTNTRRLTDL NRRLEAAEER LRDAEARVGE
AEAAWDRHDA TLRAYATVVE LTWDQVDVAG VEAERDRWRR VVDEVTSGNP DVVRLQERAV
ELDVLIQDLT EQLGRTKGAA TELGEQWSQV TDQVDVAQGA LDAAQDAGTV LDDEQRAYLE
RVLGGTDEDL PRVDDRTDPA AALAAFDGVV ARAADLLNAD RTAAQQTVGA SREALRRAFE
TFVERWPDPN LGTDPDASYG DYERILTELE TQGLHELEAE WRASLLRLSG NDLADLHSAL
SRSVREIKER IRPVNDILAD LPFADDDHRL RIDARDTQST VVARFRKELR DLREVLSTEA
TDAERERRYH RMAKVIDRIR RTAPDFADLV DVRRHVRLSA EKVDLEGNHV ALYDHIGEKS
GGESQELVAF IVGAALRYQL GDAGASRPRY APVFLDEALI KADARFTGRA IGAWRGLGFQ
LVIGAPNDKF SALEPHVDLK YVVLKDTAGR SRTKAVAGVA ADAGA