Gene Cfla_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2004 
Symbol 
ID9145899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2231668 
End bp2233302 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content69% 
IMG OID 
ProductAAA ATPase central domain protein 
Protein accessionYP_003637098 
Protein GI296129848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.36132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC CCGCTGCCCC TGGACGCGAC CTGCACCGCG AGCTCGCGGT GCTCGCGGCG 
AAGAACGAGC GGCTCAGCGA GGCGCTCGTC GCCGCACGCG AGCAGATCCT CGATCTCAAG
CGTCAGGTGG ACGACCTGGC CAAGCCGCCC GGGACGTACG CCACCTTCCT CGGCGCGCGC
GCCGACGGCA CGGTCGACAT CGTCTCCGCC GGTCGCAAGA TGCACGTCGG CGCGAGCCCC
TCGCTCGACG TCCACCACCT GCGGCCCGGG CAGGAGGTCA TGCTCAACGA GGCCCTCACG
GTCGTCGAGG CCGGCGGCTA CGAGCAGGTC GGCGAGATCG TCACCGTCAA GGAGATGCTC
GGCGAGGGGC GTGCCCTGGT GATCGGGCGT GGCGACGAGG AGCGGGTCGT GCGGTTCGCC
GGCCAGGTGG CCGACACCGG CGTGCGCGTC GGCGACGCGC TGACGATCGA CCCGCGCAGC
GGGTTCGTGT TCGAGGTGAT CCCGCGGGCC GAGGTCGAGG AGCTCGTGCT CGAGGAGGTC
CCGGACATCG ACTACACGGA CATCGGTGGC CTGGGCCCGC AGATCGAGGC GATCCGCGAC
GCCGTCGAGC TGCCGTTCCT GCACCCCGAG CTGTTCCGCG AGCACGGGCT CAAGCCGCCC
AAGGGCGTGC TGCTCTACGG CCCGCCGGGA TGCGGCAAGA CGCTCATCGC CAAGGCCGTC
GCGCACTCGC TGGCCGCGAC GGCGGCGGCA GCGCGCGGTG AGGACGTCGC CGACGCCCGC
TCCTTCTTCC TCAACGTCAA GGGACCCGAG CTCCTCAACA AGTACGTCGG GGAGACCGAG
CGGCACATCC GCCTGATCTT CGCCCGGGCG CGCGAGAAGG CGTCGCAAGG GCACCCGGTC
GTCGTGTTCT TCGACGAGAT GGAGTCGCTG TTCCGCACCC GCGGCACCGG GGTGTCCAGC
GACGTCGAGA CGACGATCGT GCCGCAGCTG CTCTCGGAGA TCGACGGCGT CGAGCGGCTC
GACAACGTCA TCGTCATCGG CGCGTCGAAC CGCGAGGACA TGATCGACCC CGCGATCCTG
CGCCCCGGCC GCCTGGACGT GAAGATCAAG ATCGAGCGGC CCGACGCGGA GGGCGCGCGG
GAGATCTTCG CCAAGTACCT CACGCCGGAG CTGCCGATCC ACGCCGACGA CCTCGCCGAG
CACGGCGGGT CGGGCCAGGC GGCCGTCGAG GCGATGATCC GGCGCGTCGT CGAGCGCATG
TACTCCGAGT CCGACGAGAA CCGGTTCCTC GAGGTGACGT ACGCCAGCGG CGACAAGGAG
GTCCTGTTCT TCAAGGACTT CAACTCCGGC GCGATGATCC AGAACGTCGT CGACCGTGCC
AAGAAGTCCG CGATCAAGGA CCTCCTGGCC ACGGGACAGC GCGGCATCCG CGTCGACCAC
CTGCTCTCGG CGTGCGTCGA CGAGTTCAAG GAGAACGAGG ACCTGCCCAA CACCACCAAC
CCGGACGACT GGGCGCGGAT CTCCGGCAAG AAGGGCGAGC GGATCGTCTT CATCCGCACG
ATCGTCCAGG GCAAGAAGGG TGTCGAGGCG TCGCGGACCA TCGAGAACGT GACGAGCACC
GGCCAGTACC TGTGA
 
Protein sequence
MTEPAAPGRD LHRELAVLAA KNERLSEALV AAREQILDLK RQVDDLAKPP GTYATFLGAR 
ADGTVDIVSA GRKMHVGASP SLDVHHLRPG QEVMLNEALT VVEAGGYEQV GEIVTVKEML
GEGRALVIGR GDEERVVRFA GQVADTGVRV GDALTIDPRS GFVFEVIPRA EVEELVLEEV
PDIDYTDIGG LGPQIEAIRD AVELPFLHPE LFREHGLKPP KGVLLYGPPG CGKTLIAKAV
AHSLAATAAA ARGEDVADAR SFFLNVKGPE LLNKYVGETE RHIRLIFARA REKASQGHPV
VVFFDEMESL FRTRGTGVSS DVETTIVPQL LSEIDGVERL DNVIVIGASN REDMIDPAIL
RPGRLDVKIK IERPDAEGAR EIFAKYLTPE LPIHADDLAE HGGSGQAAVE AMIRRVVERM
YSESDENRFL EVTYASGDKE VLFFKDFNSG AMIQNVVDRA KKSAIKDLLA TGQRGIRVDH
LLSACVDEFK ENEDLPNTTN PDDWARISGK KGERIVFIRT IVQGKKGVEA SRTIENVTST
GQYL