Gene Cfla_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0954 
Symbol 
ID9144829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1051027 
End bp1054299 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content77% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003636060 
Protein GI296128810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0173723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0708717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATGGC GTGGCCGGCT GCTGCGCGGC CGAGGTCAGG ACCAGGCCCC GGTCGTGCTG 
GTCGTCACGC TCGTGGCGGT CGCGTCCGCG ACCCTCGTCG GCCTGCTCGC CGGGCTGCTG
CACGTCGCCG AGCGCGACGC CGTGCCGCAG GCCATCAGCC GGCTCGACCC GCAGCGCACG
CACCTCGAGG CGACGCTGTG GGTGCGCGGG GACGACGTCG AGCCGGCGCT CGACAGGGCC
CGCGACGGCC TCGCCCGCAT CACGGGGGAC GTCCCGACCA CCGAGCGGAC GTGGCTGATC
GGCGGGCTGC GCGCGCTGCC GACCGAGCTC GGCGTCCCAC CCGAGCTGAC GTACCTCGCT
GCACTCCCCC ACGACGACGA GGACCTCGTG CGCCTCGCGA GCGGACGCTG GCCCGCCGCC
TCGACCGACG CGGACGGCCG CGTCGAGGTG AACGTCCCGG TCGTCGCCGC GCAGGCGCTC
GGGTGGGAGG TCGGCAGCAC CGTCCACGCC CGGCCGTGGG GCGAGGAGCA CGGCGACGCG
TTCGTCGTCG TCGGGACCCA CGAGCCCGCG GGTCCGCGCA GCGCCTGGTC GCGCGACCGG
CTGCGCGGCC AGGGACGCAG CGCAGGGTTC AACCTGCCCG GGTCGGCCGG TCTGATCCGC
ACCACCGCGT GGGGACCGCT CGTCGTCGAC CCCGCGGTCC TCACCAGGCC CCAGATGGTC
GACACCGCCT ACCTCGTGGT CGAGCCCGAC CTCGCCGCGT CGACGGCCGA CGCGGTGGCG
GCGCTGCGCA CGCAGGTCGA CGACGGCGCG CGGATCCTCT CCGACGCACT GACCGGCCCC
GTCAGCGGAC GGCTGCAGAC GGACGTCGAC ACGACGATCG ACGCGACGTG GCGCGAGCTG
GTGGTGACGC GCGCCGCGGT GGTGACGATC GGCCTGCTGC TGGGCACGCT GGCCACGACC
GTGCTGCTGC TGACGGCCCG CCTGCTCGCG GAACGGCGAG CCGGGGAGGC CGAGCTGCTC
GCAGCCCGCG GCGCGTCTCC TGCGCAGCTG CGCTCGACGG TGCTGCTCGA GGCCCTCGTG
CTGGCGACCC TCACGTGGCT CGTCTCCCCG TGGCTCGCGC GCGGCGCGCT GGCGGTCGTG
ACCCGCTCGG GCCCGCCGGC CGAGGCCGGC TACACCGTGC CGGAGGGCGT GCCGGGCGGC
GTGCTGCTCG CCTGCGGCGC CATCGCCGTC GCTCTCGCCG TCACGCTGTG CGTGCCCGCG
TGGCACACCG CGGGGTCGAC GTCGCGCTCC GTGCACGGCG GACTGCTGCG CGTCGGCGGT
GACCTCGCGC TCCTCGTGCT GGGCGCGCTG GCGCTGGGCC AGCTCGTCGC CTACGGCTCG
CCGCTGACGC GCGGCGCCGA CGGCCCGCGG CTGGACCCCG TGCTGGTCGC CGGTCCCGCG
CTGGTGTGCC TCGCGGCGGC GACGGTGGCG CTGCGCCTCG TCGCGCCCGT CGCGCGCGCC
GGCGAACGGC TCGCGCGTGG CGCCCGCTCG CTGGTGCTGC CGCTCGCGGC CTGGCAGGTG
GCACGGCGGT CCGCGGTGGC CACCGGCACC GTGCTCGTGG TGGTCGTCGC AGTCGCGGCC
GGCACGTTCA GCGCCGCGTT CGCGGCGACC TGGCGCACGT CCCAGGTCGA GCAGGTCGAC
CTCGCGCTGG GCACCGACCT GCGGGCCGAC GCCATCGAGG AGGACCCCCT CGCCGCGTCG
GCCGCGCTCG CCGCCGCGAC CGCCGCGTAC CCGGACGCGC ACGGCCAGCC CGTCACGGAC
CGCGTGGTCG GGATGGGCCC GCGGGGCAAC ACGGGCGGGG TCGGCGGCAG GCTGGTCGCG
CTCGACGCGT CCCGTCCTCA GGACCTGCGG GGGCGCTCGA CCACCCCGTG GCGCGAGGTG
GTCGCCGGGC TGCACGCCGA CGAGCCGTCG ACGACCGTCG GGACCGAGCT GCCCGCCGGC
ACGCAGTGGC TCGTGCTCAC GGGCTCGGTC GACACCGACC CCTTCCTCAG CGGCACCGCG
GTGCTGGGCC TCGGCGTCGA GGACGACCAG GGTGTCCTCA CGCAGCTCCC GCCGCGCACC
GCGCCGCTCG GGAGGCCGTT CGAGGTCGTG CTCGAGGTGC CCGTCGCCGA CCGGCTGCGT
GTCGTGGCAA CCGACCTGAC CGTGAGCGTC CACGAGCCCG AGAGCGTCCT GACGACCGAC
TCCCGCCTGG TGCCCGTGCG GACCACCCTG ACGTCACTGC GCGCGGTCCC GCGCTCGGCC
GGCATCGGCA GGGAGCTCGA CGTCCACGAG GCACCCGCGC AGCCCGTGCC GCTGCGCCTC
GACGGCTGGA CCGGGGTCGT CACGCAGGGC GAGAGCACCG TGGGCGCCCC GGAGCTCGTA
CCGGTGGGTG CCCCGGGCGC GTGGCACGTG ACCGGCACGA TGAAGGTCGA CACGTGGGGC
ACCCCGCCGG TGCGCGTGCT CAGCGCCGCG TGGCCGCTCC CCCGGACCGT GCCCGCCGCC
GTGAGCGAGT CGGTGCTCGA CGTCCTCGAG ACGCGGCAGG GCCTGACGAT CACGGTGGCC
GGCGTGTCGG TCCCGCTCCA GGTCGAGCGG CTCGTCGAGC AGGTCCCGGG CGTGCCGCGC
GGGATCGGGG TCGTGGTCGA CCGCACGACA CTCTCGCGGG CCGTGCTCAC CGCGGGCGGC
CGCACGGACC TGCTCGACTC CTGGTGGGTC GCGGCCCCGG CGACCACCAC CGCTGCGCTC
GCCGCCGACC TCGCGGGCGT CGACGCCGAC GTCACGACAC GGGCGGCCGA GCGGCACGAC
GCGCTCACCG GGCCCGTACG GGTCGCGGTC CCGACCGCGG TGTCGCTGGT CGCGGTGTCC
GCCGTGCTGC TCGTGCTCGT CGGCACGGGC GCCGTCGCCG CCGCGTCGCT GCGCTCGCGC
CGCCTGGAGC TCGCACGGCT CCAGGCGCTC GGCGCCTCGC GTGCCGGGCT CGTCGGAGGG
CTGCTCGCCG AGACCACGCT GCTCGTCACG GTGGGGGCCC TCGCGGGGCT CGCGGCCGGC
TACGGGCTGG CCGCCGCGGT CGCCCCGCTG CTGACGATGT CGCCCGACGG CCGCACGCCC
GCACCCGAGC CCTGGCTCGT GTGGGGCTGG GGAACGCAGT CGCTGCGCAC CCTCGGCGTC
GTCGCCGCCG CGTGCGCCGT CACCGCCCTC GTCGCCGTGC TCGGCGTGCG CCGCACGTCG
GGAGCCGCGC TGCGGATGGG GGACGACCGA TGA
 
Protein sequence
MRWRGRLLRG RGQDQAPVVL VVTLVAVASA TLVGLLAGLL HVAERDAVPQ AISRLDPQRT 
HLEATLWVRG DDVEPALDRA RDGLARITGD VPTTERTWLI GGLRALPTEL GVPPELTYLA
ALPHDDEDLV RLASGRWPAA STDADGRVEV NVPVVAAQAL GWEVGSTVHA RPWGEEHGDA
FVVVGTHEPA GPRSAWSRDR LRGQGRSAGF NLPGSAGLIR TTAWGPLVVD PAVLTRPQMV
DTAYLVVEPD LAASTADAVA ALRTQVDDGA RILSDALTGP VSGRLQTDVD TTIDATWREL
VVTRAAVVTI GLLLGTLATT VLLLTARLLA ERRAGEAELL AARGASPAQL RSTVLLEALV
LATLTWLVSP WLARGALAVV TRSGPPAEAG YTVPEGVPGG VLLACGAIAV ALAVTLCVPA
WHTAGSTSRS VHGGLLRVGG DLALLVLGAL ALGQLVAYGS PLTRGADGPR LDPVLVAGPA
LVCLAAATVA LRLVAPVARA GERLARGARS LVLPLAAWQV ARRSAVATGT VLVVVVAVAA
GTFSAAFAAT WRTSQVEQVD LALGTDLRAD AIEEDPLAAS AALAAATAAY PDAHGQPVTD
RVVGMGPRGN TGGVGGRLVA LDASRPQDLR GRSTTPWREV VAGLHADEPS TTVGTELPAG
TQWLVLTGSV DTDPFLSGTA VLGLGVEDDQ GVLTQLPPRT APLGRPFEVV LEVPVADRLR
VVATDLTVSV HEPESVLTTD SRLVPVRTTL TSLRAVPRSA GIGRELDVHE APAQPVPLRL
DGWTGVVTQG ESTVGAPELV PVGAPGAWHV TGTMKVDTWG TPPVRVLSAA WPLPRTVPAA
VSESVLDVLE TRQGLTITVA GVSVPLQVER LVEQVPGVPR GIGVVVDRTT LSRAVLTAGG
RTDLLDSWWV AAPATTTAAL AADLAGVDAD VTTRAAERHD ALTGPVRVAV PTAVSLVAVS
AVLLVLVGTG AVAAASLRSR RLELARLQAL GASRAGLVGG LLAETTLLVT VGALAGLAAG
YGLAAAVAPL LTMSPDGRTP APEPWLVWGW GTQSLRTLGV VAAACAVTAL VAVLGVRRTS
GAALRMGDDR