Gene Cfla_3661 details

Gene Information

Locus tag: Cfla_3661
Symbol:
ID: 9147577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4045740
End bp: 4047038
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: transposase mutator type
Protein accession: YP_003638731
Protein GI: 296131481
COG category:
COG ID:
TIGRFAM ID:
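
The length fields in this record follow from the coordinates, and a short sketch can make the arithmetic explicit. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Coordinates copied from the record above.
length_bp = gene_length(4045740, 4047038)
print(length_bp)                  # 1299
print(protein_length(length_bp))  # 432
```

Both results agree with the Gene Length and Protein Length fields listed for Cfla_3661.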


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.356207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 103
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA CGATCGAGCC CGTGAAGCTC GATGAGGATG AGAACACGCA GATCGCCCGG 
ACGATGGTCG CCCAGGCGCG GGCGTCCGGG CTGGACCTGG TCGGCCCGGA CGGGCTGCTC
GCGGGTTTGA CCAAGCAGGT CCTCGAGCTG GCCCTCGAGG AAGAGCTGAC CGACCACCTC
GGGTATCGAC CGGGTGAGCG GGAGGGCAAG TCAGGTTCGA ACGAGCGCAA CGGGAGCCGC
TCGAAGACCG TGATCACCGA GATCGGCCCG GTGCAGATCG ACGTGCCGCG TGACCGTGAG
GGCACCTTCG AGCCGGCGAT CGTCAAGAAG CGCCAACGCC GGTTGGAGGG CGTCGACGAG
CTCGTGCTCT CACTGACGGC GCGGGGGCTG ACGACCGGGG AGATCTCGGC GCACTTCGCC
GAGGTCTACG GCACCCAGGT CTCCAAGGAC ACGATCAGCC GGATCACCGA GAAGGTCACC
GCGGAGATGG CCGAGTGGCA GACCCGCCCC CTGGACGTGG TCTATCCGGT GATCTTCATC
GACGCGATCG TGGTCAAGGT CCGCGACGGC GCGGTGACGA ACAAGCCGTT CTACGTCGTG
ATCGGCGTGA CCACCCGCGG GGAGCGCGAC ATCCTGGGGA TCTGGGCCGG GGACGGCGGG
GAGGGAGCGA AGTACTGGTT GAACGTGCTC ACCGAGATCA AGAATCGCGG CGTCACGGAC
GTGTGCATAG CGGTCTGCGA CGGCCTCAAG GGGCTGCCGG AGGCGATCAC CACGGTCTGG
GAGCTGACCC AGGTGCAGAC CTGCGTGATC CACCTGATCC GCAACACGTT CCGCTACGCC
GCCCGCCAGG ACTGGGACGC CGTGGCCCGC GACCTCAAGC CGATCTACAC CGCGGTCAAC
GCCGAGCAGG CGAGCGCGCG GATGGACGAC TTCGCCGACA AGTGGGCCGG CAAGTACCCG
GCGGCGGTCA AGCTGGTGGC GCACCGCCTG GCCCGAGTTC GTCCCGTTCC TGGACTACGA
CGTGGAGATC CGCAAGATCA TCTGCACGAC CAACGCGATC GAGAGTCTGA ACGCGCGCTA
CCGGCGAGCA GTCCGGGCCC GGGGGCACTT CCCGAACGAC GCCGCGGCCC TGAAGTGCCT
CTACCTGGTG ACCCGAGCGC TGGACCCCAC CGGACGGGGC CGGGCACGAT GGGTCATCCG
ATGGAAGGCC GCCCTCAACG CGTTCGCCAT CACCTTCGAC GGGCGCATCA ACCCCTCGAA
CCTCTGAAGA ACGCCGGGCC CCGCCCACCG TTCATCTGA
 
Protein sequence
MNDTIEPVKL DEDENTQIAR TMVAQARASG LDLVGPDGLL AGLTKQVLEL ALEEELTDHL 
GYRPGEREGK SGSNERNGSR SKTVITEIGP VQIDVPRDRE GTFEPAIVKK RQRRLEGVDE
LVLSLTARGL TTGEISAHFA EVYGTQVSKD TISRITEKVT AEMAEWQTRP LDVVYPVIFI
DAIVVKVRDG AVTNKPFYVV IGVTTRGERD ILGIWAGDGG EGAKYWLNVL TEIKNRGVTD
VCIAVCDGLK GLPEAITTVW ELTQVQTCVI HLIRNTFRYA ARQDWDAVAR DLKPIYTAVN
AEQASARMDD FADKWAGKYP AAVKLVAHRL ARVRPVPGLR RGDPQDHLHD QRDRESERAL
PASSPGPGAL PERRRGPEVP LPGDPSAGPH RTGPGTMGHP MEGRPQRVRH HLRRAHQPLE
PLKNAGPRPP FI
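
The sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any further use. The following is a minimal sketch of that cleanup, with two simple checks on the result: the GC fraction, and whether the sequence ends in frame on a stop codon. The helper names are hypothetical, and the stop codon set is the one used by NCBI translation table 11, which this record lists.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpts copied from the gene sequence above: the first two blocks
# and the final line.
head = clean_sequence("ATGAACGACA CGATCGAGCC")
tail = clean_sequence("CCTCTGAAGA ACGCCGGGCC CCGCCCACCG TTCATCTGA")
print(len(head))             # 20
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.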