Gene Cfla_1349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1349 
Symbol 
ID9145233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1494701 
End bp1496401 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content78% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003636446 
Protein GI296129196 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000440573 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCGCGC AGCCCGGGAC CGTCCCTGCG CAGACGTCAG AGCCTGCGCC GGGAGCCGTC 
GCCGAGCCCG CGACGGCCCC GCAGATCCTC GCGCGCCCGC TCGCGCAGAC GTCCCTGCTC
GACGCCGTGC GGGACCTGCG GCGCGACGTC GACGCCACGT CGTTCCCGCT CGAGATCCCC
GGGGTCGGCG ACGCGCGGGC GTCGCGCGCA CGACTCGTCG ACCAGCTCGA CGAGCACCTC
GTGCCCCGCC TGACGGAGCT GTCCGCGCCG GCGGTGGTCG TCGTGGCCGG GTCCACGGGT
GCCGGCAAGT CGACGCTCGT GAACTCCCTG GTCGGGCGTG AGGTGACCGC GGCCGGCGTG
CTGCGACCGA CGACGCGCGA GCCCGTCCTC GTGCACCACC CGCTGGACAC GGACCTGCTG
TCCCACCACC CCGTGCTCGA CGAGGTCGAC GCCGTCGCGG TCGACACCGT GCCGCGGGGC
ATCGCGATCC TCGACGCACC CGACCTCGAC TCCGTCCTCG ACTCCAACCG CGACACCGCG
CACCGTCTGC TCGAGGCGGC CGACCTGTGG CTCTTCGTCA CGACCGCGTC GCGCTACGGG
GACGCGCTGC CGTGGCAGGT GCTCCGCTCG GCCGTCGAAC GCAGCACGTC CGTCGCGATG
GTGCTCAACC GCGTGCCCGC CGCCTCGCTG CCCACCGTGC GCGGCGACCT GCTCGAGCGG
CTGCGTGCCC ACGGCCTGGC GGGATCCCCG CTCTTCGTCA TCCCCGACGT GGGTCCGCAC
TCCGGCCCGC TGGCCGGCCC CGTCGTGGCG CCCGTCCTGC GCTGGCTCAC CACGCTGGCC
GGCCCGGACC GGGCCCGCAC GGTCGTCGCC CGCACGCTGC GCGGCTCGCT CGCCGCACTG
CGCCCGTGGG TCGACGAGCT CGCGGAGGCC GTGCAGGACC AGGCCGACGC CGCGGCACGG
ATCTCCCGCA CGCTGGACGA GGCAACCGCC GCACCGGGCG ACGCCGCCGC CCGGACGGTG
CGCTCGGGAG CTGTCGCCGA CGGCGCCGTG CGCGCCCGCT GGGCCGAGCT CGTCGCCAAG
GGAGCACCGT TCGCGCGCCT CGTCGGCCGG TCGGGACGCG TCCGCGGCTC CTCGCGCACC
GCACGCGCCC GCGCGGCCGC GGTCGCGCCC CTCATGTCGG ACCTGACCGA GTCCACGGCG
TCGGTGCTCA CGGCGGTCGG GCTGCGCGCG GGCGCCGCGC TGCGCGCGTC GCTCACCGGG
CCGCAGGCAC CGCCCGGCGG GGACTCGGTC CTCGCGCGCT GGCCCGACGG CGAGGCGTCG
CGCGGGGCGG CCGCCGAGCG CGCCGCCCGT GCGTGGTCGG GAGAGGGTGC GCGGCACGTC
CGGGTGCTGC TCGCCGGGAG CGGCGCGGAC GCCCGTCGCC GGGCGCAGGT GAGCAGGGCC
GTGGGGGAGG AAGGGCTGAC CGCCCTCGTC CTCGCCGCGG CCGCCGGCGT CGACGAGGCG
GCCGCGGCCG CCCGCACGCT GCTGGGCGAC CCCGCCGACG AGGTCGTCAC CGCGTTGCGC
GACGACCTCG CGCGGCGCGC GCGCACGCAG GTGGACCTCG AGCGCACCAT CGCCGAGCGC
ACGCTCGACG ACCCCGACCT CGCGGCGGAC GCGTCGTCGC GCCTGCGTCT GCGACTCGCC
GTCCTCAAGG GGCTGACGTG A
 
Protein sequence
MSAQPGTVPA QTSEPAPGAV AEPATAPQIL ARPLAQTSLL DAVRDLRRDV DATSFPLEIP 
GVGDARASRA RLVDQLDEHL VPRLTELSAP AVVVVAGSTG AGKSTLVNSL VGREVTAAGV
LRPTTREPVL VHHPLDTDLL SHHPVLDEVD AVAVDTVPRG IAILDAPDLD SVLDSNRDTA
HRLLEAADLW LFVTTASRYG DALPWQVLRS AVERSTSVAM VLNRVPAASL PTVRGDLLER
LRAHGLAGSP LFVIPDVGPH SGPLAGPVVA PVLRWLTTLA GPDRARTVVA RTLRGSLAAL
RPWVDELAEA VQDQADAAAR ISRTLDEATA APGDAAARTV RSGAVADGAV RARWAELVAK
GAPFARLVGR SGRVRGSSRT ARARAAAVAP LMSDLTESTA SVLTAVGLRA GAALRASLTG
PQAPPGGDSV LARWPDGEAS RGAAAERAAR AWSGEGARHV RVLLAGSGAD ARRRAQVSRA
VGEEGLTALV LAAAAGVDEA AAAARTLLGD PADEVVTALR DDLARRARTQ VDLERTIAER
TLDDPDLAAD ASSRLRLRLA VLKGLT