Gene Cfla_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0653 
Symbol 
ID9144523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp708460 
End bp709791 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content77% 
IMG OID 
ProductUDP-N-acetylglucosamine 
Protein accessionYP_003635764 
Protein GI296128514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.119521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000107829 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCACCG ACCGGGCGCC GCGCGTCGCC ATGCTCTCGG TCCACACGTC CCCCCTCGAC 
CAGCCCGGCA CGGGCGACGC GGGCGGCATG AACGTGTACG TCCTGGAGCT CGCGCACGCG
CTGGCCGCGC GTGGGGCCCG GGTCGAGGTC TTCACGCGCG CGACGCGGTC CGACGTGCCG
GAGACGGTCG TCCTCGACGG CGTCGACGCC GCGGGGCGCG CCCTGACGGC CGACGACGCG
CGCGACGTGC TGCTCGCGCA CGACGTGCCG CCGGGCGTCA CGCCGCCCGT GCTCGTCCAC
CACGTGCCTG CGGGCCCGTT CGAGGCGCTC GACAAGAACG ACCTGCCGGG CGTGCTGTGC
GGCATGGCCG CGGGCGTCCT GCGCTCCGAG GCGGCGCGCC GTCCCGGCTG GTACGACGTC
GTGCACTCCC ACTACTGGCT GTCCGGGCAG GTCGGCGCGA TCGCGGCGCA GCGCTGGGAG
GTGCCGCTCG TGCACACCGC GCACACCCTC GCCAAGGTGA AGAACGCGTC GCTCGGGCCG
GGGGACAGCG CCGAGCCGAG CGTGCGCGTC GTGGGCGAGG AGCAGGTCGT CGCGGACGCC
GACGCGCTCG TCGCGTCCAC GCCCGTGGAG GCGCGTGAGC TCGTCGAGCT GTACGGCGCC
GACCCGGCGC GCGTGCACGT CGTCGAGCCG GGCGTCGACC TCGAGCGGTT CCGTCCCGGC
GGCCCGGGCG CGCGCGACGA GGCGCGACGG CGGCTCGGTC TGCCGACCGA CCGGCCGGTC
GTGCTGTTCG CCGGGCGCGT GCAGCCGCTC AAGGCGCCGG ACGTGCTGGT GCAGGCGGTC
GGGGTGCTGC GTGCGAGCGG GCGGCCCGTC CCGCTGCTCG TCGTGCTCGG CGGCCCGTCG
GGCCGGCCGA CGGCGGTGCG TGAGCTGCGC GCCCTGGCCG TGACGCTCGG GGTCGACGAC
GACGTGGTCG TGCGCCCGCC CGCGCCGCGT GACGAGCTCG TCTCCTGGTA CCGCGCGGCG
GACCTCGTCG CGATGCCGTC GCGCTCGGAG TCGTTCGGGC TGGTCGCCGT CGAGGCGCAG
GCCAGCGGCA CGCCGGTGCT GGCGGCCGAC GTCGGCGGCC TGCGGACCGT CGTCGAGGAC
GACGTCTCCG GTCGCCTCGT GCCGGGCCAC GACCCTCAGG TGTGGGCCGA GGTGATCGCC
GACGCGCTCG CTGACGCCCC GCGCCGCGCC CGCTGGGCCG CCGGCGCCCG TCAGGTGGCC
GAGCGTTACG CGTGGACCAC GGCCGCCGAC CAGGTGCTCA AGGTCTACGC GGTCGCCGCC
GAGCCCCGCT GA
 
Protein sequence
MTTDRAPRVA MLSVHTSPLD QPGTGDAGGM NVYVLELAHA LAARGARVEV FTRATRSDVP 
ETVVLDGVDA AGRALTADDA RDVLLAHDVP PGVTPPVLVH HVPAGPFEAL DKNDLPGVLC
GMAAGVLRSE AARRPGWYDV VHSHYWLSGQ VGAIAAQRWE VPLVHTAHTL AKVKNASLGP
GDSAEPSVRV VGEEQVVADA DALVASTPVE ARELVELYGA DPARVHVVEP GVDLERFRPG
GPGARDEARR RLGLPTDRPV VLFAGRVQPL KAPDVLVQAV GVLRASGRPV PLLVVLGGPS
GRPTAVRELR ALAVTLGVDD DVVVRPPAPR DELVSWYRAA DLVAMPSRSE SFGLVAVEAQ
ASGTPVLAAD VGGLRTVVED DVSGRLVPGH DPQVWAEVIA DALADAPRRA RWAAGARQVA
ERYAWTTAAD QVLKVYAVAA EPR