Gene Cfla_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2010 
Symbol 
ID9145905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2238612 
End bp2239859 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content76% 
IMG OID 
Productcysteine/1-D-myo-inosityl 2-amino-2-deoxy-alpha- D-glucopyranoside ligase 
Protein accessionYP_003637104 
Protein GI296129854 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCAGCT GGCCCGCGCC CCAGATCCCT CGGCTGCCCG GGACCGGTGA GCCGGTCCGT 
GTGCTCGACA CCGCCACCGG GCGCCTCGTC GTGGCCGCGA CCGGCCCCCA CGCCCGCCTG
TACGTGTGCG GCATCACCCC CTACGACGCG ACCCACCTCG GCCACGCGTC CACGTACGTC
GCCTTCGACG TCCTCGTGCG CGCATGGCTC GACGAGGGCA AGACCGTCAC CTACGCGTCG
AACGTCACCG ACGTCGACGA CCCGCTGCTC GAGCGCGCCA CGGCCACAGG CGTCGACTGG
CGCGACCTCG CGGCGCAGCA GACCGCGCTG TACGCGTCCG ACATGACGAC GCTCGGGGTC
GTGCCCCCGG ACGTCTACCG CGGCGTCGTG GAGTCCGTGC CACAGGTCGT CGCGGCCGTC
GACGCGCTCC TCTCGCGCGA CGCCGCCTAC CGGCTGCCCG CCCCGGACGG CGGCGACGAC
GTGTACGCCG ACCTGTCCGC CGATCCGGGT TTCGGCTCCG TCGCCGGGCT CGAGCACGCG
GCGATGCTGG CACTGAGCGC CGAGCGTGGC GGCGACCCCG ACCGCCCGGG CAAGCGGTCG
CCCCTCGACC CGCTGCTGTG GCGGGCCGAG CGCCCCGGCG AGCCCGCCTG GGACGCGCCG
GGTCTCGGCC GTGGACGCCC GGGCTGGCAC GTGGAGTGCG CGGTCATCGC CTCCGACGGT
CTCGGCGTGC CGTTCGACGT GCAGGGGGGC GGCTCCGACC TCGCGTTCCC GCACCACGAG
TCGAGCGCGT CGCACCTGCG CGTGCTGACC GGGACGCCCC AGCCTGCCGC CGCGCACGTG
CACACCGGGA TGGTGGGCTA CCGGGGCCAC AAGATGAGCA AGTCGCTCGG CAACCTCGTC
CTCGTCTCGC AGCTGGTCGC CGACGGCGTC GAGCCCATGG CCGTGCGTCT CGCGGTGCTC
GCGCACCGCT ACCGCTCGGA CTGGGAGTGG ACCGACGACG TGCTGGCCAC AGCGCAGCAG
CGGGTCGCCA GGTGGCGGCG CGCCCTGTCC GGCAACGGCG GGCCGGCAGC CCAGCCGGTC
CTCGACGGCG TGCGGGCGGC CGTCGCGGAC GACCTCGACA CCCCGCGGGC GCTCGCGGTC
GTGGACGCGT GGGCAACCGC CGCCCTCGCC GGCGAGGTGC CGTTCGAGGA GGGCGCGCCC
GGGGTCGTGG CGCGCACGGT CGACGCCCTG CTCGGCGTGC GCATGTGA
 
Protein sequence
MLSWPAPQIP RLPGTGEPVR VLDTATGRLV VAATGPHARL YVCGITPYDA THLGHASTYV 
AFDVLVRAWL DEGKTVTYAS NVTDVDDPLL ERATATGVDW RDLAAQQTAL YASDMTTLGV
VPPDVYRGVV ESVPQVVAAV DALLSRDAAY RLPAPDGGDD VYADLSADPG FGSVAGLEHA
AMLALSAERG GDPDRPGKRS PLDPLLWRAE RPGEPAWDAP GLGRGRPGWH VECAVIASDG
LGVPFDVQGG GSDLAFPHHE SSASHLRVLT GTPQPAAAHV HTGMVGYRGH KMSKSLGNLV
LVSQLVADGV EPMAVRLAVL AHRYRSDWEW TDDVLATAQQ RVARWRRALS GNGGPAAQPV
LDGVRAAVAD DLDTPRALAV VDAWATAALA GEVPFEEGAP GVVARTVDAL LGVRM