Gene Cfla_2200 details

Gene Information

Locus tag: Cfla_2200
Symbol:
ID: 9146100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2455252
End bp: 2457009
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: FHA domain containing protein
Protein accession: YP_003637290
Protein GI: 296130040
COG category:
COG ID:
TIGRFAM ID:
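
The start and end coordinates, gene length, and protein length in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2455252, 2457009)
print(length_bp)                  # 1758
print(protein_length(length_bp))  # 585
```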


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.632754
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00361499
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGACC GCACGCACCA CGAGTACCAG GACGGACCCT GGACGGCCGT CGTGTCCGAC 
GGGTTCCTCG CGCTGGTCGA GCCGGACGCG CCGTGGCGCC TCGTCGACGG CCTGTGGCGG
GTGGCCAGCG AGGGCGGCGA CGTGCTGGGC GCCCTCGGCG TGGTCGCGGC CGACGGGTTC
GCTGCCCTGC CCGGGTTCGC GGTGCTGCGC GCCGACGGCG ACGGTGCCGT GCACGCGGTG
CTGCGTGGTG CGGTGCGGCT CCGCCTGCAC GGCGTGGACG GCACCCAGGA GGTCACGGCC
GGTGACGGGG CCGTGTGGAC CGAGCACCGT GCGTTGGGCG TGCTCGGCCT CGAGCTCGCT
GTCGACGCGG TGCCCGACGC GACCTGGTGG CCGCTGACCG GCGGTGTCGT GCGGTCCGGC
GGTCTGCGGA CCGGTGAGGT GCCGGACGCC GCCACGGTGG TGCCCGTGGC GCTGCCGGAC
ACCGCGACGC ACGAGCAGCC GATCGTGCCG GAGCCCGCCG GCGACGCGGT GCCGGAGCAC
CCCGTGGTGG TCGACGTCGC CGCGGTGCCG GGCGGCTCGG TGCCGGGCGG CGCGGTCGCC
GCGGCGGAGC CGCCGCGGCC GTCCGGGGCG GAGATCGAGC CGTCGGCGGA GACCGAGTCG
GCGGAGGAGA GCGAGCCGGC GGAGGAGAGC GAGCCGGCGG CGGAGACCGA GTCGTCGGCG
GAGACGGGGC CGTCGGGGGA GACCGAGCCG CCCGCAGGGT CCGAGCCGGA GGCGGCGCCC
GAGCCTGCGG GCCCCACCGT GGCCGCGCCG GTCGAGGCCG CCGTCCCCGC CGATCCCGCG
GGCGTGGTCG AGCCCGACGT GGCGACGCTG CCGGCACCCG AGCTCGGCCT GCCCGAGCCG
CTCGAGCCGG CGCCCGCCGC GCCGGTCGAG CAGGTGGAGG TCGACCCGTG GGCGCCCGCG
CCGGTCGCAG CCGCGCTGCC GGCGGCACCC GTCGAGCCCG ACGTGCCCAC CGAGCGGCTC
TCGCCCGAGG AGCTGCTCGA GGCGGCCGGC CCGCCCGCAT GGTCCGGCGC GGAGGCGGTG
GCGTGGTCGG CGGCGGGGCC CGCGGACACG CCGGCGCCGG CACCGGAACT GGCACCAGCC
CCGGAGCGCG AGCCCGAGAT CCCCTGGTGG CCGCTGGGCG ACGCGGGGAC CGCCGAGCCC
GCGCCGACCC CCGCCGCGCC CGCGTCCACG CCCGCCCCGC CCCCGTTCGC GCCGGTGACC
GCCCCCGCGG CGGTCGCCGA CGAGACGGCG GGGTCCGACG ACCACGACGG CATGACGATC
CTGTCCTCCG ACCTCGCGCG GCTGCGCGAC CGTCTCCCGG CCTGGTCGCA GGACGCCGAG
CCCGGGCCGT TCCCCGTGCC GCAGCCCGCG CCGCTGGCCG CGCGCATGGT GCTCTCGACG
GGACTCGTGG TCGCGCTCGA CCGCGCCGTC CTGCTGGGAC GCGCGCCGCA GGTCGCGCGC
GTGTCCAACC GTGAGCTGCC GCGCCTGGTG ACGGTGCCGA GCCCCAACCA GGACATCTCG
CGGACGCACG CCGAGGTGCG CGTCGAGGGC GACCACGTCA TCGTCACCGA CCTCGACTCC
ACCAACGGGG TGCACGTGTC GCGGCCCGGC GAGGGCGTGC GGCGGCTGCA CCCCGGCGAA
CCGAGCGTCG TGGGGCCCGA CGAGGTCGTC GACCTGGGCG ACGGCGTCAC GTTCACCGTG
GAGCGCAGCG CGTCGTGA
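
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence printed in 10-base blocks like the one above. The following is a minimal sketch: `gc_content` is a hypothetical helper, and it assumes the sequence contains only the letters A, C, G, and T plus whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence shown above.
print(gc_content("ATGAGCGACC"))  # 60.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.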
 
Protein sequence
MSDRTHHEYQ DGPWTAVVSD GFLALVEPDA PWRLVDGLWR VASEGGDVLG ALGVVAADGF 
AALPGFAVLR ADGDGAVHAV LRGAVRLRLH GVDGTQEVTA GDGAVWTEHR ALGVLGLELA
VDAVPDATWW PLTGGVVRSG GLRTGEVPDA ATVVPVALPD TATHEQPIVP EPAGDAVPEH
PVVVDVAAVP GGSVPGGAVA AAEPPRPSGA EIEPSAETES AEESEPAEES EPAAETESSA
ETGPSGETEP PAGSEPEAAP EPAGPTVAAP VEAAVPADPA GVVEPDVATL PAPELGLPEP
LEPAPAAPVE QVEVDPWAPA PVAAALPAAP VEPDVPTERL SPEELLEAAG PPAWSGAEAV
AWSAAGPADT PAPAPELAPA PEREPEIPWW PLGDAGTAEP APTPAAPAST PAPPPFAPVT
APAAVADETA GSDDHDGMTI LSSDLARLRD RLPAWSQDAE PGPFPVPQPA PLAARMVLST
GLVVALDRAV LLGRAPQVAR VSNRELPRLV TVPSPNQDIS RTHAEVRVEG DHVIVTDLDS
TNGVHVSRPG EGVRRLHPGE PSVVGPDEVV DLGDGVTFTV ERSAS
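
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under some assumptions: translation table 11 assigns codons to amino acids exactly as the standard code does (it differs only in which codons may act as starts), the reading frame begins at the first base, and translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative. Alternative start codons are not handled specially, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGCGACC GCACGCACCA CGAGTACCAG"))  # MSDRTHHEYQ
```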