Gene Cfla_2204 details

Gene Information

Locus tag: Cfla_2204
Symbol:
ID: 9146104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2460491
End bp: 2461861
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: CBS domain containing protein
Protein accession: YP_003637294
Protein GI: 296130044
COG category:
COG ID:
TIGRFAM ID:
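
The coordinates, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a stop codon. The record does not state that convention, so it is an assumption here, and the function names below are hypothetical. A minimal sketch of the check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(2460491, 2461861)
print(length, protein_length(length))  # 1371 456
```

Both values match the Gene Length and Protein Length fields of the record.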


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00693904
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGCG TCCCCGTCGG GCTGCTCGTC GCCGTCGCCG TCGTCGGGAT CCTGCTGGCC 
GCGGCGCTGT CCGCGGGCGA GGTCGCGGTG CTGCGCGTCA CGCGCGCCCG CGTCACCGAG
CTCGAGGCCG AGCGTCCCGG TGCGGCCGCG CGGGTGCGCC GGCTCGTCGA CGACCCGGCA
CGCGTCACGG CCGCGGCGTC GTTCGTGCGG CTCCTCGGCG AGATGACGGC CACCGTGTGC
CTCACGCTCG CCATCAGCGC CGGCAGCCTG TCGTGGTGGG CCACGGCGCT GCTCGCGATC
GCCGCGTGCG CCGTCGTCGC GTTCCTGCTC GTGCGCGTCA GCCCGCGCAG CATGGGCCGG
CGCCACCCCG TCGGCGTGCT CGCGAACCTG TCGCGTCTGC TGCTCGCCGT CACCGCGCTC
GCGGGCGGCG TGGGCCGCGG TGGCGAGGCC GCGACCACCA CGAGCGAGCA GGACGACGCC
GAGCTGCGCG ACATGGTCGA GCGCGTCAGC GAGTCCGACG CGATCGAGGA GAACGAGCGC
GAGATGTTCC GCTCGGTGCT CGAGCTCGGG GACACCCTCA CGCGCGAGGT CATGGTGCCC
CGCACGGACA TGATCACGAC GCAGGCGGAC ACGCCCCTGC ACAAGGTGCT GGCGCTGCTG
CTGCGCTCCG GCTTCTCGCG CGTGCCCGTG GTGGGGGAGT CGGTCGACGA CGTGGTCGGG
GTGCTCTACC TCAAGGACGT CGTGCGCCGC ATCCCCGCCC ACGGTCACGG CCACGGCAAC
GGCGACGGCG ACCCGCTCGA CGCGCCCGCG GCGTCCCTCG CGCGTCCCGC GGTCTACGTG
CCGGAGTCCA AGCCCGTCGA CGAGCTGCTC CTGGAGCTGC GCGACGGGTC CAGCCACATC
GCGCTCGTCG TGGACGAGTA CGGCGGCATC GCCGGGCTCG TGACCATCGA GGACGCGCTC
GAGGAGATCG TCGGCGAGCT CACCGACGAG CACGACGCCA GCGCGCCCGT CGTCGAGGAG
CTCGAGGACG GCGGCTACCG CGTCCCGGCG CGCCTGGGTC GCGACGAGCT CGGCGACCTG
TTCGGCCTCG AGGTCGAGGA CGAGGACGTC GACACCGCGG CCGGTCTGCT CGCCAAGGCG
CTCGGCAAGG TGCCCCTCCC GGGTGCCGTC GGTGAGATCC ACGGTCTGCG GCTCGAGGCC
GAACGTGTCG AGGGCCGCCG CAAGCGCCTG GCGACCGTGC TCGTGCACCG GGCCGAGGAG
GCCACGGAGG ACGCCGCCCC CGCTACACCT GCCCGCGGCA CCCATGCCGC GGGAACCCCG
TCGCGCGGCA CCCCCACCGT GCGGGACCAC GGCTCGGAGG CCGCCCGATG A
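
The record reports a GC content of 75% for this gene. One way to recompute such a figure from the block-formatted sequence is sketched below. The helper names are illustrative and not part of the database, and the sketch assumes that GC content means the count of G and C bases divided by the total sequence length.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence: 15 of the 20 bases are G or C.
print(gc_fraction("ATGAGCGGCG TCCCCGTCGG"))  # 0.75
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.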
 
Protein sequence
MSGVPVGLLV AVAVVGILLA AALSAGEVAV LRVTRARVTE LEAERPGAAA RVRRLVDDPA 
RVTAAASFVR LLGEMTATVC LTLAISAGSL SWWATALLAI AACAVVAFLL VRVSPRSMGR
RHPVGVLANL SRLLLAVTAL AGGVGRGGEA ATTTSEQDDA ELRDMVERVS ESDAIEENER
EMFRSVLELG DTLTREVMVP RTDMITTQAD TPLHKVLALL LRSGFSRVPV VGESVDDVVG
VLYLKDVVRR IPAHGHGHGN GDGDPLDAPA ASLARPAVYV PESKPVDELL LELRDGSSHI
ALVVDEYGGI AGLVTIEDAL EEIVGELTDE HDASAPVVEE LEDGGYRVPA RLGRDELGDL
FGLEVEDEDV DTAAGLLAKA LGKVPLPGAV GEIHGLRLEA ERVEGRRKRL ATVLVHRAEE
ATEDAAPATP ARGTHAAGTP SRGTPTVRDH GSEAAR
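
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. A minimal sketch of that translation follows. The names are hypothetical; the codon table is written in the compact TCAG ordering with the amino acid assignments of the standard code, which table 11 shares. The sketch does not handle alternative start codons, which table 11 allows and which would be read as methionine at the first position.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order; '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence.
print(translate("ATGAGCGGCG TCCCCGTCGG G"))  # MSGVPVG
```

The output matches the first seven residues of the protein sequence above.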