Gene Cfla_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1960 
Symbol 
ID9145854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2179831 
End bp2181309 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content68% 
IMG OID 
ProductRNA binding S1 domain protein 
Protein accessionYP_003637054 
Protein GI296129804 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCT CCACCCCCGC GAAGCCCGCC AGCCCGCAGG TCGCGGTGAA CGACATCGGC 
TCGGCCGAGG ACTTCCTCGC CGCGATCGAC GCCACCATCA AGTACTTCAA CGACGGCGAC
ATCGTCGAGG GCACCATCGT CAAGGTCGAC CGCGACGAGG TCCTGCTCGA CATCGGTTAC
AAGACCGAGG GCGTCATCCC CTCCCGCGAG TTGTCCATCA AGCACGACGT GGACCCCGGC
GAGGTCGTGA AGGTCGGCGA CGCGGTCGAG GCCCTCGTCC TCCAGAAGGA GGACAAGGAG
GGTCGGCTGA TCCTGTCCAA GAAGCGCGCG CAGTACGAGC GGGCGTGGGG CACGATCGAG
AAGATCAAGG AGGAGGACGG CGTCGTCACC GGCACCGTCA TCGAGGTCGT CAAGGGCGGC
CTCATCCTCG ACATCGGCCT GCGCGGCTTC CTGCCGGCCT CCCTCGTCGA GATGCGTCGC
GTCCGCGACC TCCAGCCGTA CGTCGGCAAG GAGATCGAGG CGAAGATCAT CGAGCTCGAC
AAGAACCGCA ACAACGTCGT GCTGTCGCGG CGTGCGTGGC TCGAGCAGAC GCAGTCCGAG
GTGCGCTCGA CCTTCCTGCA GACCCTGCAG AAGGGCCAGG TCCGTCCCGG TGTCGTGTCC
TCGATCGTCA ACTTCGGCGC GTTCGTCGAC CTGGGCGGCG TGGACGGGCT CGTGCACGTC
TCCGAGCTGT CCTGGAAGCA CATCGACCAC CCGTCCGAGG TCGTCGAGGT CGGCCAGGAG
GTCACGGTCG AGGTGCTCGA GGTCGACTTC GACCGCGAGC GGGTCTCGCT GTCGCTGAAG
GCGACGCAGG AGGACCCGTG GCAGGCGTTC GCCCGGACGC ACGCCATCGG CCAGGTCGTG
CCCGGCAAGG TCACCAAGCT CGTCCCGTTC GGTGCGTTCG TGCGCGTCGA GGACGGCATC
GAGGGCCTTG TCCACATCTC GGAGCTGGCC GTGCGCCACG TCGAGATCCC GGAGCAGGTC
GTGCAGGTCG GCGACGACGT GTTCGTCAAG GTCATCGACA TCGACCTCGA GCGTCGCCGC
ATCTCGCTGT CGCTCAAGCA GGCGAACGAG GGCTTCGACC CCGAGTCGGA CGACTTCGAC
CCCGCGCTGT ACGGCATGGC GGCCGAGTAC GACGAGCAGG GCAACTACAA GTACCCCGAG
GGCTTCGACC CGACGACGAA CGAGTGGCTC GAGGGCTTCG AGGCCCAGCG CGAGGCGTGG
GAGGCCCAGT ACGCGGCCGC CCACGAGCGC TGGGAGGCGC ACCGCAAGCA GGTCGCCGCG
GCGGCGCAGG CCGACCTCGA GGCGTCGACG TCGGAGTCCG GCGCCGCCTC GAGCGGCCCG
GTGCCCTCGA CGTACTCGTC GGCCAGCGAG GAGGCCACCG GCACCCTCGC GTCGGACGAG
GCGCTCGCCG CGCTGCGCGA GAAGCTCACC GGCAACTGA
 
Protein sequence
MSISTPAKPA SPQVAVNDIG SAEDFLAAID ATIKYFNDGD IVEGTIVKVD RDEVLLDIGY 
KTEGVIPSRE LSIKHDVDPG EVVKVGDAVE ALVLQKEDKE GRLILSKKRA QYERAWGTIE
KIKEEDGVVT GTVIEVVKGG LILDIGLRGF LPASLVEMRR VRDLQPYVGK EIEAKIIELD
KNRNNVVLSR RAWLEQTQSE VRSTFLQTLQ KGQVRPGVVS SIVNFGAFVD LGGVDGLVHV
SELSWKHIDH PSEVVEVGQE VTVEVLEVDF DRERVSLSLK ATQEDPWQAF ARTHAIGQVV
PGKVTKLVPF GAFVRVEDGI EGLVHISELA VRHVEIPEQV VQVGDDVFVK VIDIDLERRR
ISLSLKQANE GFDPESDDFD PALYGMAAEY DEQGNYKYPE GFDPTTNEWL EGFEAQREAW
EAQYAAAHER WEAHRKQVAA AAQADLEAST SESGAASSGP VPSTYSSASE EATGTLASDE
ALAALREKLT GN