Gene Cfla_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1823 
Symbol 
ID9145716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2034142 
End bp2035278 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content70% 
IMG OID 
Producttwitching motility protein 
Protein accessionYP_003636919 
Protein GI296129669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.227924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.842745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCG ACGAGGTCCT GCGGGAGATG GTGCGCCTCG GGGCGTCCGA CGCGCACTTC 
ACGACGGGCT CGCCGCCGAT GGTGCGGCTC TCGGGCGCGC TGCAGCCCCT CGAGCAGTTC
GGGCCGGTGA TGCCCGACGG CCTGCGGCGC TCGCTCTACG CGATCCTCAC GCAGAAGCAG
CGGGAGCGGT ACGAGGAGGA GCTCGAGCTC GACGTCTCCT ACGCGGTGCG GGGTCTGGCC
CGGTTCCGCG TCAACGTCTA CCAGCAGCGG GAGTCGATCG GCGCGGCGTT CCGCGTGATC
CCGTACGAGA TCAAGCCGCT CGAGGAGCTC GGGGTGCCGG CCGTCGTGGG CACCTTCGCG
GGGCTCCCGC GCGGGCTCGT CCTGGTGACG GGTCCGACCG GGTCGGGCAA GTCGACGACG
CTCGCCTCGA TCATCGACCT CGCGAACCGC ACGCGTCAGG ACCACATCAT GACGGTGGAG
GACCCGATCG AGTTCCTCCA CCGCCACAAG AAGTCGCTGG TCAACCAGCG CGAGGTCGGT
GCGGACACCC ACTCGTTCGC GAACGCGCTC AAGCACGTCC TGCGCCAGGA CCCCGACATC
ATCCTGGTCG GGGAGATGCG TGACCTGGAG ACCATCAGCG TCGCGCTCAC GGCGGCCGAG
ACGGGTCACC TGGTCTTCGC CACCCTGCAC ACGCAGGACG CCGCCCAGAC GATCGACCGC
GTCATCGACG TCTTCCCGTC GCACCAGCAG GCCCAGGTCC GTACGCAGCT GGCCGGCGCG
ATCCAGGCGG TCGTCTGCCA GACGCTGTGC AAGCGCGCGG ACGGGCCGGG ACGCGCCGTG
GCGACCGAGG TGCTGGTGGC GACCCCCGCC ATCCGCAACC TCATCCGCGA GGGCAAGACG
CACCAGATCT ACTCCTCCAT GCAGGCCGGC GCGAAGCAGG GGATGCACAC GATGGACCAG
CACCTGGCGG ACCTGGTGAA GCAGGGCAAG ATCACCTACG AGGTGGGGCT CGAGAAGTCC
CATCACGCCG AGGACTACAA CCGGTTGACC GGGAGGTTCT CGGGGGCGTC GCAGGGCGCG
GCCGGCATGG GTGACGAGGT CACGATGGGC TCCGGCTCGT ACGGGCAGGC GTTCTGA
 
Protein sequence
MPIDEVLREM VRLGASDAHF TTGSPPMVRL SGALQPLEQF GPVMPDGLRR SLYAILTQKQ 
RERYEEELEL DVSYAVRGLA RFRVNVYQQR ESIGAAFRVI PYEIKPLEEL GVPAVVGTFA
GLPRGLVLVT GPTGSGKSTT LASIIDLANR TRQDHIMTVE DPIEFLHRHK KSLVNQREVG
ADTHSFANAL KHVLRQDPDI ILVGEMRDLE TISVALTAAE TGHLVFATLH TQDAAQTIDR
VIDVFPSHQQ AQVRTQLAGA IQAVVCQTLC KRADGPGRAV ATEVLVATPA IRNLIREGKT
HQIYSSMQAG AKQGMHTMDQ HLADLVKQGK ITYEVGLEKS HHAEDYNRLT GRFSGASQGA
AGMGDEVTMG SGSYGQAF