Gene Cfla_3563 details

Gene Information

Locus tag: Cfla_3563
Symbol:
ID: 9147479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3952946
End bp: 3955273
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: glycoside hydrolase family 9
Protein accession: YP_003638634
Protein GI: 296131384
COG category:
COG ID:
TIGRFAM ID:
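
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3952946, 3955273)
print(length_bp)                  # 2328
print(protein_length(length_bp))  # 775
```

Both values agree with the Gene Length and Protein Length fields of this record.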


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACCCA GCACCCGCCG GCGCGCCGCC GGCCTGGCAG CCTCGGCCGC GGTCGCGGTC 
GCCGCCGCCC TCACCGTCAC GGGCGCCACC GCGCCCGCGG CCGTCGCCGC CGCCGACCCG
AGCACCCCGG TCAAGGTCAA CCAGGTCGCC TACGTCCCCG GTGTCGCCAA GGTCGCGACG
CTCGTCAGCT CCGCGACGTC CCCCGTACCG TGGACGCTGC GCGACGCGTC CGGCCGCACG
GTCGCCTCGG GCAGCACCAC CGTCAAGGGC GCGGACTCCC TGTCCGGCGA CCGCACCCAC
CTCGTCGACT TCTCGTCGTA CGACACCCCG GGCACCGGGT ACGTGCTGGT CGCGGGCGGC
TCCGAGAGCC TGCCGTTCGA CATCTCCGCC GACCCGCTGA AGCAGCTCCG GTACGACGCG
CTCGCGTTCT TCTACCACCA GCGCTCGGGC ATCGCGATCG AGTCGCAGTA CGTCGGAGCC
GCGTACGCGC GCGCCGCCGG GCACCTGGGT GTCGCCCCCA ACCAGGGCGA CACGAGCGTG
CCGTGCCGGC AGTCCTGCGG GTACAGCCTC GACGTGCGCG GCGGCTGGTA CGACGCGGGC
GACCACGGCA AGTACGTCGT CAACGGCGGC ATCGCGACGT GGCAGCTGCA GAACGCGTAC
GAGCGGACGC TGCACGTCGA CGGCGCGGAC CGCGCGGCGC TCGGTGACGG GAAGCTGCGC
ATCCCCGAGC GCGCCAACGG GGTGCCGGAC GTGCTCGACG AGGCGCGCTG GGAGGTCGAG
TTCCTGCTGC GCATGCAGGT GCCGGCCGGG CGCACCGACG CCGGGATGGT CCACCACAAG
ATGCACGACG AGAACTGGAC GGGGATGCCC ACGATCCCGT CGCAGGACTC GTCGCGCCGC
ATCCTCGCGC CCGTCAGCAC GGCCGCGACG CTCAACATGG CGGCCGTCGC CGCACAGGCA
GCGCGCCTGT GGGAGCCGTA CGACGCGACG TTCGCCGCGA AGGCGCTGAG CGCCGCGAGG
ACCGCGTACG CCGCGGCGAA GGCCAACCCG AACCGCATCG CGTCGGCAAG CGACGGCACC
GGCGGCGGGG CGTACGGCGA CACCTCCGTG ACCGACGAGT TCTACTGGGC GGCCGCCGAG
CTGTACGCGA CGACGGGCGA GTCGGCCTAC CGCGCCGATG TCACGGGCTC GTCATTCTAC
AAGGGCACGA GCTTCGCGCA GCGCGGGTAC GACTGGGGCT GGACCGGCGG GCTCGGTGAC
ACGACGCTCG CACTCGTCCC GACCGACCTG CCCGCGTCGG ACGTCGCGGC GACGCGCTCG
GCGATCGTGT CCTTCGCGGA CGCGGCGCTC TCCCGCCTGG CGGGCCAGGC GTACCCGGCA
CCGAACAACG CCGGCAGCGT CTACTACTGG GGCTCCAACG GCCAGGTGAC CAACAACGCC
AACGCGCTGG CCCTCGCGTA CGACTTCACG GGCCAGGCGA AGTACCGGAC CGCCGTGTTC
GGCGCGCTCG ACTACCTGCA GGGCCGCAAC CCGCTGAACC AGTCGTACGT CGCCGGCTAC
GGCGAGAAGC CCGTGCGCAA CGTGCACCAC CGGTTCTGGG CGAACCAGCT CGACGCGTCC
CTGCCGACCG CGCCCCCCGG GTCGCTGTCC GGCGGCCCGA ACAGCGAGCT GCAGGACCCG
TACGCGGCGG CGCAGCTCGC CGGGTGCGCG GCGCAGAAGT GCTTCGTCGA CCACATCGAC
GCGTACTCGG TCAACGAGGT CGCGATCAAC TGGAACTCGG CGTTCGCGTG GCTGACGGCG
TGGGCGGCGG AGCAGGCGGG CGGCGCGACG CCGACGCCGA CCCCGACGCC CACCCCGACC
CCCACGCCGA CGCCGTCGGC GACGCCGAGC CCCACCCCGT CGCCGACCCC GTCGGTGACG
CCGACCCCCA CCCCGTCGCC GACCCCGTCC GCGACGCCGA GCCCCACCCC GTCGCCGACC
CCCTCCGCGA CGCCGAGCCC GACGCCCCAG CCGGCTGCGG CCTGCCGGGT GACGTACTCG
GCGAACTCAT GGGGCTCGGG CTTCACGGCG GCGGTCGAGG TGACCAACAC CGGCTCCACC
GCGTGGTCGT CGTGGCGCCT CGGCTTCACG TTCGCGGGTG ACCAGAAGGT CACCCAGGGC
TGGAGCGCGA CGTGGGACCA GTCGGGCTCG ACGGTGACCG CGACCAACGC CGCGTGGAAC
GGCCAGCTCG CGCCCGGCGC GACGACGAGC ATCGGGTTCA ACGGCTCGTA CAGCGGCACG
AACGCGGCAC CGACCGCGTT CACGGTCAAC GGGGCCGCCT GCTCCTGA
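
The GC content field can be recomputed from the gene sequence once the block formatting (groups of ten bases separated by spaces and line breaks) is removed. The following is a minimal sketch; the helper names are hypothetical, and the demo input is only the first line of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as a short demo input.
first_line = "ATGCCACCCA GCACCCGCCG GCGCGCCGCC GGCCTGGCAG CCTCGGCCGC GGTCGCGGTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the value to compare against the GC content field.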
 
Protein sequence
MPPSTRRRAA GLAASAAVAV AAALTVTGAT APAAVAAADP STPVKVNQVA YVPGVAKVAT 
LVSSATSPVP WTLRDASGRT VASGSTTVKG ADSLSGDRTH LVDFSSYDTP GTGYVLVAGG
SESLPFDISA DPLKQLRYDA LAFFYHQRSG IAIESQYVGA AYARAAGHLG VAPNQGDTSV
PCRQSCGYSL DVRGGWYDAG DHGKYVVNGG IATWQLQNAY ERTLHVDGAD RAALGDGKLR
IPERANGVPD VLDEARWEVE FLLRMQVPAG RTDAGMVHHK MHDENWTGMP TIPSQDSSRR
ILAPVSTAAT LNMAAVAAQA ARLWEPYDAT FAAKALSAAR TAYAAAKANP NRIASASDGT
GGGAYGDTSV TDEFYWAAAE LYATTGESAY RADVTGSSFY KGTSFAQRGY DWGWTGGLGD
TTLALVPTDL PASDVAATRS AIVSFADAAL SRLAGQAYPA PNNAGSVYYW GSNGQVTNNA
NALALAYDFT GQAKYRTAVF GALDYLQGRN PLNQSYVAGY GEKPVRNVHH RFWANQLDAS
LPTAPPGSLS GGPNSELQDP YAAAQLAGCA AQKCFVDHID AYSVNEVAIN WNSAFAWLTA
WAAEQAGGAT PTPTPTPTPT PTPTPSATPS PTPSPTPSVT PTPTPSPTPS ATPSPTPSPT
PSATPSPTPQ PAAACRVTYS ANSWGSGFTA AVEVTNTGST AWSSWRLGFT FAGDQKVTQG
WSATWDQSGS TVTATNAAWN GQLAPGATTS IGFNGSYSGT NAAPTAFTVN GAACS
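
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of such a translation, assuming the table 11 codon-to-amino-acid assignments (which match the standard code; the alternative start codons that table 11 allows are not handled, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and the last nine bases.
print(translate("ATGCCACCCA GCACCCGCCG GCGCGCCGCC"))  # MPPSTRRRAA
print(translate("TGCTCCTGA"))                         # CS*
```

The first ten codons give MPPSTRRRAA, matching the start of the protein sequence, and the final TGA codon is the stop that is not counted in the protein length.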