Gene Cfla_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0456 
Symbol 
ID9144322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp486121 
End bp487827 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content72% 
IMG OID 
ProductRicin B lectin 
Protein accessionYP_003635570 
Protein GI296128320 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGCA CCCGTCCACA CGGCAGGCTG CACGCGCTCG CGCTCGCGCT CCTGCTGCTC 
GCCTCCCCCG CGCTCGCGTC CCCCGTCGCG CCCGCCGCCG CGGCACCCGC CCCCACCGTC
GACGAGCCCG CGCCCCTGCC CCCGCTGGGC TGGAACTCCT GGAACACCTT CTACTGCAAC
ATCAACGAGC AGATGATCCG GCAGACCGCC GACGCGATGG TGAGCACGGG CCTGGCGGCC
GCGGGCTACC AGTACGTCGT CGTCGACGAC TGCTGGATGC AGGACACCCG CGGCCCGGAC
GGCAACCTGC GGCCCCACAC GTCGCGCTTC CCGTCCGGCA TGAAGGCCCT CGGCGACTAC
ATCCACTCCA AGGGCCTGAA GTTCGGGCTG TACCACGCGC CGCGGGAGAA GACCTGCGAC
CAGTACTTCA ACAACCGCCC GGGCACGTCG TCCAACGGCA ACGAGACGCG CGACGCGCAG
CTCTTCGCGT CGTGGGGCGT CGACTACGTC AAGCACGACT GGTGCGACCC GCGCGGCAGC
ATCCAGGAGC AGGTCGACCT GTTCAAGCGG TTCGGCGACG CCCTGAAGGC CACCGGCCGG
CCGATCGTCT ACTCGATCAA CCCCAACAGC GCCCACGACA ACACGGCCCC GCGGTACTCC
GGGTGGGGCG CGTTCGCCGA CATGTGGCGC ACGTCGGAGG ACCTCAAGGA CGCCTGGTCG
ACGGGCTGCC CGCCGTCCGA CCAGTGGTGC TTCGTCGGCA TCACCGAGGC GCTCGACGTC
ATCGAGCCGA TGCGCGAGTG GACGCGGCCC GGGCAGTACA ACGACCCCGA CATGCTCATG
GTGGGCGTGC GCGGCACCCT GTCGCCCACC GAGAACCGCG CGCACATGAG CATGTGGGCG
ATGCTCTCGG CGCCCCTCAT CATGGGCAAC GACGTCCGGA ACATGAGCGC CGACGTGCGC
TCGGTCCTCA CCAACCGTGA CGTGCTGGCG ATCGACCAGG ACCCGCTCGT GCGTCAGGCC
GACCGGGTGC GGGACGACGG CGACGCCGAG GTCTGGGCCA AGCCCCTGGC CGACGGGTCC
GCGGCGGTCG CGCTGCTCAA CCGCGGCAAC AGCGCGCGCA GCATCTCGGC GACCCTCGCC
GAGGCCGGGC TGCCGGGCGG CACCGCGTCG TACCGCGAGG TGTGGAGCGG CGCGACGGGC
CAGACGTCGG ACCGCATCAC GACGACCGTC CCGGCGCACG GGGTGGCGCT GTACCGCGTC
ACGCCGGGCA GCAACCCGGG ACCGACGCCG ACCACGTCGC CGACTCCCAC GCCGACGACG
CCCCCGGGAT CGTCGTTCGC GCTGGTCAGC GCCGCCTCCG GGCGGTGCCT CGACGCGCCG
AACAGCGCGA CGACCAACGG CACCCGGCCC GTGATCTGGG ACTGCCACGG GCGCGACAAC
CAGCGCTGGG CCGCCGACGG CGCCACGCTG CGCGTGCTCG GCCGGTGCCT CGACGCCCCG
AACGGCGCGT CCGCCGGCAC CGCCGTCCAG CTGTACGACT GCCACGGCGG CACCAACCAG
CAGTGGACCA CGCAGTCGAA CGGCACGATC CGCGGCGTCG CGTCGGGCCT GTGCCTCGAC
GTGGACCGCA ACCTCACGGC CAACGGCACG GGCGTGCTGC TGTGGCACTG CACGGGTTCG
GCGAACCAGG TCTGGAGCCG TCGGTGA
 
Protein sequence
MSRTRPHGRL HALALALLLL ASPALASPVA PAAAAPAPTV DEPAPLPPLG WNSWNTFYCN 
INEQMIRQTA DAMVSTGLAA AGYQYVVVDD CWMQDTRGPD GNLRPHTSRF PSGMKALGDY
IHSKGLKFGL YHAPREKTCD QYFNNRPGTS SNGNETRDAQ LFASWGVDYV KHDWCDPRGS
IQEQVDLFKR FGDALKATGR PIVYSINPNS AHDNTAPRYS GWGAFADMWR TSEDLKDAWS
TGCPPSDQWC FVGITEALDV IEPMREWTRP GQYNDPDMLM VGVRGTLSPT ENRAHMSMWA
MLSAPLIMGN DVRNMSADVR SVLTNRDVLA IDQDPLVRQA DRVRDDGDAE VWAKPLADGS
AAVALLNRGN SARSISATLA EAGLPGGTAS YREVWSGATG QTSDRITTTV PAHGVALYRV
TPGSNPGPTP TTSPTPTPTT PPGSSFALVS AASGRCLDAP NSATTNGTRP VIWDCHGRDN
QRWAADGATL RVLGRCLDAP NGASAGTAVQ LYDCHGGTNQ QWTTQSNGTI RGVASGLCLD
VDRNLTANGT GVLLWHCTGS ANQVWSRR