Gene Cfla_0986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0986 
Symbol 
ID9144861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1093054 
End bp1095105 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content78% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003636091 
Protein GI296128841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.732347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGTGG CGAGCGGGCG GACGACGGGG GACACGCAGG ACACGCGGTC GCACGGGGCG 
GGCCGGCGGG CCGGGCGCCG CGCGGCGACG CTCCTGGTGC CGGTGCTCGC GGTGACCACC
CTCGCGGCGT GCACCGCCGA GGAGGTCGAC CCGGCGCGTG CCGGGACCGT GGTCGTCTCG
GTGGACCTGC CCTTCGCCTC GCTCAACGGC GCGACGGCCG CCGGTCGTGC GCCGGGCAGC
GTGCTGGTGC GCGGTCTGGT GCAGTCCGGG TTCTCGGCGA TCGAGCCCGA CGGCACGGTC
CGCCCGGACG AGTCGTTCGG CACCGTCGAG AAGGTCGGCG ACGACCCGCT GACGGTGCGC
TACACGATCG CGCCGACCGC GCGGTGGTCC GACGGCGTGC CCGTCACCCC CGCGGACCTG
CTGCTCGAGT GGGCGGCGCG CAGCGGTCAG CTGGACGAGG TCGTGCCCGA GCTCGACGCG
GACGGCGTGG TCGCGCACAC CCGGGACGAC GTCGTCGTCT TCGGTGCCGC GTCCGCCGCG
CTCGCGCGCG CCGCGTCGGT CCCGACCGTC GAGGGCGACA CCGTCACGGT CGTCTACGAC
GCCCCTGTCG CCGACTGGCG CACCGCGCTG GACGTCAACC TGCCCGCGCA CGTCCTGGGG
CGCCTCGCGC TCGATCCCGA CGCGCCCGCG CTGCCGATCC CGGGGGCGAC TCCGTCGGCC
ACGGCCACGG GCACCACGTC GCCGTCGGCG CAGGCCGCGG CGGACGCCAC CGCGACCACG
TCGCCCTCGC CCGTCGCAGC GGCGTCCGCG TCGCCCTCCG GCGACGCGGA TCCCGACGCC
ACTCCCGAGC CTGCGGGCGA GCGCGACGAC GCCGGGGCCG GCGAGGAGCT CGAGGAGGCC
GCCGGCTGGG CGCAGGCGGT GGTGACGGCC GTGCAGCAGC AGGACCGGTC GGCGCTCGTG
CCGATCTCGC GGGTCTGGCG TGCGGCCGGG CGCGCGGGGG ACGTCACGGC GGACCCGACG
CTCACGACGA CGACCGGACC GTACGTGCTC GCGGACGTGG GGGGAGCGGG CGTCGAGATG
GTCCGCAACG AGCGGTACGC GGGGGAGGCG CCGGCGGCGT ACGACCGGGT GCGGGTGCGC
ACGGACCTCG ATCCCCTCGC CCAGCTCGAC GCGCTCGCCG CGGACGAGGT CGACGTGGCA
GCACCGGTGA GCACGTCCGA CGTCCTGGCA GCGGCGGAGG GCCTCGAGGA CGTCGCGATC
GCCACGGGCG GGGACGCCGT GCTCCAGCTC GTGCTGCAGC AGGACGCCGG TGGCGTCTTC
GATCCCGGCT CGTACCAGGA CGCGCCCGAC CCGGCGGCGA CGGTCGCCGC ACTGCGCGCG
GCGTTCCTCG TGAGCGTGCC GCGCGAGGAG GTCGTGGTCG ACGCGGTGCG CCCGCTGTGG
GCGCGTGCGC AGGTGTCGGA GGTGGTCGCG GCGCAGGTGG CGCCGGCGGC GACGCCCACG
CCGGTGGCGT CGGCCACCGC CGCGGCGGCG GCCGACGGGC CCGTGGAGGT GCGGGTGCTG
ACGAACACCG CCGACCCCCT GCGCGCGGCG GTGCTCGACG CGCTGACGAC GGCGGCCGCC
GAGCAGGGCT TCGAGGTGGT CCCCGTGGCG ACGGCGGACG CGGCGCAGAG CCTGCGCACG
CGCCCGGAGG ACTGGGACGC CGCGCTCGTA CCCGTCGCCC AGGAGGACCT GCCCGTCGCC
GCCTTCGCGG CGCGCTGGCG CAGTGGGGGC GCCACCAACG TCACGGGCCA CGCGGACCCC
GCGCTCGACG AGGTGCTCGA CGCGCTGGTC GCGCAGCCGG ACCCGGACGC CGCGGGTGCG
CAGGTCGCGG ACGCGTCGGC CGCCCTGCGC ACGTGGGGCG CCGTGCTCCC GGTGGTGCGC
ACACCCGTCC TGACGGTGTC CGCCACGCGT GACGCGGCGG AGGACCGCGG GCTGCCGGTG
GTCGCGGACG TCCCGGTGCT CACACCTGCT GCGGCGGACC TCACATGGTG GTGGAACTGG
ACACGACGGT AG
 
Protein sequence
MSVASGRTTG DTQDTRSHGA GRRAGRRAAT LLVPVLAVTT LAACTAEEVD PARAGTVVVS 
VDLPFASLNG ATAAGRAPGS VLVRGLVQSG FSAIEPDGTV RPDESFGTVE KVGDDPLTVR
YTIAPTARWS DGVPVTPADL LLEWAARSGQ LDEVVPELDA DGVVAHTRDD VVVFGAASAA
LARAASVPTV EGDTVTVVYD APVADWRTAL DVNLPAHVLG RLALDPDAPA LPIPGATPSA
TATGTTSPSA QAAADATATT SPSPVAAASA SPSGDADPDA TPEPAGERDD AGAGEELEEA
AGWAQAVVTA VQQQDRSALV PISRVWRAAG RAGDVTADPT LTTTTGPYVL ADVGGAGVEM
VRNERYAGEA PAAYDRVRVR TDLDPLAQLD ALAADEVDVA APVSTSDVLA AAEGLEDVAI
ATGGDAVLQL VLQQDAGGVF DPGSYQDAPD PAATVAALRA AFLVSVPREE VVVDAVRPLW
ARAQVSEVVA AQVAPAATPT PVASATAAAA ADGPVEVRVL TNTADPLRAA VLDALTTAAA
EQGFEVVPVA TADAAQSLRT RPEDWDAALV PVAQEDLPVA AFAARWRSGG ATNVTGHADP
ALDEVLDALV AQPDPDAAGA QVADASAALR TWGAVLPVVR TPVLTVSATR DAAEDRGLPV
VADVPVLTPA AADLTWWWNW TRR