Gene Cfla_3630 details


Gene Information

Locus tag: Cfla_3630
Symbol:
ID: 9147546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4017268
End bp: 4018836
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003638700
Protein GI: 296131450
COG category:
COG ID:
TIGRFAM ID:
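
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 4017268
END_BP = 4018836
GENE_LENGTH_BP = 1569
PROTEIN_LENGTH_AA = 522


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1569
print(expected_protein_length(GENE_LENGTH_BP))  # 522
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields in the record.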


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCAA CGACGACGCT GGCCGACCCG GCGACCACCA CCCGGGACGC CCGGGACCTC 
GACGCGATCA TCACCTACCA CGGGACCGAG CCCGAGAACG CCCTCATCCC CGGCCACACC
ACCGAGTCGG GCGGGGTCAA GGTGATCGCG GCCCTGTTCC GGGGTCTGGT CTCCTACGAC
CCCGTCGACG CCCACCCGCG CAACGCCGTC GCCGAGTCGA TCGAGTCCGA CGACTCCCGG
GTCTTCACGA TCCGTCTGCG CCGCGACTGG ACGTTCCACG ACGGCACGCC CGTCACGGCC
CACAGCTTCG TGGACGCCTG GAACCACACG GCCTACGGAC CGAACAGGAT GCTGGGCTCG
ACGTTCCACG CGCACATCGA CGGCTTCGCG GAGGTCAACG CCCCCGACTC CACCGTCACG
GCCATGTCAG GGCTGACCGT GCTCGACGAC CACACGTTCA CGGTCACGCT GTCCGCGCCG
TTCGCCGAGT TCCCGGTCAC GCTCGGGTGC AGCGCGTTCT TCCCGCTCCC GGCGTCGTTC
TTCACCGACC GCGAGGGGTT CGAGGCGCAC CCGATCGGCA ACGGCGCCTT CCGCTTCGTG
TCCCGCCGGC CCGAAATCAA CATCCTCCTG CACCGCTACG AGGGGTACGC CGGTGACGAC
CGCCCGCAGA TCGGCGGCGT CGAGTTCAAG ATCTACGCGA GCCTGGACGA CGCCTACCAC
GACGTCGTGG CCGACCGGCT CGACTACCTC GACGTGACGC CGTACTGGTC GCTGCAGGAC
GACCGCTACC TGCGCGACCT ACCCGGGCGC ACGCACGACC GGACGTACCT CGGCATCCAG
ACGATCTCGT TCCCGCTCTA CGACGCCCGG TACGCCGACG CGCGCGTGCG GCAGGCCATC
TCCATGTCGA TCGACCGGGA GCAGCTGGTC GAGACGATCT TCCGGGGGCA CCAACTGCCG
GCCGACGGCC TGGTCCCGCC CGCGGTCTCC GGGCGCATCG AGGGCCAGGG CGGGCAGCTG
TGCACCTACT CCCCCGCCCG GGCCAAGGAG CTCTTCGACG CGGCGGGCTT CGAGGGCGAC
ATCGAGCTCA CGTCCAACGT CGACTCACCC AACCGCCCGT GGATGGAGGA GATCTGCGCG
TCGGTCGAGG AGGTGCTCGG CGTGCGCTGC CGCTTCCTCG CCATCCCGAC CATGGGCGAG
TTCCGTCGCA GGCTCAACGC CCTCGAGGTC ACGGCGATGT TCCGCTCCGG CTGGATCGCC
GACTACCCCT CGATCGAGAA CTTCCTGAGT CCCATGTTCC GCACCGGTGC CACCGACAAC
GTCGGCAGGT ACAGCAACCC GGCCGTGGAC GCGCTGCTCG ACGCCGCCGA CTCGGCCCCC
ACCCAGGAGG AGGCGTGGGC GAGGTACCAG GAGGCCGAGC GGGCCATTCT CCACGACATG
CCGACGATCC CGATCTGGCA CCAGAGCACG CTCTCGGCGT GGTCGACCCG ACTGCGTGAC
GTTCAACCGA ACCCCTTCCG TGAGCTGGAC CTCTCCAGCG TCACGGTGAC CTCCGCGCCG
ACGAGCTGA
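
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names, not from the source database) strip the layout whitespace and compute a GC percentage comparable to the GC content field; how the database rounds that figure is not stated and is not assumed here.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# Usage: paste the full gene sequence in place of this short fragment,
# which is only the first two blocks of the record.
fragment = "ATGCAGGCAA CGACGACGCT"
print(len(clean_sequence(fragment)))  # 20
```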
 
Protein sequence
MQATTTLADP ATTTRDARDL DAIITYHGTE PENALIPGHT TESGGVKVIA ALFRGLVSYD 
PVDAHPRNAV AESIESDDSR VFTIRLRRDW TFHDGTPVTA HSFVDAWNHT AYGPNRMLGS
TFHAHIDGFA EVNAPDSTVT AMSGLTVLDD HTFTVTLSAP FAEFPVTLGC SAFFPLPASF
FTDREGFEAH PIGNGAFRFV SRRPEINILL HRYEGYAGDD RPQIGGVEFK IYASLDDAYH
DVVADRLDYL DVTPYWSLQD DRYLRDLPGR THDRTYLGIQ TISFPLYDAR YADARVRQAI
SMSIDREQLV ETIFRGHQLP ADGLVPPAVS GRIEGQGGQL CTYSPARAKE LFDAAGFEGD
IELTSNVDSP NRPWMEEICA SVEEVLGVRC RFLAIPTMGE FRRRLNALEV TAMFRSGWIA
DYPSIENFLS PMFRTGATDN VGRYSNPAVD ALLDAADSAP TQEEAWARYQ EAERAILHDM
PTIPIWHQST LSAWSTRLRD VQPNPFRELD LSSVTVTSAP TS
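
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the sequence is given in coding orientation and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; the `translate` function and its table construction are illustrative, not the database's own method.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order
# (NCBI tables 1 and 11 share these assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(block_text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    dna = "".join(block_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = (
    "ATGCAGGCAA CGACGACGCT GGCCGACCCG GCGACCACCA CCCGGGACGC CCGGGACCTC"
)
print(translate(first_line))  # MQATTTLADPATTTRDARDL
```

The output matches the first twenty residues of the protein sequence listed above, and the final codons of the gene (`ACG AGC TGA`) translate to `TS` followed by a stop, consistent with the last two residues.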