Gene Cfla_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2520 
Symbol 
ID9146424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2823671 
End bp2824801 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID 
Productputative sugar ABC transporter, substrate- binding protein 
Protein accessionYP_003637607 
Protein GI296130357 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGG CGTGGAAGAA GAGCGCGATC GCGATGGTCG CCGCAGGCAT GCTGCTCGGC 
TCGCTCGCGG CGTGCAGCAG CGAGCGCGAG CCCACCACGG AGGGCACCGG CGACGCCGGC
AGCTCCGAGG ACACCGTCGT CGGCATCGCG ATGCCGACGA AGGCGCTCGA GCGGTGGAAC
CGCGACGGTG CGCACCTCGA GGGCCTGCTG CAGGACGCCG GCTTCGAGAC GAGCCTGCAG
TTCGCCGACA ACAAGGTCGA CCAGCAGATC ACGCAGCTCG AGAACATGAT CAACCAGGGC
GCGGACATCC TCGTCATCGC CTCGATCGAC GGCACGGCGC TCGCGCCGAC CCTCGAGCAG
GCCGCCGAGC AGGGCATCAC CGTCATCGCG TACGACCGCC TCATCAACGA CACCCCGAAC
GTCGACTACT ACGCGACGTT CGACAACTAC GGCGTCGGCA AGATGCAGGG CGAGTTCATC
GTCGAGCAGC TCGACCTCGC CGGCGGTGCC GGCCCGTTCA ACCTCGAGCC GTTCGCCGGC
TCGCCCGACG ACAACAACGC GAAGTTCTTC TTCGCCGGTG CCTGGGACGT CCTCAAGGAG
TACGTGGACA GCGGCCAGCT CGTCGTCCCG TCCGGCAAGG CCCCCGCGTC CAACGACGAC
TGGCAGTCCA TCGGCGTCCA GGGCTGGAGC TCCGACACGG CCCAGTCCGA GATGGAGAAC
CGCCTCAACT CGTTCTACGC GGGCGGCACC AAGGTCGACG TCGTCCTGTC GCCCAACGAC
TCGCTGGCCC TCGGCATCGC CCAGGCGCTC GCGGGCAACG GCTACGCGCC CGGCCCGGAC
TACCCGATCC TCACGGGGCA GGACGCCGAC AAGGCCAACG TCCTCAACAT GATCGAGGGC
AAGCAGTCCA TGTCCGTCTG GAAGGACACC CGCACGCTGG GTGACCGCAC CGCCACGATG
ATCGAGCAGA TCGTCGCCGG TGACGAGGTC GAGGTGAACG ACGAGGAGAC CTACGACAAC
GGCGAGAAGG TCGTCCCGAC CTACCTCCTG CCGCCGCAGG TCATCACGCC GGACACGGTG
CAGACCCTCG TGGACTCGGG CTTCTACACG GCCGCCGACC TCGGCCTGTG A
 
Protein sequence
MSLAWKKSAI AMVAAGMLLG SLAACSSERE PTTEGTGDAG SSEDTVVGIA MPTKALERWN 
RDGAHLEGLL QDAGFETSLQ FADNKVDQQI TQLENMINQG ADILVIASID GTALAPTLEQ
AAEQGITVIA YDRLINDTPN VDYYATFDNY GVGKMQGEFI VEQLDLAGGA GPFNLEPFAG
SPDDNNAKFF FAGAWDVLKE YVDSGQLVVP SGKAPASNDD WQSIGVQGWS SDTAQSEMEN
RLNSFYAGGT KVDVVLSPND SLALGIAQAL AGNGYAPGPD YPILTGQDAD KANVLNMIEG
KQSMSVWKDT RTLGDRTATM IEQIVAGDEV EVNDEETYDN GEKVVPTYLL PPQVITPDTV
QTLVDSGFYT AADLGL