Gene Cfla_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1950 
Symbol 
ID9145844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2170130 
End bp2172019 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content76% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_003637044 
Protein GI296129794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGGGC CGATGGACGC CCCGACGCCC TGCCCGCCGG CCGGCGCGCT GCTCGCGCAC 
CAGGTGCCGA TCGAGGACTA CGCCGTGCTG GGTGACGGCC ACACCGCCGC GCTCGTCTCG
CGACGGGGCT CGGTCGACTG GCTGTGCCTG CCACGGTTCG ACTCCGACGC GTGCTTCGCC
GCGCTGCTCG GCACGCCGCG CCACGGTCGC TGGCTGCTCA CGGTCCCCGA CGCGACGGAC
GTGACGCGGC ACTACCGCGG CGACTCGTTC GTGCTGGAGA CGACGTACCG TGCGCCGCGG
GGTGAGGCGC TCGTGACCGA GGCGATGCCG CTCGGCGACG GGCGTGCCGA CCTCGTGCGG
CGCATCGAGT GCACGCACGG CGAGGTCGAC GTCGAGCACG AGTGGGTCGT GCGGCTCGGC
TACGGTGCCG TCGAGCCGTG GGTCCGGCGG GAGCAGGACG CCGACGGGCA CGAGGCGATC
CGGGCGATCG CCGGGCCCGA CTCCCTCGTG CTGCGCGGCG ACAGGCTCCC GGAAGCCGTC
GACCACCGGC ACCGGGACCG GTTCACGCTC CGGGCGGGCC AGGCCGTGGA GCTCTCGCTG
ACGTGGGTGC ACTCGTGGCA GCCCACGCCG TCACGGCTCA CCGTGCCGGA CCGCGTCGAC
GCGACCGCCG TCGCGTGGGG GCTGTGGGCG CGCGGCTGCA CGTACGACGG GCCGCACCGC
GAGGCCGTGG TGCGGTCGCT GCTCGTCCTG CGTCTGCTGA CGGACCTCAC CACGGGGGGC
ATCGTCGCGG CCGTCACCAC GTCGCTGCCC GAGACGTTCG GCGGCGAGCG CAACTGGGAC
TACCGCTACT GCTGGCTGCG CGACGCCGCG CTCACGCTCG AGGCACTCGT CGAGCAGGGC
TTCCGCCAGG AGGCGACGCA GTGGCGCGCG TGGCTCGAGC GCGCCGTCGC GGGTGACCCG
CGGGACCTGC AGATCATGTA CCGGCTCGAC GGCGGACGAC GGCTGCCGGA GGTCCTGCTC
GAGCACCTGC CGGGGTACGC CGGGTCCCGC CCCGTGCGGG TCGGCAACCT CGCGGCGGGT
CAGGTCCAGC ACGACGTGCT CGGCGAGGTG ATGTCCGCGC TCGCCGCGGC GCGCGACGCC
GGTCTGCCCG AGACGGAGGG CTCGTGGGCG CTGCAGTGCC GGCTCGTCGA CGACCTCGCC
GCGTCGTGGC GCACCCCCGA CCGGGGCATC TGGGAGATCC GCGGCGAGCC CCAGCACTTC
ACGCACTCGA AGGTCATGGC GTGGGCCGCG CTGGACCGGG CGGTGTCCGG CGTCGAGCGG
CACGGCCTGC CCGGCCCCGT CGAGCGGTGG CGCCGTGTGC GCGAGGAGAT CCGTGCGGAC
GTCATGGCGC ACGGGTGGTC ACCCGAGCGG AGCACGTTCG TCCAGCACTA CGGCGCGGAG
CACACCGACG CGTCGCTGCT CCAGCTGGTG CAGGTCGGCT TCGTGCCGGC GGACGGGCCG
CACGCGCTTG GCACCCTCGC AGCGGTGCGC GACGAGCTCG AGGTGGCGCC GGGCCTGCTG
CTGCGCTACC GCACCGACCG CACGGACGAC GGGCTGGCCG GTGCGGAGGC ACCGTTCCTC
GCGTGCTCGT TCTGGCTCGC CGACGCGCTC GCACGCGCGG GCGAGGTGGA CGAGGCGAGC
CGCGTGCTCG ACGTGCTCGT CCCGCTGGCG AACGATGTCG GGCTGCTCGC CGAGCAGTAC
GACCCGTACG CAGGGCGCAT GGTCGGCAAC GTCCCGCAGG CGCTGTCGCA CCTCGCCCTC
GTGCGGGCCG CGCACAGCCA CGCGCGCGCC GTGCGGCCGG CGTCGGGGAC CGGCCCCGAC
GGCGGCCGGG CGGCGGCCGT GGCACGCTGA
 
Protein sequence
MLGPMDAPTP CPPAGALLAH QVPIEDYAVL GDGHTAALVS RRGSVDWLCL PRFDSDACFA 
ALLGTPRHGR WLLTVPDATD VTRHYRGDSF VLETTYRAPR GEALVTEAMP LGDGRADLVR
RIECTHGEVD VEHEWVVRLG YGAVEPWVRR EQDADGHEAI RAIAGPDSLV LRGDRLPEAV
DHRHRDRFTL RAGQAVELSL TWVHSWQPTP SRLTVPDRVD ATAVAWGLWA RGCTYDGPHR
EAVVRSLLVL RLLTDLTTGG IVAAVTTSLP ETFGGERNWD YRYCWLRDAA LTLEALVEQG
FRQEATQWRA WLERAVAGDP RDLQIMYRLD GGRRLPEVLL EHLPGYAGSR PVRVGNLAAG
QVQHDVLGEV MSALAAARDA GLPETEGSWA LQCRLVDDLA ASWRTPDRGI WEIRGEPQHF
THSKVMAWAA LDRAVSGVER HGLPGPVERW RRVREEIRAD VMAHGWSPER STFVQHYGAE
HTDASLLQLV QVGFVPADGP HALGTLAAVR DELEVAPGLL LRYRTDRTDD GLAGAEAPFL
ACSFWLADAL ARAGEVDEAS RVLDVLVPLA NDVGLLAEQY DPYAGRMVGN VPQALSHLAL
VRAAHSHARA VRPASGTGPD GGRAAAVAR