Gene Cfla_1429 details

Gene Information

Locus tag: Cfla_1429
Symbol:
ID: 9145315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1585504
End bp: 1586919
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: cellulose-binding family II
Protein accession: YP_003636526
Protein GI: 296129276
COG category:
COG ID:
TIGRFAM ID:
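
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1585504
end_bp = 1586919
reported_gene_length = 1416     # bp
reported_protein_length = 471   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acid count for a CDS, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(gene_length(start_bp, end_bp))          # 1416
print(protein_length(reported_gene_length))   # 471
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.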


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0718119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAGAA CCCGCAGCAG GACGGCTGCA CTGCTGGGGA CGGTCACGCT GGTCGCGGCA 
GGCGTCGTCG CCGCGGCCAC CGCCCCCGCA CACGCCGCCC AGACCATCAC CAACGTCGCC
TACGCACCGG CGGAGCCCTC CGGCAGCCGG GGCCACCTCC TGGACCTCTA CATCCCCGAC
GGCAACGGCC CGTTTCCCCT CGTCCTGTGG TCGACCGGGT CGGCGTGGTC GTCCGACGAC
GGCAAGTCGG GCGCGAGCGC CATCGCCCAG CAGCTCAACC CCCGCGGCAT CGCCGTCGCG
GGCGTCAGCG TGCGCTCGGC CTCGCAGGCG AAGTTCCCCG CGCAGGTCCA CGACATCAAG
TCGGCCACGC GGTTCCTGCG CAGCAACGCG GCGCAGTACC GGCTGAACCC GAACCAGTTC
GCCAGCATGG GCGACTCGTC GGGCGGCTGG GTGGCCGCGA TGGCTGCGGT GTCCAACGGC
AACGCGTACC TCGAGGGCAC CGTCGGCACG ACCGGCGTCT CGAGCGACGT GCAGGCCGGC
GTCGACTTCT TCGGCCCCAC CGACTTCGCG CGGCTCAAGG AGCAGGACCC GGGTGGCTTC
ATCGACCACG ACAGCCCCAG CGCGCCCGAG GGCCAGCTGC TCGGCTGCGC CACGCCGACG
TGCCCGGACA AGGTGCGCCA GGCCAACCCG CTGACGTACG TCGACGCGCA GGACCCGCCG
ATGCTCCTGC TGCACGGGCA GGCGGACAAC GTCGTCCCGC ACGCCCAGAC GGTCATCTTC
TACGACGCGC TGAAGGCCGC GTGCGTCGAC ACCCAGTTCT TCTCGGTCCC CGGCGCCGGC
CACAGCCACG CCGACGTGAC GAGCTCCTCG CGCTTCGGCC GGCAGACAGT CCGCACCGTC
GAGGGTTGCC GCGAGACGGT CACGCAGGGC ACGCCGAACC CGAGCTGGGA CACCATCGCG
GCGTTCCTCA AGGACGCGTG GGCGGGCGGC ACGTCGACAC CGACGCCCAC CCCGACGCCG
ACGACCAGCC CCACGCCGAG CCCGACGGAC AGCCCGACGC CGACGCCGAC CTTCTCGCCG
ACCCCGAGCC CGACGCCGAG CCCGACGCCG ACGCCCACCC CCACGCCGGG CACGGGCAAC
GGCTGCTCGG CGACGTACCG GGTGGTCAAC GCCTGGCCCA ACGGCTTCGT CTCGGAGGTG
AAGGTCACGG CCGGTGGCGC GTCCCTCTCC GGCTGGCGCG TCAGCATGAC GCTGCCCGGT
GGTCAGGCCA CGCAGGTCTG GAACGGCCAG TCGTCGGGCG GCGTCGTGTC CAACGCTCCG
TGGAACGGGT CGCTGGGCGC CGGGCAGAGC ACGACGTTCG GCTTCCAGGG CACCGGCAGC
GGCGAGGGCG CGACGGTGTC GTGCTCGGCC TCATGA
 
Protein sequence
MSRTRSRTAA LLGTVTLVAA GVVAAATAPA HAAQTITNVA YAPAEPSGSR GHLLDLYIPD 
GNGPFPLVLW STGSAWSSDD GKSGASAIAQ QLNPRGIAVA GVSVRSASQA KFPAQVHDIK
SATRFLRSNA AQYRLNPNQF ASMGDSSGGW VAAMAAVSNG NAYLEGTVGT TGVSSDVQAG
VDFFGPTDFA RLKEQDPGGF IDHDSPSAPE GQLLGCATPT CPDKVRQANP LTYVDAQDPP
MLLLHGQADN VVPHAQTVIF YDALKAACVD TQFFSVPGAG HSHADVTSSS RFGRQTVRTV
EGCRETVTQG TPNPSWDTIA AFLKDAWAGG TSTPTPTPTP TTSPTPSPTD SPTPTPTFSP
TPSPTPSPTP TPTPTPGTGN GCSATYRVVN AWPNGFVSEV KVTAGGASLS GWRVSMTLPG
GQATQVWNGQ SSGGVVSNAP WNGSLGAGQS TTFGFQGTGS GEGATVSCSA S