Gene Cfla_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1257 
Symbol 
ID9145136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1404363 
End bp1405949 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content74% 
IMG OID 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003636356 
Protein GI296129106 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCAGG TCGACGGACG TAGGTGGAGG GCTGCCCGGG CAGTCATCGC TCTGATGGCG 
CTGACGGCGG TCGCGGCGCC CGCGCACGCG GTGCAGGGCG TCGTGCCCGG GGCGGCCGCA
GCGGCGGCGA CCGTGCGCCT CGAGATCGCG GGTCACCCCG ACGCGGTGCC CGGGGAGACG
CGGGAGCGGG CGTGCTCGGG GGCGCTGGTC GCCGCGTCGT GGGTGATCAC GGCAGCGTCG
TGCTTCGCGG ACGCGTCCGG CGCGGCGGTG ACGGCCGGCG CGCCGCGCTG GGCGACGACG
GCGACGGTGG GGCGCCCTGA CCTGACGGCG ACGACCGGGC AGGTGCTCAA GGTGGACCGG
CTCGTGCCGC ACCCCGACCG CGACGTCGTC CTGGCGCACC TGGCGTCGAC GGTGACCACC
ACGACTCCGT TGGCGGTCGC GACGACGCCG CCGGCGGTCG GCGAGACGCT GACGGTCGCC
GGGTACGGCC GTACCGCCGA CGCCGTCGTC CCCGACACCG TGCACGCCGC GGCGTACTCG
GTGGCGTCGG TCGGCGACAG GGCTCTCGAC ATCGTGCCGG CGCAGGACGA TGCCGCAATC
TGCAAGGGCG ACGCGGGTGG CCCGGCGCTG CGTGCGACGG CGAGTGGGGG AGTGGAGCTC
GTCGCAATCC ATCACACCGC CTACCAGGGC GGCTGCCTCG GGTCGGTGAG CACCCGCAGG
GAGGCCACCG AGACGCGCGT CGACGACCTC CGCGACTGGG TCGGGCAGGT CACCGCACCG
ACGCAGCACC TGGCGCTCGG CGGAGGTCGT GTCGGCGTCG TGACCGACGC ACGCAAGGCG
ATCGTCGCCG ACGGGCTCAC GGGCAGCTGG ACGACGGTGC ACGACGACGC CGCGCAGGTC
GTCCTCGACG GCACGCGCAT CGGCGTACTG ACGTCGGACG GCGTCGCTCT CGTGAAGGAC
GGGGGCATCA CGGCCCCGTT CGTCCGCGTC GCCGGTGGCG TGCAGCAGCT CGTGCTGTCC
GGCGACCGCA TCGGTGTGCT GACGGACGGC GGGGACGCCT CTGTGAAGGA GGGGCCGGTC
AACGCCGGGT GGGTCAAGGT GTCAGGGGGC GTGAAGCAGC TCGTGCTGTC CGGCGACCGG
ATCGGCGTGC TGACCCACGG CGGCGACGCC TCTGTGAAGG AGGGGCCGGT CAACGCCGGA
TGGGTCAAGG TGTCAGGGGG CGTGAAGCAG CTCGTGCTGT CCGGCAACCG CATCGGCGTG
CTGTCTGACG GCGGCGAAGC CTCCGTGAAG GAGGGCGGTC TGGGTGCCGG CTGGGTCGCC
GAGCACGGCG GCGTGCGCGA CCTCGCGCTG TCGGGTGACC GGATCGGTGT GCTGACGAAC
GGGCGTGACG CCCTGGTGAA GGAGGGCGAC CTGCGAGCGG GATGGGTCGT CGAGTACGGC
GGTGTGCAGT CGATGGTGCT CTCGGGCAAC CGCATCGGTG TGGTCACCGG TGACGGTGCC
GCACTCGTCA AGGAGGGCGC GCTGAACGCC GGCTGGACCA GCGTCTGGGG GAAGTGCCAC
CAGGGGCCGT GCAGCACGTC GGGGTGA
 
Protein sequence
MNQVDGRRWR AARAVIALMA LTAVAAPAHA VQGVVPGAAA AAATVRLEIA GHPDAVPGET 
RERACSGALV AASWVITAAS CFADASGAAV TAGAPRWATT ATVGRPDLTA TTGQVLKVDR
LVPHPDRDVV LAHLASTVTT TTPLAVATTP PAVGETLTVA GYGRTADAVV PDTVHAAAYS
VASVGDRALD IVPAQDDAAI CKGDAGGPAL RATASGGVEL VAIHHTAYQG GCLGSVSTRR
EATETRVDDL RDWVGQVTAP TQHLALGGGR VGVVTDARKA IVADGLTGSW TTVHDDAAQV
VLDGTRIGVL TSDGVALVKD GGITAPFVRV AGGVQQLVLS GDRIGVLTDG GDASVKEGPV
NAGWVKVSGG VKQLVLSGDR IGVLTHGGDA SVKEGPVNAG WVKVSGGVKQ LVLSGNRIGV
LSDGGEASVK EGGLGAGWVA EHGGVRDLAL SGDRIGVLTN GRDALVKEGD LRAGWVVEYG
GVQSMVLSGN RIGVVTGDGA ALVKEGALNA GWTSVWGKCH QGPCSTSG