Gene Cfla_1739 details


Gene Information

Locus tag: Cfla_1739
Symbol:
ID: 9145628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1932864
End bp: 1934534
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Sulfite reductase (ferredoxin)
Protein accession: YP_003636835
Protein GI: 296129585
COG category:
COG ID:
TIGRFAM ID:
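
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Cfla_1739.
length_bp = gene_length(1932864, 1934534)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Under these assumptions the result matches the listed gene length of 1671 bp and protein length of 556 aa.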


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.159601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGA CGACGTCCCG ACCCCCCACC GCCCCGCCGG CCAAGCGCCC CGAGGGCCAG 
TGGGCCTTCG ACCAGCGCGA GCCGCTCAAC GCCAACGAGG CGCTCAAGCA GGCCGACGAC
GGGCTCAACG TCCGCGAGCG CATCGAGACG GTGTACGCCC GCGAGGGCTT CGCGTCGATC
CCCGGCGACG ACCTGCGCGG CCGCATGCGC TGGTGGGGCC TGTACACCCA GCGCAAGCCC
GGTATCGACG GCGGGCGCAC CGCCACCCTC GAGCCGCACG AGCTCGAGGA CGAGTACTTC
ATGCTGCGCG TGCGCTGCGA CGGCGGCTCG CTGGACCTGC GGCAGCTGCG CACCGTCGCG
GGGATCTCGC AGGAGTTCGG CCGCGGCACC GCGGACATCA CCGACCGGCA GAACATCCAG
CTCCACTGGG TCCGCGTCGA GGACGTCCCG GAGATCTGGC GCCGGCTGGA GTCGGTCGGC
CTGACCACGC AGGAGGCGTG CGGCGACGTG CCGCGCGTCA TCCTCGGGTC GCCCGTCGCG
GGCGTGGCGG CCGACGAGAT CATCGACGGC ACGCCCGCGA TCGAGGCGAT CCGCGAGCGC
TACATCGGCG ACCCCGCCTA CTCGAACCTG CCGCGCAAGT TCAAGACGGC GATCAGCGGG
TCGCCCCACC AGGACGTCGC GCACGAGATC AACGACGTCG CGTTCGTCGG CGTCGTCCAC
CCCGAGCTCG GCCCCGGCTT CGACCTGTGG GTCGGCGGCG CGCTGTCCAC CAACCCGATG
CTGGGCAAGC GCCTCGGTGC GTTCGTCACC CTCGAGCAGG TGCCCGACGT GTGGTGCGGC
GTGGTCGGCA TCTTCCGGGA CTACGGCTAC CGCCGGCTGC GCACCCGGGC CCGCCTGAAG
TTCCTGCTCG CGGACTGGGG CACCGAGGTC TTCCGCCAGG TGCTGGAGAC GGAGTACCTG
GGCTACGCGC TCGCCGACGG CCCCCCACCG CCGCCCCCGC CGCACGGCAA CCGGGACCAC
GTGGGCGTGC ACCCGCAGAA GGACGGCCGC TTCTACGTGG GCGCGGCGCC CACCGTCGGT
CGCGTCTCGG GCCCGGTCCT CACCGCACTG GCGGACCTCG TCGAGGAGGC CGGGTCCGAC
CGCGTGCAGC TCACCACCGA GCAGAAGCTC GTGGTGCTGG ACGTCGCGGC GGACCGCGTC
GACGCGCTGG TCGACGGGCT GGAGTCGCTG GGCCTGCGGG TGCGCACCGC CTCGCCGTTC
CGCCGGGGCA CGATGGCCTG CACCGGCATC GAGTTCTGCA AGCTCGCGAT CGTCGAGACC
AAGGGGCGCG CGACCGGCCT GGTCGGCGAG CTCGAGCGCC GGCTGCCGAC GTTCGACCAG
CCGATCACGA TCAACGTCAA CGGCTGCCCC AACTCGTGCG CGCGCATCCA GACCGCCGAC
ATCGGCCTCA AGGGTGTCCT GGCCGGTGAC GAGGAGGGCT ACCAGGTGCA CCTCGGCGGC
GGGCTGGCCA CGACCAGCGG CCTCGGCCGC ACGCTGCGCG GCCTGCGCGT GCCGTCCTCC
GAGCTGCCCG ACTACGTCGA GCGGGTCACG CGCCGGTTCG ACGAGCAGCG TGAGCCGGGC
GAGCTGTTCG CCCAGTGGGT GCAGCGCGCC GACGAGGAGG ACCTGCGATG A
 
Protein sequence
MAQTTSRPPT APPAKRPEGQ WAFDQREPLN ANEALKQADD GLNVRERIET VYAREGFASI 
PGDDLRGRMR WWGLYTQRKP GIDGGRTATL EPHELEDEYF MLRVRCDGGS LDLRQLRTVA
GISQEFGRGT ADITDRQNIQ LHWVRVEDVP EIWRRLESVG LTTQEACGDV PRVILGSPVA
GVAADEIIDG TPAIEAIRER YIGDPAYSNL PRKFKTAISG SPHQDVAHEI NDVAFVGVVH
PELGPGFDLW VGGALSTNPM LGKRLGAFVT LEQVPDVWCG VVGIFRDYGY RRLRTRARLK
FLLADWGTEV FRQVLETEYL GYALADGPPP PPPPHGNRDH VGVHPQKDGR FYVGAAPTVG
RVSGPVLTAL ADLVEEAGSD RVQLTTEQKL VVLDVAADRV DALVDGLESL GLRVRTASPF
RRGTMACTGI EFCKLAIVET KGRATGLVGE LERRLPTFDQ PITINVNGCP NSCARIQTAD
IGLKGVLAGD EEGYQVHLGG GLATTSGLGR TLRGLRVPSS ELPDYVERVT RRFDEQREPG
ELFAQWVQRA DEEDLR
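
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how the gene sequence could be joined, translated, and checked against the protein sequence. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the tables differ in permitted start codons), and it uses only the Python standard library. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Amino-acid assignments in NCBI order, with bases ordered T, C, A, G.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the 10-character layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCGCAGA CGACGTCCCG ACCCCCCACC GCCCCGCCGG CCAAGCGCCC CGAGGGCCAG"
print(translate(first_line))  # MAQTTSRPPTAPPAKRPEGQ
```

The printed peptide corresponds to the first two blocks of the protein sequence (MAQTTSRPPT APPAKRPEGQ). Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.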