Gene Cfla_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2221 
Symbol 
ID9146121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2479132 
End bp2480712 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF404 
Protein accessionYP_003637311 
Protein GI296130061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00605646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGACC TCTTCGACGA CTACCCGGCG GGGGCCGCCT GGGACGAGAT GCTCGGTCCC 
GACGGCGAGG TGAGCGGCGC CTACCGGCAC GTGCACGCGG CCCTGGCGCA GCTCTCGGCC
GGCGAGCTGC GCGCCCGCGC CGACACGCTC GCGCGCTCCT ACCTCAAGCA GGGCGTGACG
TTCGACTTCG CGGGCGAGGA GCGACCCTTC CCGCTCGACG TCGTGCCGCG CGCACTGGCG
GGCGAGGAGT GGGACCACGT CGCGCCCGGA GTCGCGCAGC GCGTGCGGGC GCTCGAGGCG
TTCCTCGCCG ACGTGTACGG CCCGCAGAAG TCGATCGCCG ACGGTGTCGT GCCGCGGTCG
GTCGTGGTGT CGTCGACGCA CTTCCACCGT GCCGCGCGTG GCGTCGAGCC GCCCAACGGC
GTGCGGGTGC ACGTCTCGGG CATCGACCTG GTGCGCGACT CCCTCGGCGG CTGGCGCGTC
CTGGAGGACA ACGTGCGCGT GCCGTCCGGC GTCAGCTACG TGCTGTCGAA CCGCCGGGCC
ATGGCCCAGA CCTTCCCGGA GCTGTTCGCC GCGCTGCGCA TCCGCCCCGT CGTCGACTAC
CCCCGGCGCC TGCTGGCGGC ACTGACCGCC GCGGCGCCCC AGGGCGTCGA CGACCCGACG
GTCGTGGTGC TCACTCCGGG CGTCTTCAAC TCGGCCTACT TCGAGCACAG CCTGCTGGCA
CGCACCATGG GTGTGGAGCT CGTCGAGGGC CGCGACCTCT ACGTCTCCGG TGGCCGGGTC
TGGATGCGCA CCACCCAGGG CCGACGGCGC GTCGACGTCA TCTACCGGCG CGTCGACGAC
GAGTTCCTCG ACCCGGTGAC GTTCCGCTCG GACTCGCTGC TCGGCTGCCC GGGCCTCATG
ACGTGCGCGC GGCTCGGCAC CGTCACCATC GCCAACGCCA TCGGCAACGG CGTGGCCGAC
GACAAGCTGC TCTACACCTA CGTGCCGGAC CTCATCCGCT ACCACCTGGG CGAGGAGCCG
ATCCTCCCCA ACGTCGACAC GTGGCGCCTC GAGGACCCCG GCGCGCTCGC GGAGGTGCTC
GACCGGCTCG ACGAGCTGGT CGTCAAGCCG GTCGACGGGT CGGGCGGCAA GGGTCTGGTC
GTGGGACCGC GGGCGACGCG CGCCGAGCTC GACGAGCTGC GGTCACGGCT GCGTGAGGAC
CCGCGCGGCT GGATCGCCCA GCCGGTCGTC CAGCTGTCCA CCGTGCCGAC CCTCGTCGAG
GACGGTCTGC GGCCGCGGCA CGTCGACCTG CGCCCGTTCG CCGTCAACGA CGGCGAGTCC
GTCTACGTGC TGCCCGGCGG TCTCACCCGC GTCGCGCTGC CCGAGGGCCA GCTGGTCGTC
AACTCCTCGC AGGGCGGCGG GTCCAAGGAC ACGTGGGTCC TCGGGGGTCG CGTCCCGCGG
CGCGCGCAGT CGCAGAGCCA GTCGCTGCCG CAGGCGGTGC CGAAGGACGC GTCGGTGCCG
ATCGACTCCC ACCCCTCGGA CCGGCGCGCG CAGGTCATGC AGCAGCAGCA GCAGCGGACG
GCGGGGGGCG CGACGTGCTG A
 
Protein sequence
MADLFDDYPA GAAWDEMLGP DGEVSGAYRH VHAALAQLSA GELRARADTL ARSYLKQGVT 
FDFAGEERPF PLDVVPRALA GEEWDHVAPG VAQRVRALEA FLADVYGPQK SIADGVVPRS
VVVSSTHFHR AARGVEPPNG VRVHVSGIDL VRDSLGGWRV LEDNVRVPSG VSYVLSNRRA
MAQTFPELFA ALRIRPVVDY PRRLLAALTA AAPQGVDDPT VVVLTPGVFN SAYFEHSLLA
RTMGVELVEG RDLYVSGGRV WMRTTQGRRR VDVIYRRVDD EFLDPVTFRS DSLLGCPGLM
TCARLGTVTI ANAIGNGVAD DKLLYTYVPD LIRYHLGEEP ILPNVDTWRL EDPGALAEVL
DRLDELVVKP VDGSGGKGLV VGPRATRAEL DELRSRLRED PRGWIAQPVV QLSTVPTLVE
DGLRPRHVDL RPFAVNDGES VYVLPGGLTR VALPEGQLVV NSSQGGGSKD TWVLGGRVPR
RAQSQSQSLP QAVPKDASVP IDSHPSDRRA QVMQQQQQRT AGGATC