Gene Cfla_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2196 
Symbol 
ID9146096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2445392 
End bp2446540 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content77% 
IMG OID 
Productprotein of unknown function DUF58 
Protein accessionYP_003637286 
Protein GI296130036 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.309415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00320399 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGGTCTGC GCGTCGCGCC CCTGGGGTGG GGGACCGCGG TCGTGGCCGT CTTCGCCACG 
GCGGCGGGAC GCGTGCTGGG CTGGGGCGAG CTCGCGGCGC TCGGCGTCGC GCTGCTGGCC
GTCCTCGTGG TCGCCCTCCT CATGACGGTC GGGCGCACCC GCTACCGCGT CGTGCTCGAC
CTGGCGGACC ACCGCGTGCG CATCGGCCAG CGGGCCGTGG GGCGCGTCGA CGTCCGCAAC
GCCGCCAGGC GCCGCTCCCT GCCCTCGCAG GTCGACCTGC CCGTCGGGGA GCGGGTCGTC
GAGCTGTCGG TCCCGAGCCT GCCGCCCGGC GGCACGCACG ACGACCTGTT CGCGGTGCCG
ACGGAGCGGC GTGCCGTGAT CGTCGTGGGC CCCGTGGTGT CGCGGCGGGG CGACCCCTTC
GGACTGCTGC AGCGCCGTCT GCGCTGGACC GAGCCCGCCG AGCTCTTCGT GCACCCCGAG
GTGATCGGCC TGGGCGGCGC CAACGCCGGC CTGCTGCGCG ACCTCGAGGG GCAGGCGACG
CGTGACCTGT CGGACTCGGA CCTCAACTTC CACGCGCTGC GCGACTACGT CGCCGGCGAC
GACCGCCGCT ACATCCACTG GCGCACGACC GCGCGCCGTG GCCGCCTCAT GGTCAAGCAG
TTCGAGGACA CGCGGCGCAC GCTGACGTCC ATCGCGCTCG CGACCGCGGT CGGCGACTAC
GCGCACCCCG ACGAGCTCGA GCTGGCGGTG TCGGTCGCCG CGTCGATCGC GGTGCAGGCG
ATCCGCGACG AGCGCGACGT CGAGGTGCTC GCCGGGGCGG GCCACCTGCG GACCGCCACG
CCCCCGCTGC TCCTCGACGA CTGCTCGCGG CTGTCGTGGT CGCCCACGGG GCCCGGTGTC
GTCCTGCTGG GCCGCCGCGT GGTGCGCGAG ACACCGGACG CGTCGGTGGC CTTCCTCGTC
ACGGGCGGCG CGCCCACCGA CGCCGACCTG CGTCTGGGTG CGCGCGCACT GCCCGCCGGC
ACGCGCGCCG TCGTCCTGCG GTGCGCGCTC GGCGAGGACG TCGCGGTGCG CACGCAGGGC
TCGCTCACGC TGGGCACGCT CGGCCGGCTC GACGACCTGC CGCGCGCACT GCGCCGGGTG
GTGGGATGA
 
Protein sequence
MGLRVAPLGW GTAVVAVFAT AAGRVLGWGE LAALGVALLA VLVVALLMTV GRTRYRVVLD 
LADHRVRIGQ RAVGRVDVRN AARRRSLPSQ VDLPVGERVV ELSVPSLPPG GTHDDLFAVP
TERRAVIVVG PVVSRRGDPF GLLQRRLRWT EPAELFVHPE VIGLGGANAG LLRDLEGQAT
RDLSDSDLNF HALRDYVAGD DRRYIHWRTT ARRGRLMVKQ FEDTRRTLTS IALATAVGDY
AHPDELELAV SVAASIAVQA IRDERDVEVL AGAGHLRTAT PPLLLDDCSR LSWSPTGPGV
VLLGRRVVRE TPDASVAFLV TGGAPTDADL RLGARALPAG TRAVVLRCAL GEDVAVRTQG
SLTLGTLGRL DDLPRALRRV VG