Gene Cfla_3716 details

Gene Information

Locus tag: Cfla_3716
Symbol:
ID: 9147632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4106624
End bp: 4108414
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003638783
Protein GI: 296131533
COG category:
COG ID:
TIGRFAM ID:
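
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated by the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4106624
end_bp = 4108414
gene_length_bp = 1791
protein_length_aa = 596

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span (bp):", span_bp)
print("remainder after dividing by 3:", span_bp % 3)
print("coding codons:", coding_codons)
```

Under these assumptions the span equals the listed gene length, the length is a multiple of three, and the number of coding codons equals the listed protein length.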


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00638349
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACGGAAG AGGTCGGCCG GGGCACGGTG CTCGCCGGGC GCTACCGGGT GGACGAGCCG 
CTGCCGTCGG ACCTCGCGGG AGTGTCCGTG TGGCGCGCCA CCGACCAGAT CCTCGACCGT
CCCGTGCGTG TCCGGGTGCT GGAGTCCGGG GCCGTCGCGC CGGCGCTCGA CGCCGCACGT
CGCGCCGCCC TCGTCACCGA CGCCCGCCTC GTGCGCGTGC TCGACGTCGG CATGCACGAG
GGCGTCGGCT ACGTCGTCTC CGAGCAGATC ACCGGCGCAT CCCTCACCCA GCTCGTCGAG
CGGGGGCCGC TGACGCCCGA CCAGGCCCGC GCGGTCGTCG GCGAGGCCGC GGCCGCGCTC
GAGGTCGCGC GACGCCGCGG CGTGCACCAC CTCGCGCTGC GCCCGTCGGT CGTGCACGTG
TCGGCCGACG GCCGGGTGCT CGTCTCGGGC CTCGCGATCG ACGCCGCACT CCTCGGCGCG
CCCCCGGGGG ACGCGCGCAC GACGAGCCGC ACCGATGCCG TCGACCTCGT CCGGCTCCTC
TACACGGGCC TCACGGGCCG CTGGCCCGCG GGGCGTGACG ACGCCCTCGC GCCCACCGTG
CAGCCTGCAC CCGTCCTGGA CGGCCTGCCC GTGCCGCCCG CGGAGCTGGC GCCCGGCGTC
CCGAACGACC TCGACACGCT GTGCGTCGTG ACCCTCGGCC CGAACCAGGA CGGCCCGTTC
TCCCCCGGCG ACGTCGTGCA CGAGCTCGAG CCCTGGGGTG AGATCCGCAT CGGGCGCCCG
GCCGACGACG ACCGTGCGGC AGGTGAGGGT GCCGTGGCCG CTGCCGCCGC GCCGGCCGTC
GAGCCCGAGC GTCCCGCCCC GCCGGTCCGC GTCGCGCGCC AGTCCGTGCG GTCCGCGTTC
GACGAGCTGC CCGCCGGTGC TCCGCTTCCC GGCACTCCGC CGCCGGCCGC CCCGGCGCGC
GGCGGCATCC CCTCGGGGCG GGTCGAGCGC ACCGGTGTGC TGCCCGCCGG CGCCGCGTAC
GGTGCCGGGG CGGTCCCGCC GCCCGGCCCC CCGCCGAGCC ACGCCCCCGA GCCCGATTTC
TGGGACGAGG GTCCCGGTTT CGCGGACGAC CCGTTCGCGT TCGTGGAGGA CGACGAGCCG
CGCCGCCGGT TCGACCCGAC GGCGCTCGTG CTCGTCGTCG TCGGACTGGC CGTGCTGATC
GGGCTGGTGT TCGCCGCACG CTCGCTCTTC ACGTCGCCGG TCGGCGACCG TGACCCCGTC
GCGGACGACA CCCCGAGCCA GCAGGAGCCC GCGACGCCGT CGGAGGGCGG CGCCACGGAG
CCCACGCCGC AGGAGACCGT CGACGAGGCG CCCGACCCGG GCGTGCCGCC GGCGATCGAG
TCCGTCCGGA CGTTCGACCC GACGGACCCC GCGGGCGAGC GGGTCGCCAA CGTCGAGCTC
ACGCACGACG GCGACCCGTC GACGTTCTGG TTCTCCTACA CCTACAACAA CCCGGCGTTC
GGCGGGCTCA AGGAGGGCAT GGGCCTCGAG GTCACGCTCG CCGCCGAGGC CCCGGTGTCC
GGCGTGACGC TGAACGTGAA CGGCTCGGGC GGCAACGTCG AGGTCCGTGC GACCACCGCG
TCGACGCCCA CGGAGGGTGC GGCCCTCGGT GGCGGGCCGC TGGGCCCCGA GACCGTCCTC
GACTTCGAGG AGCCGGTGAC GACGTCGACC CTCGTCCTGT ACTTCACCGA GCTGCCCACC
AACGCGGCCG GGCAGTACCG GATCGAGGTC ACGGAGATCA CCGTCCGGTA G
 
Protein sequence
MTEEVGRGTV LAGRYRVDEP LPSDLAGVSV WRATDQILDR PVRVRVLESG AVAPALDAAR 
RAALVTDARL VRVLDVGMHE GVGYVVSEQI TGASLTQLVE RGPLTPDQAR AVVGEAAAAL
EVARRRGVHH LALRPSVVHV SADGRVLVSG LAIDAALLGA PPGDARTTSR TDAVDLVRLL
YTGLTGRWPA GRDDALAPTV QPAPVLDGLP VPPAELAPGV PNDLDTLCVV TLGPNQDGPF
SPGDVVHELE PWGEIRIGRP ADDDRAAGEG AVAAAAAPAV EPERPAPPVR VARQSVRSAF
DELPAGAPLP GTPPPAAPAR GGIPSGRVER TGVLPAGAAY GAGAVPPPGP PPSHAPEPDF
WDEGPGFADD PFAFVEDDEP RRRFDPTALV LVVVGLAVLI GLVFAARSLF TSPVGDRDPV
ADDTPSQQEP ATPSEGGATE PTPQETVDEA PDPGVPPAIE SVRTFDPTDP AGERVANVEL
THDGDPSTFW FSYTYNNPAF GGLKEGMGLE VTLAAEAPVS GVTLNVNGSG GNVEVRATTA
STPTEGAALG GGPLGPETVL DFEEPVTTST LVLYFTELPT NAAGQYRIEV TEITVR
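
The protein sequence begins with M although the gene sequence begins with GTG. This is consistent with translation table 11, in which GTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 and the last 21 nucleotides of the gene sequence. The helper `translate` and its parameter `is_gene_start` are hypothetical names introduced for this example, and the code is a simplified illustration, not the procedure used to produce this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation
# table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_gene_start=True):
    """Translate DNA in frame 0, stopping at the first stop codon.

    If is_gene_start is True and the first codon is a recognised
    start codon, it is read as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_gene_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 21 nt of the gene sequence in this record.
head = "GTGACGGAAG AGGTCGGCCG GGGCACGGTG"
tail = "ACGGAGATCA CCGTCCGGTA G"

head_protein = translate(head)
tail_protein = translate(tail, is_gene_start=False)
print(head_protein)
print(tail_protein)
```

The two printed fragments can be compared with the first ten and last six residues of the protein sequence above.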