Gene Cfla_3373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3373 
Symbol 
ID9147289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3749658 
End bp3751766 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF839 
Protein accessionYP_003638451 
Protein GI296131201 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.14033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00826975 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACCA CGACCCCGCA CGACCACGCC GGAGCGGGCA CGCCCGGCCC GGGCCGTCGC 
GCCCTCCTGC CGTTCCTGCC GATGGCGGGA CGCACGCACG GCCGGCGCTC CCCGGTGACG
TGCCACCTGC GGTGCGGGGA CGCGTGCGCA CAGCCGGTGC CCAACACCAG CGGCAACGCG
TACTTCCGGG ACGTGGCCGA CGCGGCGCTG ACCCGGCGGT CGGTGCTGGC CGGCGCGCTG
GTCGCGGCGA CGGCGGTGGC GGTGGGGGCG GACGTGCTGC AGGCCCCGCC CGCCGCAGCC
CGCGGCTACG GCCCGGGGCG CGGGCACGGG CACCTGCCGT TCCGTCCGAT CGCCCCCGTG
CCGGCGGCGA GCGACGCGTT CACCGTGCCG CAGGGCTACC GCTGGGCGCC GCTCGTGCGC
TGGGGCGACC CGCTGTTCTC GACGCGCGAC GTGTTCGACC CCGCGAGGCT CACGCCCGAG
GTCGCGGAGC GGCTCGTCGG CTACAACTGC GACTACGTCG ACCTGCTCGA GGACCCGCGC
GGCCGCGACG CGCTGGTCGT CGTGAACCAC GAGTACACGA ACGAGGGGAT CATGTTCCCG
CCGGCGGCGA CGCCGCAGGA GCTCGAGCGG CAGCGGCGCG TGGCGATGGC GTCGCACGGC
ATGTCGGTGG TCGCGCTGCG GCGGTCCCGG CCGGGGCGTC CGTGGGCGCC GCGTGTCGGG
GCGCGCGAGA ACCGCCGCGT CACGGCGACG ACGCCCTGCA CGTTCGACGG GCCCGCGGCC
GGCTCGCCGC TGCTGCGGAC GGCCGCGGAC CCGGCGGGCT CGACGGTGCT GGGCACGCTG
AACAACTGCG CGGGCGGCAC GACGCCGTGG GGCACGGTGC TGTCCGGCGA GGAGAACGTC
AACCAGTACT TCCGCACGCC CGGGACGAGC GCGGCGGACC GCCGGTACGG GTTCGCCGAC
AAGGCGACGT CGCGCGGCTG GGAGCAGGTC GACCCGCGCT TCGACACCCG GACCCCGGGG
CAGGAGAACG AGCACCACCG GTTCGGGTGG GTCGTCGAGC TGGACCCCGC GGACCCGTCG
GCGCCGCCCG TGAAGCACAC GGCGCTGGGG CGCTTCAAGC ACGAGGGTGC GAACGTCGTG
GTCGGGCGCA CGGGGCACGT CGCGGCCTAC ATGGGCGACG ACGAGCGGTT CGACTACGTC
TACAAGTTCG TCAGCTCGCA GCGGTACCGG CCCGGGCCGC GGCACCGCGA GCACAACAAG
CGCCTGCTGT CCGCCGGGTC CCTGTACGTC GCGCGGTTCG GGGACGACAC CGGGGTGGCG
CAGGTGGCGG CGACGGGCGC GCGCCCGTCG GGCGGGCAGT TCGACGGCGC GGGCGAGTGG
CTGCCGCTCG TCGTCGACGG CGTCAGCCAG GTGCCGGGCT TCACGACCGA CGAGGTGCTC
GTGCACACGC GTTTGGCGGC CGACGCGGTC GGGGCGACGA AGATGGACCG GCCGGAGGAC
GTCGAGCCGC ACCCGACGAC CGGCCGGGTG TACGTCGCGT GCACCAACAA CACCGACCGC
GGGGCTGCCG GCAAGGAGGG CGCCACGGCC CCGAACCCGC GCACGGCCAA CCGGCACGGG
CACGTCGTCG AGCTGACGGA GGCCGGTGAC GACGTCACGG CGACCACGTT CGGCTGGAGC
ATCCTGCTGC TGTGCGGCGA CCCGGCGACG ACGTCCGACA CGTACTTCGC GGGGTTCCCG
CCCGAGAAGG TCTCGCCGAT CTCCTGCCCG GACAACGTCG CGTTCGACTC CGCGGGCGAC
CTGTGGATCT CGACCGACGG CCAGCCGGGC GCGATCGGCT ACGGCGACGG GCTGTTCAAG
GTGCCGCTCA CCGGGCCCGA GCGCGGGCGG GTGCAGCAGT TCCTCGCCGT GCCGGCCGGC
GCCGAGACGT GCGGGCCGGT GGTGCGGGAC CGCGACGGCA TGGTCTACGT CGCGGTGCAG
CACCCGGGGG AGGACGGCAC CTGGGACGCC CAGCAGTCGT TCTTCCCGGA CTACGTCGCG
CCCGGGTCCG TGTCACGCGG ACGCTGGGGC GGGCCGCGCC CGAGCGTCGT GCAGGTCTGG
CGCGACTGA
 
Protein sequence
MTTTTPHDHA GAGTPGPGRR ALLPFLPMAG RTHGRRSPVT CHLRCGDACA QPVPNTSGNA 
YFRDVADAAL TRRSVLAGAL VAATAVAVGA DVLQAPPAAA RGYGPGRGHG HLPFRPIAPV
PAASDAFTVP QGYRWAPLVR WGDPLFSTRD VFDPARLTPE VAERLVGYNC DYVDLLEDPR
GRDALVVVNH EYTNEGIMFP PAATPQELER QRRVAMASHG MSVVALRRSR PGRPWAPRVG
ARENRRVTAT TPCTFDGPAA GSPLLRTAAD PAGSTVLGTL NNCAGGTTPW GTVLSGEENV
NQYFRTPGTS AADRRYGFAD KATSRGWEQV DPRFDTRTPG QENEHHRFGW VVELDPADPS
APPVKHTALG RFKHEGANVV VGRTGHVAAY MGDDERFDYV YKFVSSQRYR PGPRHREHNK
RLLSAGSLYV ARFGDDTGVA QVAATGARPS GGQFDGAGEW LPLVVDGVSQ VPGFTTDEVL
VHTRLAADAV GATKMDRPED VEPHPTTGRV YVACTNNTDR GAAGKEGATA PNPRTANRHG
HVVELTEAGD DVTATTFGWS ILLLCGDPAT TSDTYFAGFP PEKVSPISCP DNVAFDSAGD
LWISTDGQPG AIGYGDGLFK VPLTGPERGR VQQFLAVPAG AETCGPVVRD RDGMVYVAVQ
HPGEDGTWDA QQSFFPDYVA PGSVSRGRWG GPRPSVVQVW RD