Gene Cfla_2461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2461 
Symbol 
ID9146365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2753512 
End bp2756514 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content71% 
IMG OID 
Productprotein of unknown function UPF0182 
Protein accessionYP_003637548 
Protein GI296130298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.947136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000899121 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCTTCG CCTCCCCGCC CCGCCCCCGA CCTGTCGCGC CGCGCCGCCG CGGCCCTCTG 
GTCCCGACCG TCGTCACGCT CGTCGTACTC GTCGTCCTGC TGCTCGTGCT CGCACAGGTC
TGGACCGAGG TCCTCTGGTA CTCGCAGCTG GGGTTCACCG AGGTGCTGCG CACCGAGTGG
GTCACGCGCG GCGTCCTGTT CGTCCTGGGC TTCGCCGTCA TGGCCGCGAG CGTGGGCTTC
TCGCTCAGCT ACGGCTACCG CGCGCGGCCG GTGTACGCGC CCTCGACGCA GGAGCAGGCC
AACCTCGACC AGTACCGCGA GGCCATCGAG CCGCTGCGCC GGCTCGTGCT GGTGGTCGGC
CCTGTCGTGC TCGGGCTCTT CGCGGGTGGC GCGGCCTCCC AGCAGTGGAG CACCGTGCAG
CTGTGGCTCA AGGGTGGAGA CGTCGGGGTG AGCGACCCGC AGTGGGGCAT CGACCTGGGG
TTCTACCTCT TCACGCTGCC CGGTCTGCGG TTCGTCGTGT CGTTCCTCAT GGCGGTGCTG
GTGCTCTCGA GCGTCGCGGC CGTCGCGACG CACTACCTCT ACGGCGGCCT GCGCATCGGC
GGGGGCGAGG GAGCGCCGCG CACCACGCGC GCGGCGCGCG TCCACCTGTC GGTGCTCGGT
GCGCTGGTCC TGCTGCTCGT CGGCGCGAGC TACTGGCTGG ACCGCTACTC GATCCTCACC
AAGCAGCAGA CGGGCGAGCA GCGCTGGCAG GGGGCCGGGT TCACGGACGT GCACGCGGTG
ATCCCGTCGA AGGCGATCCT GGCCGTCGCC GCGGTCGTCG TGGCCGGTGC GTTCGTCGCG
ACGGCGTTCA CGGGCAACTG GCGCCTGCCG GCGATCGGCG TGGGGCTGAT GGTGGTGGCG
GCGGTCGTCG TCGGCGGCGT GTACCCGGCG GTCGTGCAGC GCTTCCAGGT GCAGCCGAAC
CAGCAGGACG CGGAGTCGGA GTACATCCAA CGCAACATCG ACGCGACGCT CGCGGCGTAC
GGGCTCGACG ACATCGACAT CGCCGACTAC GACGCGAACG TCACGGCCGA GCCGGGCGCG
CTGCGGGAGA ACGCGGAGAC CACCGCGTCC ATCCGCCTGC TGGACCCGAA CATCGTGTCG
CCGTCGTTCA AGCAGCTGCA GCAGATCCGC GGGTTCTACG ACTTCCCCGA CTCGCTGTCC
GTCGACCGGT ACACGATCGA CGGCGAGAGC CGCGACACGG TGATCGCGGT GCGTGAGCTC
GACCTCAACG GCCTCAACGC GGGGCAGCGC AACTGGACCA ACGACACGAC GGTGTACACG
CACGGGTTCG GGGTCGTCGC CGCGTACGGC AACACGCGCG GTGCACGCGG CGCACCCGCG
TTCTGGGAGG GCGGGATCCC CTCGCAGGGC GAGCTGGGGG AGTACGAGCC GCGCATCTAC
TTCGGCCAGA AGTCCCCCGA GTACTCGATC GTCGGGGGCC GGCCGGGAAC CGATGGCTGG
GAGTTCGACT ACCCGGACGA CGAGACGGGC ACCGGGTCCG TGCCCTTCCG CTTCCCCACG
GAGGAGGACT CCGCGGGCCC GGCGGTGGGG TCGCTGTGGA ACAAGGTGCT GTTCGCGCTG
AAGTTCGGCG ACGAGCAGAT CCTCTTCTCC GACCGCGTGA CCGAGACCTC GCAGATCCTC
TACGACCGCA ACCCGCGCGA CCGCGTCGCG AAGGTCGCGC CGTACCTCGA CCTCGACGGA
CGCGTGTACC CCGCGGTCGT CGACGGCCGC GTGAAGTGGA TCATCGACGG GTACACCACG
TCCGACCAGT ACCCCTACGC CGCGGTGCGC TCCCTCGAGG ACGCCACCAC GGACTCGCTG
ACCGAGCGCA GCAGCACCGT GCAGGCGCTG CTGCCCAAGC AGGTCAACTA CATCCGCAAC
TCCGTGAAGG CCACGGTGGA CGCGTACGAC GGGTCGGTCG ACCTGTACGC GTGGGACACC
GAGGACCCGG TCCTCGACGC GTGGACCAAG GTGTTCCCGA CCTCGCTGAA GCCGATGTCC
GAGATCTCGG GCGACCTGAT GAGCCACATC CGGTACCCGG AGGACCTCTT CAAGGTGCAG
CGTGCTCTGC TCGGGCAGTA CCACGTGACG AACGCGGCGG GCTTCTTCTC CGGGAACGAC
TTCTGGCAGA GCCCGGCCGA CCCGACCGCG ACCAACGCGG ACGTCGCGCA GCCGCCCTAC
TACCTGACGC TGCAGATGCC GGGCGAGGAC AAGGCGACGT TCTCGCTGAT GTCGACCTTC
ATCCCGGGCG GTGCCAACGC GCGCAACGTG CTCACCGGGT ACCTCGCGGT GGACGCGGAA
GCCGGCAACA CGCCGGGCAA GGTCTCCGAG ACGTACGGCA GGATCCGGCT TCTCGAGCTG
CCGCGTGACT CCACCGTCCC CGGGCCGGGG CAGGCGAGCG CGAACTTCAC CGCCGACCCG
ACGGTGTCGA ACGAGCTGAA CATCCTGCAG CGCGGTGACT CGATCGTGCG GCGTGGCAAC
CTGCTGACGC TGCCCGTCGG CGGCGGTCTG CTGTACGTCC AGCCGGTGTA CGTGCAGGCG
GCGAGCGGGA CGACGTTCCC GCTGCTGCAG CGTGTGCTCG TCTCGTTCGG TGACGAGATC
GGCTTCGCCC AGACGCTCGA CGAGGCGCTG GACCAGGTCT TCGGCGGCGA CTCCGGTGCG
GACGCGGGTG ACGCCGGCGC CGAACCGGTG CCCCCGGTCG ACACGGAGGA GCCGGTGGAG
CCCGGCGACG AGCCGACCGG CGACCCGACG GCGGAGCCGA CCGCGGACGC CACGACGGCG
CCGACGGCCG ACCCGGACGC GCGGGCGGCC CTCGACGAGG CGCTGCAGCG GGCGCAGCAG
GCCATCCGCG CGGGTCAGGA CGCGCTGGCC GACGGGGACT TCGCCGCCTA CGGCGAGGCG
CAGGACCGCC TGGACGCCGC GATCGCCGAC GCGATCGCCG CGGAGGGACG CCTGCAGGAC
TGA
 
Protein sequence
MTFASPPRPR PVAPRRRGPL VPTVVTLVVL VVLLLVLAQV WTEVLWYSQL GFTEVLRTEW 
VTRGVLFVLG FAVMAASVGF SLSYGYRARP VYAPSTQEQA NLDQYREAIE PLRRLVLVVG
PVVLGLFAGG AASQQWSTVQ LWLKGGDVGV SDPQWGIDLG FYLFTLPGLR FVVSFLMAVL
VLSSVAAVAT HYLYGGLRIG GGEGAPRTTR AARVHLSVLG ALVLLLVGAS YWLDRYSILT
KQQTGEQRWQ GAGFTDVHAV IPSKAILAVA AVVVAGAFVA TAFTGNWRLP AIGVGLMVVA
AVVVGGVYPA VVQRFQVQPN QQDAESEYIQ RNIDATLAAY GLDDIDIADY DANVTAEPGA
LRENAETTAS IRLLDPNIVS PSFKQLQQIR GFYDFPDSLS VDRYTIDGES RDTVIAVREL
DLNGLNAGQR NWTNDTTVYT HGFGVVAAYG NTRGARGAPA FWEGGIPSQG ELGEYEPRIY
FGQKSPEYSI VGGRPGTDGW EFDYPDDETG TGSVPFRFPT EEDSAGPAVG SLWNKVLFAL
KFGDEQILFS DRVTETSQIL YDRNPRDRVA KVAPYLDLDG RVYPAVVDGR VKWIIDGYTT
SDQYPYAAVR SLEDATTDSL TERSSTVQAL LPKQVNYIRN SVKATVDAYD GSVDLYAWDT
EDPVLDAWTK VFPTSLKPMS EISGDLMSHI RYPEDLFKVQ RALLGQYHVT NAAGFFSGND
FWQSPADPTA TNADVAQPPY YLTLQMPGED KATFSLMSTF IPGGANARNV LTGYLAVDAE
AGNTPGKVSE TYGRIRLLEL PRDSTVPGPG QASANFTADP TVSNELNILQ RGDSIVRRGN
LLTLPVGGGL LYVQPVYVQA ASGTTFPLLQ RVLVSFGDEI GFAQTLDEAL DQVFGGDSGA
DAGDAGAEPV PPVDTEEPVE PGDEPTGDPT AEPTADATTA PTADPDARAA LDEALQRAQQ
AIRAGQDALA DGDFAAYGEA QDRLDAAIAD AIAAEGRLQD