Gene Cfla_3431 details


Gene Information

Locus tag: Cfla_3431
Symbol:
ID: 9147347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3823266
End bp: 3825314
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: protein of unknown function DUF255
Protein accession: YP_003638504
Protein GI: 296131254
COG category:
COG ID:
TIGRFAM ID:
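
The coordinate, length, and protein-size fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, though both are consistent with the reported figures. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3823266
end_bp = 3825314
gene_length_bp = 2049
protein_length_aa = 682

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
expected_aa = codons - 1

print(span == gene_length_bp)            # True
print(span % 3 == 0)                     # True
print(expected_aa == protein_length_aa)  # True
```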


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00455743
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCAACC GCCTCGCCGC CAGCACCAGC CCCTACCTCC TGCAGCACGC CGACAACCCC 
GTCGACTGGT GGGAGTGGGG CGACGACGCG TTCGCCGAGG CCCGCCGGCG CGACGTGCCG
CTGCTGATCT CCGTCGGGTA CGCCGCGTGC CACTGGTGCC ACGTCATGGC GCACGAGTCC
TTCGAGGACC CGGCGACGGC CGCGTTCATG AACGAGCACT TCGTCTGCGT GAAGGTCGAC
CGCGAGGAAC GCCCTGACGT CGACGCCGTC TACATGGCGG CCACGCAGGC GATGACGGGC
AGCGGCGGCT GGCCCATGAC GGTGGTCGCG ACGCCCGACG GGCGGCCGTT CTTCTGCGGC
ACGTACTTCC CACCGCGCCG CGTGCAGCAG GTGCCGTCGT TCCCCGAGGT GCTCGCGGCG
GTGGCGGCCG CGTGGACGGG CCGCCGCGCG GAGGTGCTGT CGAGCGCGGA CGCGATCGCC
GACGCGCTCG CCGCGCGGCC GGGTCCCACC GACGGCCCGA GCGGTGACGA CCGCGTCGAC
GAGCGCGTCG TCGCCCGCGC GCTGGGCGCC CTGTCGGCAT CGTTCGACTC CAGGGACGGC
GGGTTCGGCG GCGCACCGAA GTTCCCGCCG TCGATGGTCC TCGAGTGGCT CCTGCGGCAC
CACGCCCGCA CCGGCGACGC CGACGCCCTC GGCATGGCAC GCCGGACGCT CGACGCCATG
GCCCGCGGCG GCGTGTACGA CCAGCTCGCC GGCGGGTACG CCCGCTACTC CGTCGACGCG
ACGTGGACCG TGCCGCACTT CGAGAAGATG CTCTACGACA ACGCGCTGCT CCTGCGGGTG
CACCTGCACG CGTGGCGGAT GACGGGCGAC GCGCTCGACC GGCGCGTCGT CGAGGAGACC
GCCGACTGGC TGCTCACGGA CCTGCGGACG GCCGAGGGCG GGTTCGCGTC CGCGCTCGAC
GCCGACAGCG AGGGCCGCGA GGGCGCGTTC TACGCCTGGA CGCCCGCGCA GCTGCGCGAG
GTGCTGGGCG ACGACGACGG GGCGTGGGCC GCGCACGTCC TGGGCGTCAC CGACGCCGGC
ACGTTCGAGC ACGGCGCGTC GGTCCTCCAG CTGCGCGAGG ACCCGGCCGA CGTCGCCCGG
TACGCCGACG TGCGCGCCCG CCTGCGCGCG GCACGCGAGC AGCGACCCCG TCCCGCGCGC
GACGACAAGG TCGTCTCCGC CTGGAACGGC CTGGCGATCG CCGCGCTCGC CGAGGCGGGC
GCGCTCCTCG ACCGCCCCGA CTGGCTCGAC GCCGCGCGCG CCTGCGCCCG GCTGCTCGCC
GACCTGCACA CACGCCCCGG CCCGGACGGC GGCGACCGGC TCGTACGCAC GTCACGCGAC
GGCGTCGCCG GCCGCGCGCC CGGGGTGCTC GAGGACTACG CCGACGTCGC CGAGGGCTAC
CTCGCGCTGG CGGCGGTGAC GGGCGAGCAC GTCTGGACGA CCTGGGCGCG GCGCCTCCTC
GCCACCGTGC TCGCGCACTT CGGCGACGGC GACGGCGGCC TGTACGACAC CGCGGACGAC
GAGACGGACG CGGTGCTCGG TGCGCTGCGC CGCCCGCAGG ACGTCGCGGA CGGCCCGGCA
CCGGCCGGGC AGCCCGCGGC GGCGGCCGCC CTCGCCCACC TCGCGGCGCT CACCGGGGAC
CTCGGCCTGC GCGAGGCCGC GCTCGGCGCG CTGCGGGAGC CGCTGGCGCT GTCGCGCCGC
TACCCCCGCG CGACGGGCTG GGCGCTGGCG GCGGCGGAGG CGCTGCTCGA CGGGCCCCGC
GAGGTCGCCG TCGTCGGGCC GCGCGACGAC CCCGCGACCC ACGCGCTGCA CCGGGCCGCG
CTCGCCTCCT CGGCACCCGG GCTCGTCGTC GCGCTCGGCG ACCCGGCCGC CGCGGATGCG
GACACCCCCG CACTGCTGCA GGACCGTCCC CTCGTCGACG GCCGTCCAGC CGCTTACGTG
TGCCGCGGGT TCGTGTGCGA GCGCCCCACG ACCGACCCCG ACGAGCTCGC CCGACAGCTG
CGGGCGTGA
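
The record reports a GC content of 77% for this gene. A minimal sketch of how such a figure could be recomputed from the blocked listing above follows. The helper name `gc_percent` is hypothetical, and the database's own rounding rule is not known, so only a short fragment is evaluated here rather than the full sequence.

```python
def gc_percent(blocked_seq: str) -> float:
    """Return the percentage of G and C bases in a sequence.

    Whitespace (the 10-base block separators and line breaks used in
    the listing) is removed before counting.
    """
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# The final 9-base block of the gene sequence: 7 of 9 bases are G or C.
print(round(gc_percent("CGGGCGTGA"), 1))  # 77.8
```

Passing the full gene sequence as one string, line breaks included, would give the value to compare against the reported 77%.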
 
Protein sequence
MPNRLAASTS PYLLQHADNP VDWWEWGDDA FAEARRRDVP LLISVGYAAC HWCHVMAHES 
FEDPATAAFM NEHFVCVKVD REERPDVDAV YMAATQAMTG SGGWPMTVVA TPDGRPFFCG
TYFPPRRVQQ VPSFPEVLAA VAAAWTGRRA EVLSSADAIA DALAARPGPT DGPSGDDRVD
ERVVARALGA LSASFDSRDG GFGGAPKFPP SMVLEWLLRH HARTGDADAL GMARRTLDAM
ARGGVYDQLA GGYARYSVDA TWTVPHFEKM LYDNALLLRV HLHAWRMTGD ALDRRVVEET
ADWLLTDLRT AEGGFASALD ADSEGREGAF YAWTPAQLRE VLGDDDGAWA AHVLGVTDAG
TFEHGASVLQ LREDPADVAR YADVRARLRA AREQRPRPAR DDKVVSAWNG LAIAALAEAG
ALLDRPDWLD AARACARLLA DLHTRPGPDG GDRLVRTSRD GVAGRAPGVL EDYADVAEGY
LALAAVTGEH VWTTWARRLL ATVLAHFGDG DGGLYDTADD ETDAVLGALR RPQDVADGPA
PAGQPAAAAA LAHLAALTGD LGLREAALGA LREPLALSRR YPRATGWALA AAEALLDGPR
EVAVVGPRDD PATHALHRAA LASSAPGLVV ALGDPAAADA DTPALLQDRP LVDGRPAAYV
CRGFVCERPT TDPDELARQL RA
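
The protein sequence above should be the translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last lines of the gene listing. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` is illustrative, and the sketch handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies
# fastest). Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace from the 10-base block layout is ignored, and any
    trailing partial codon is dropped.
    """
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGCCCAACC GCCTCGCCGC CAGCACCAGC CCCTACCTCC TGCAGCACGC CGACAACCCC"
print(translate(first_line))   # MPNRLAASTSPYLLQHADNP
print(translate("CGGGCGTGA"))  # RA
```

Both outputs match the start (MPNRLAASTS PYLLQHADNP) and end (RA) of the listed protein sequence.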