Gene Cfla_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1038 
Symbol 
ID9144913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1150384 
End bp1152081 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF885 
Protein accessionYP_003636142 
Protein GI296128892 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.999287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTCCT CCACCGCTTC CCGCCCTGTC AGTCCCGTCG ACGCCGTCGC CGACGCCTAC 
GTCGGCACGC TCGCCCGTCT GCACCCTCTC GAGGCGACGT ACCTGGGGAT CCCCGGGCAC
GACCGCGAGA TGACGGACTT CTCGCCCGAC GCCGCCGCCG AGCGCGCCGC CGCCGCTCGC
TCGACGCTCC TCGCTCTGGA GGGCCTGTCC GCCGCGGACG AGGTCGACGA GGTCACCGTC
GCGTCGATGC GCTGGGCGCT CGGCACGGGC ATCGCCCTGC ACGACGCGGG CGAGCACGCC
CGCTACCTCA ACAACATCGC CTCGCCGTCG CAGCAGGTCG TCGAGATCTT CGACCTCATG
CCCACCGACA GCGAGGAGGC GTGGGACGCC GTCGCGGCGC GCCTGGCCAC GGTGCCCGAC
GCGCTCGCGT CGTACGTCGC GTCGCTGCGG CACGCCGCGG CGGCCGGCGA CGTGGCCGCG
ATCCGGCAGG TGCAGGCCGT GATCGCGCAG GCCCGCGAGG TCGCCGACCC GGAGCGCTCG
GCGTTCACGC GGCTCGTCCG GGGCGACGAC GCACGCCGTG CGCTGGGTGA CGACCACGCG
CTGCGCGCCG ACCTGGAGCG CAACGCGGCC ACGGCACGCC GGGCGTACGC GGACCTCGCC
GAGTTCCTCG CCGACGAGCT CGCGCCGGGC GCGCCGCAGC AGGACGCCGT CGGCCGCGAC
CGGTACGCGC TGTGGTCGCG GCACCACGTG GGGGCGCAGC TCGACCTCGA CGAGACGTAC
GCGTGGGGCC TGGAGGAGCT GGCGCGGGTG CAGGCCGAGC AGGCGCAGGT CGCCGCACAG
GTGGCCGGTC CCGGCGCCAG CGTGGCGGAC GCCGTCGCGG TGCTCGACGC GGACCCGGCG
CGCCGCCTCG ACGACACGAC CGCGCTGCAG GCGTGGATGC AGCGGACGTC GGACGCCGCG
ATCGAGGCGC TCGACGGCAC CCACTTCGAC ATCCCGGCGC CCGTGCGGAC GCTCGAGTGC
CGCATCGCGC CGTCGCACAC GGGGGCGATC TACTACACCG GGCCGAGCGA CGACTTCAGC
AGGCCGGGCC GCATGTGGTG GTCGGTCCCG CACGACGTGA CGTCGTTCTC GACGTGGCGC
GAGCGCACGA CCGTCTACCA CGAGGGCGTC CCGGGCCATC ACCTGCAGAT CGCCCAGGCG
GTCCACGAGC GCGCGACGCT CAACTCGTGG CGTCGGCTCG CCTCGTGGAC GTCGGGCCAC
GGCGAGGGCT GGGCGCTCTA CGCGGAGCGG CTCATGGCGG ACCTCGGGTT CCTCGACGAC
CCGGGCGACC GGCTCGGGAT GCTCGACGGC CAGCGGCTGC GTGCCGCGCG CGTCGTGTTC
GACCTCGGCA TGCACCTGGG GCTGCCCGCG CCGGCGGAGG TCGGGACGTG GACGCCGGAG
AGCGGCTGGG ACTTCCTGCG CGCCAACGTC AACATGTCGG AGTCGTTCGT GCGCTTCGAG
TACACGCGCT ACCTGGGCTG GCCGGGGCAG GCGCCGTCGT ACAAGGTCGG GCAGCGGCTG
TGGGAGAAGG CACGGGACGC GGCGCGTGAC GCGGCGGCCG CGCAGGGCCG CACGTTCGAC
GTCCGCGAGT TCCACGCCCG CGCGCTGTCG CTGGGGTCGG TCCCCCTCGA GGTGCTGCCG
GGGGCGCTGG CGCGCTGA
 
Protein sequence
MTSSTASRPV SPVDAVADAY VGTLARLHPL EATYLGIPGH DREMTDFSPD AAAERAAAAR 
STLLALEGLS AADEVDEVTV ASMRWALGTG IALHDAGEHA RYLNNIASPS QQVVEIFDLM
PTDSEEAWDA VAARLATVPD ALASYVASLR HAAAAGDVAA IRQVQAVIAQ AREVADPERS
AFTRLVRGDD ARRALGDDHA LRADLERNAA TARRAYADLA EFLADELAPG APQQDAVGRD
RYALWSRHHV GAQLDLDETY AWGLEELARV QAEQAQVAAQ VAGPGASVAD AVAVLDADPA
RRLDDTTALQ AWMQRTSDAA IEALDGTHFD IPAPVRTLEC RIAPSHTGAI YYTGPSDDFS
RPGRMWWSVP HDVTSFSTWR ERTTVYHEGV PGHHLQIAQA VHERATLNSW RRLASWTSGH
GEGWALYAER LMADLGFLDD PGDRLGMLDG QRLRAARVVF DLGMHLGLPA PAEVGTWTPE
SGWDFLRANV NMSESFVRFE YTRYLGWPGQ APSYKVGQRL WEKARDAARD AAAAQGRTFD
VREFHARALS LGSVPLEVLP GALAR