Gene Cfla_1135 details


Gene Information

Locus tag: Cfla_1135
Symbol:
ID: 9145014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1270752
End bp: 1272491
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003636238
Protein GI: 296128988
COG category:
COG ID:
TIGRFAM ID:
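
The coordinates and lengths listed above are mutually consistent, which can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(1270752, 1272491)
print(length_bp)                  # 1740
print(protein_length(length_bp))  # 579
```

Both results match the Gene Length and Protein Length fields of the record.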


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00673541
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000103654
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGACGAG GAACCACAGC CGCGGCGGTC GCCGCGGTGC TCGTGACGGG GGTGCTCGGC 
AGCCCCGCGC GGGCAGCCGA ACCGGTGGTG GAGGAGGTGC TGGGCGCCAA CCTGCCGGTG
CACGCGACCG ACCACGCGCT GGACTGGGAC GCGGGCGACG AGACGACGAT CGCGCTGTCG
GGTGCGTCGG CGCAGGTCAC GGGGAGCGGT GCGAGCGCGC AGGGTGGCAC GGTCACCATC
TCCGCACCGG GGACGTACCG GGTCAGCGGC ACGCTCACCG ACGGGGCGGT CGTCGTGGCG
TCGGCGGGCG AGGGAGTCGT GCGGGTGGTG CTCGACGGGG CGTCGATCAC GTCCTCGACG
ACCGCACCGC TGCAGGTGCA GGACGCCGAC GAGGTCGTGG TCGTGCTCGC CGAGGGCTCG
ACGAACAGCC TCACGGACCC GGCGACGTAC CAGTACCCCG AGGGCCAGGA CGAGCCGAAC
GCGGCGCTGT TCTCGACGGC CGACCTCACC ATCGCGGGCA GCGGGGCGCT GACGGTGACG
GGCAGCGCGA ACGACGGGAT CGCCTCCAAG GACGGCCTCG TGGTCGCGGG TGGCCGGATC
ACGGTGACCG CGGCGGACGA CGCCGTGCGC GGCAAGGACT ACCTCGTGGT CACCGGCGGC
ACGCTCGAGC TCACCGCGGC CGGCGACGGT CTGAAGGCCG ACGACGACAC CCCCGAGGGC
GGCTTCGTGC ACGTCGCGGG CGGCGCCACC CGCGTGACGT CGGGCGACGA CGGCGTGACG
GCGGCGTCCG ACGTGGTGGT CTCGGACGGC TACCTGCAGG TGCGGGCCGG TGGCGGTGCC
GGTGCGGGCG GCGACTCGGG CGCCAAGGGG CTCGTGGGTG ACGTGTCGCT CGTGCTCGGC
GGGGGCTCGC TCGCCGTGGA CGCGATCGAC GACGCCCTGC ACTCCGACGG CACGATCACC
GTCGCCGGGG GCAACGCGAC GCTCGCCACG GCGGGCGACG GCGCGGACGC GGGCGAGCGG
CTGACGATCA CCGGCGGTGC GCTCATCGTC ACGCAGTCCT CGGAGGGCCT CGAGGCGAAG
GTCGTCCAGA TCGCGGGGGG CCTGATCGAG GTCACGGCGG CCGACGACGC GATCAGCGCC
TCGGACCCCT CGCAGCCGGA CGCGATGGGT GCGATCCCGG GTGTCGACGT CGCCGTCAGC
GGCGGCCTGA CCGTGCTGCA CGCCGCGACG GGCGACGGGC TGGACTCCAA CGGCACCGCC
CAGATGAGCG GCGGCACGCT CGTGGTCGAC GGTCCGACGG AGTTCATCAA CAGCGCGGTC
GACACCAACG GCGCCTTCAC GGTGACGGGG GGCACGCTCA TCGGCGTCAG CTCCGCCGGG
CTGCTCGGCA CCCCGACCGT CCAGTCGCCC CAGACGTGGG TGTCGCTCGG CTCGGAGCAG
CCGGCCGGGA CGCTGCTGCA CGTGCTCGCC CCCGACGGCA CCGTGCTCGC GTCGTTCCGC
ACGACGAAGG CCTCGGGCAA CCTGCTCTAC TCCCACGCGT CGCTGCAGCT GGGCTCGCAG
TACCGGCTCG CGGTGGGCGG CACGGCCGAC GGGCCGGTCA CGGGGGGGTT CCACCAGCGG
CCCGGTGACG CGTCCGGTGC CACGGTCGTC GCGACGGCGC AGGCTGCGAC CGCGCCGAGC
GGCTGGGGCG GCGGCGGGGG CTGGCCGCCG CCGGGCGGCG TCCGCCCGCC TGCGGGCTGA
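
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how a GC fraction such as the one in the GC content field could be recomputed from the displayed text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the database uses (for example, how it rounds) is not stated in the record.

```python
def clean_sequence(display_text: str) -> str:
    """Join a sequence shown in spaced blocks and lines into one uppercase string."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# Illustration on the first two blocks of the gene sequence only,
# not on the full 1740 bp gene.
blocks = "ATGCGACGAG GAACCACAGC"
seq = clean_sequence(blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Passing the full gene sequence text through the same two functions gives the value to compare with the reported GC content.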
 
Protein sequence
MRRGTTAAAV AAVLVTGVLG SPARAAEPVV EEVLGANLPV HATDHALDWD AGDETTIALS 
GASAQVTGSG ASAQGGTVTI SAPGTYRVSG TLTDGAVVVA SAGEGVVRVV LDGASITSST
TAPLQVQDAD EVVVVLAEGS TNSLTDPATY QYPEGQDEPN AALFSTADLT IAGSGALTVT
GSANDGIASK DGLVVAGGRI TVTAADDAVR GKDYLVVTGG TLELTAAGDG LKADDDTPEG
GFVHVAGGAT RVTSGDDGVT AASDVVVSDG YLQVRAGGGA GAGGDSGAKG LVGDVSLVLG
GGSLAVDAID DALHSDGTIT VAGGNATLAT AGDGADAGER LTITGGALIV TQSSEGLEAK
VVQIAGGLIE VTAADDAISA SDPSQPDAMG AIPGVDVAVS GGLTVLHAAT GDGLDSNGTA
QMSGGTLVVD GPTEFINSAV DTNGAFTVTG GTLIGVSSAG LLGTPTVQSP QTWVSLGSEQ
PAGTLLHVLA PDGTVLASFR TTKASGNLLY SHASLQLGSQ YRLAVGGTAD GPVTGGFHQR
PGDASGATVV ATAQAATAPS GWGGGGGWPP PGGVRPPAG