Gene Cfla_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1070 
Symbol 
ID9144946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1184527 
End bp1186155 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content69% 
IMG OID 
ProductATP synthase F1, alpha subunit 
Protein accessionYP_003636174 
Protein GI296128924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC TGACGATCCG GCCGGAGGAC ATCCGCTCCG CGCTGGACAG CTTCGTGAAG 
TCCTACGAGC CGAAGGGCCC GGTGACCGAG GAGGTCGGTC GGGTCACGCT CGCCGCCGAC
GGCATCGCGC AGGTCGAGGG CCTGCCCGGC GCGATGGCCA ACGAGCTGCT GAAGTTCGAG
GACGGCACGC TCGGCCTGGC GCTCAACCTC GACGTGCGCG AGATCGGTGT CGTCGTCCTC
GGCGAGTTCA CCGGCATCGA GGAGGGCCAG GAGGTCCGCC GCACGGGTGA GGTCCTCTCG
GTCGCCGTCG GTGACGGCTA CCTCGGTCGT GTCGTCGACC CGCTGGGCCA GCCGATCGAC
GGCCTGGGCG AGGTCGCCAC CGAGGCGCGT CGCGCGCTGG AGCTGCAGGC CCCCGGCGTC
ATGGCCCGCA AGTCGGTCCA CGAGCCGCTG CAGACCGGCC TCAAGGCCAT CGACTCGATG
ATCCCGATCG GCCGCGGTCA GCGTCAGCTC ATCATCGGCG ACCGTCAGAC CGGCAAGACG
GCGATCGCGA TCGACACGAT CATCAACCAG AAGGCCAACT GGGACAGCGG CGACCCGACC
AAGCAGGTGC GCTGCATCTA CGTCGCGATC GGCCAGAAGG GCTCGACCAT CGCCTCGGTG
CGCTCCGCGC TCGAGGAGGC CGGTGCGCTC GAGTACACGA CCATCGTCGC GGCCCCCGCC
TCCGACCCGG CCGGCTTCAA GTACCTCGCG CCCTACACCG GCTCGGCCAT CGGGCAGCAC
TGGATGTACC AGGGCAAGCA CGTCCTCATC GTGTTCGACG ACCTGTCGAA GCAGGCCGAG
GCGTACCGCG CCGTGTCGCT GCTGCTGCGC CGCCCGCCGG GCCGCGAGGC GTACCCCGGT
GACGTCTTCT ACCTGCACTC CCGTCTGCTC GAGCGCTGCG CCAAGCTCTC GGACGAGCTG
GGCGCGGGCT CGATGACGGG CCTGCCGGTC ATCGAGACCA AGGCCAACGA CGTCTCGGCG
TACATCCCGA CCAACGTCAT CTCGATCACC GACGGCCAGA TCTTCCTGCA GTCGGACCTG
TTCAACGCCG ACCAGCGCCC CGCCGTCGAC GTCGGCATCT CGGTGTCCCG CGTCGGTGGT
GCCGCGCAGG TCAAGGCGAT GAAGCAGGTC TCCGGCACGC TGAAGCTCGA CCTCGCGCAG
TACCGCTCGC TCGAGGCGTT CGCGATGTTC GCGTCCGACC TCGACGCGGC GTCGCGCGCG
CAGCTGACGC GTGGTGCGCG CCTCATGGAG CTGCTCAAGC AGGGCCAGTA CTCGCCGTAC
CCGGTCGAGA ACCAGGTCGC CTCCATCTGG GCCGGCACCA AGGGCAAGCT GGACGACGTC
CCGATCGAGG ACGTCCGCCG CTTCGAGACC GAGCTGCTCG ACCACCTGCG CCGCAACACC
GACGTGCTGT CGACGATCGC CGAGACCGGC AAGCTCGACG AGCAGACGGA GAAGGCGCTC
GGCGACGCGA TCGACGAGTT CCGCGAGGGC TTCCTGAAGT TCGACGGCAG CCCGCTCGTG
GGCACCGCGG ACGAGGACGT CGACGTCGAG GTCGAGCAGG AGCAGATCGT CCGCCAGAAG
CGGGCCTGA
 
Protein sequence
MAELTIRPED IRSALDSFVK SYEPKGPVTE EVGRVTLAAD GIAQVEGLPG AMANELLKFE 
DGTLGLALNL DVREIGVVVL GEFTGIEEGQ EVRRTGEVLS VAVGDGYLGR VVDPLGQPID
GLGEVATEAR RALELQAPGV MARKSVHEPL QTGLKAIDSM IPIGRGQRQL IIGDRQTGKT
AIAIDTIINQ KANWDSGDPT KQVRCIYVAI GQKGSTIASV RSALEEAGAL EYTTIVAAPA
SDPAGFKYLA PYTGSAIGQH WMYQGKHVLI VFDDLSKQAE AYRAVSLLLR RPPGREAYPG
DVFYLHSRLL ERCAKLSDEL GAGSMTGLPV IETKANDVSA YIPTNVISIT DGQIFLQSDL
FNADQRPAVD VGISVSRVGG AAQVKAMKQV SGTLKLDLAQ YRSLEAFAMF ASDLDAASRA
QLTRGARLME LLKQGQYSPY PVENQVASIW AGTKGKLDDV PIEDVRRFET ELLDHLRRNT
DVLSTIAETG KLDEQTEKAL GDAIDEFREG FLKFDGSPLV GTADEDVDVE VEQEQIVRQK
RA