Gene Cfla_1134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1134 
Symbol 
ID9145013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1268924 
End bp1270636 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content72% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003636237 
Protein GI296128987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0498657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00011132 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAGCG CAGGCGGGGA CGACGGCGTG GACATCAAGC CGCGGTCGCG GCAGGTGACC 
GACGGGCTGG AGGCGACGGC GGCGCGCGGC ATGCTGCGCG CGGTCGGGCT GGGGGACGAG
GACTTCGCGA AGCCGCAGGT GGGCGTCGCG TCGTCGTGGA ACGAGATCAC GCCGTGCAAC
CTGTCGCTCG ACCGGCTCGC CAAGGCCGTG AAGGGCGGGG TGCACGCGGC CGGGGGGTAC
CCGCTGGAGT TCGGCACGAT CTCGGTGTCC GACGGCATCT CGATGGGCCA CGAGGGCATG
CACTTCTCGC TCGTGAGCCG GGACATCATC GCGGACTCCG TGGAGACGGT GATGATGGCC
GAGCGCCTCG ACGGCTCGGT CCTCCTGGCC GGGTGCGACA AGTCCTTGCC GGGCATGCTC
ATGGCTGCGG CGCGCCTCGA CCTCGCGAGC GTGTTCCTCT ACGCGGGCTC GATCATGCCG
GGCTGGGTCA AGCTGTCCGA CGGCACCGAG AAGGACGTGA CGATCATCGA CGCGTTCGAG
GCCGTCGGGG CGTGCGCCCG CGGGCTGATG AGCCGCGAGG ACGTCGACCG CATCGAGCGC
GCCATCTGCC CCGGCGAGGG TGCGTGCGGC GGCATGTACA CGGCCAACAC CATGGCGTCG
GTGGCCGAGG CGATGGGCAT GTCGCTGCCC GGCTCGGCCG CGCCGCCGTC GGCCGACCGG
CGGCGCGACC AGTTCGCGCA CCGCTCGGGC GAGGCGGTGG TCGAGCTGCT GCGCCGTGGC
ATCACGGCCC GCCGGATCAT GACCAAGGAA GCGTTCGAGA ACGCGATCGC CGTCGTCATG
GCCTTCGGCG GATCGACCAA CGCCGTGCTG CACCTGCTGG CCATCGCGCA CGAGGCCGAG
GTGGACCTCA CGCTCGAGGA CTTCAGCCGC GTCGCCGCGA AGGTTCCGCA CCTGGGCGAC
CTCAAGCCGT TCGGCCGGTA CGTGATGAAC GACGTCGACC GGGTCGGCGG CGTGCCCGTC
GTCATGAAGG CCCTGCTCGA CGCGGGGCTG CTGCACGGGG ACTGCCTCAC GGTCACGGGG
CGCACGGTCG CCGAGAACCT CGCCGAGATC GCGCCGCCCG ACCCGGACGG CAAGATCCTG
CGGGCGCTCG ACGACCCGAT CCACCGCACG GGCGGCATCA CGATCCTGTC GGGGTCGCTC
GCGCCCGAGG GTGCGGTGGT CAAGTCCGCG GGCTTCGACT CCGACGTGTT CGAGGGCACC
GCGCGCGTCT TCGAGCGTGA GCGAGCCGCG CTCGACGCGC TCGAGGACGG CACGATCCGG
GCGGGCGACG TCGTCGTCAT CCGGTACGAG GGCCCCAAGG GTGGACCGGG CATGCGCGAG
ATGCTCGCCA TCACCGGGGC CATCAAGGGT GCGGGGCTCG GCAAGGACGT GCTCCTCGTG
ACCGACGGGC GGTTCTCGGG CGGCACGACG GGGCTCTGCG TCGGCCACAT CGCGCCCGAG
GCCGTCGACG CCGGGCCGAT CGCGTTCGTC CGGGACGGCG ACCGGGTGCG GCTCGACGTC
GCGCACGCCA CGCTCGACCT GCTCGTCGAC GACGCGGAGC TCGTGGCCCG CCGGGAGGGC
TGGGCGCCGC TCGCGCCGCG GTACACCCGT GGGGTGCTGG GCAAGTACCA GAAGCTCGTG
CAGTCCGCGT CACGCGGCGC GGTGCTCGGG TAA
 
Protein sequence
MTSAGGDDGV DIKPRSRQVT DGLEATAARG MLRAVGLGDE DFAKPQVGVA SSWNEITPCN 
LSLDRLAKAV KGGVHAAGGY PLEFGTISVS DGISMGHEGM HFSLVSRDII ADSVETVMMA
ERLDGSVLLA GCDKSLPGML MAAARLDLAS VFLYAGSIMP GWVKLSDGTE KDVTIIDAFE
AVGACARGLM SREDVDRIER AICPGEGACG GMYTANTMAS VAEAMGMSLP GSAAPPSADR
RRDQFAHRSG EAVVELLRRG ITARRIMTKE AFENAIAVVM AFGGSTNAVL HLLAIAHEAE
VDLTLEDFSR VAAKVPHLGD LKPFGRYVMN DVDRVGGVPV VMKALLDAGL LHGDCLTVTG
RTVAENLAEI APPDPDGKIL RALDDPIHRT GGITILSGSL APEGAVVKSA GFDSDVFEGT
ARVFERERAA LDALEDGTIR AGDVVVIRYE GPKGGPGMRE MLAITGAIKG AGLGKDVLLV
TDGRFSGGTT GLCVGHIAPE AVDAGPIAFV RDGDRVRLDV AHATLDLLVD DAELVARREG
WAPLAPRYTR GVLGKYQKLV QSASRGAVLG