Gene Cfla_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1130 
Symbol 
ID9145009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1263227 
End bp1265089 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content71% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003636233 
Protein GI296128983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000175823 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGTC CGCTGCGCTC TCGTACCTCG ACCCACGGTC GCAACATGGC CGGCGCCCGT 
GCGTTGTGGC GTGCCACCGG CATGGGCTCG GAGGACTTCG GCAAGCCGAT CATCGCGATC
GCGAACTCGT ACACGCAGTT CGTCCCCGGC CACGTCCACC TCAAGGACAT GGGCGACCTC
GTCGCGGGGG CGATCCGCGA GGCCGGTGGC GTCTCGAAGG AGTTCAACAC CATCGCCGTC
GACGACGGCA TCGCGATGGG TCACGCCGGC ATGCTCTACT CGCTGCCCAG CCGTGACCTC
ATCGCCGACT CGGTCGAGTA CATGGTGCAG GCGCACTGCG CGGACGCGCT GGTCTGCATC
TCCAACTGCG ACAAGATCAC GCCCGGCATG CTCAACGCGG CCCTGCGGCT GAACATCCCC
GTGATCTTCG TCTCCGGGGG GCCGATGGAG GCCGGCAAGG CGGTCGTCGC CGACGGTGTC
GCGACGACGG CCCTCAACCT CATCAACGCG ATCAACTACT CGGCGGACGA CAACGTCTCC
GACGCCGCGC TGGCGAGCGT CGAGGAGAAC GCGTGCCCCA CGTGCGGCTC GTGCTCCGGC
ATGTTCACGG CCAACTCCAT GAACTGCCTC ACCGAGGCCC TCGGCCTGTC GCTGCCGGGC
AACGGCTCGA CCCTCGCCAC GCACACCGCA CGCCGCGAGC TGTTCCTCGA GGCCGGCCGC
ACGATCGTCG ACCTCGCGCG CCGCTACTAC GACGACGAGG ACGACTCGGT CGCCCCGCGG
TCGATCGCGA CCAGGGCGGC ATTCTCCAAC GCCATGGCGC TGGACGTCGC GATGGGCGGC
TCGACCAACA CCGTGCTGCA CATCCTCGCC GCCGCGCAGG AGGCGGAGGT CGACTTCACG
CTGGCCGACA TCGACGCCAT CAGCCGTCGC GTGCCGTGCC TGGCGAAGGT CGCGCCCAAC
CACCCGGACT ACCACATGGA GGACGTCCAC CGCGCCGGTG GCATCCCCGC GCTGCTGGGC
GAGCTCGACC GCGGCGGCCT GCTCGACCAC GACGTGACGA GCGTGCACAC CCCGACGCTG
CGGGCGTGGC TCGACGACTG GGACATCCGC GGCGGCAAGG CCACGCAGCG CGCGCAGGAC
CTGTTCCTCG CCGCGCCGGG CGGCGTGCGG ACCACGCGGG CGTTCTCGAC CTCGAACGTG
TGGGAGTCCC TCGACACCGA CGCGGCCGGC GGGTGCATCC GCGACGTCGC GCACGCGTAC
ACGGTCGAGG GTGGCCTGGC GGTGCTGCGC GGCAACCTGG CCGAGGACGG CGCGATCATC
AAGACCGCCG GTATCGACCC CGACGTCTTC CACTTCGTCG GCACGGCGCT CGTGTGCGAG
TCGCAGGACG AGGCTGTCGA CAAGATCCTG CGCAAGGAGG TCGAGCCCGG GCACGTCGTC
GTGGTCCGGT ACGAGGGCCC GGCCGGCGGC CCCGGCATGC AGGAGATGCT CTACCCGACG
TCCTTCATCA AGGGCCGCGG CCTGGGCAAG GTCTGCGCGC TCATCACCGA CGGCCGGTTC
TCCGGCGGCT CGTCGGGCAT CTCCGTGGGC CACGTCAGCC CGGAGGCCGC CGCGGGCGGC
ACCATCGGCC TCATCGAGGA CGGCGACGAG ATCGAGATCG ACGTCGAGTC ACGCCTCATT
CGCGTCAACG TGCCCGACGC CGTGCTCGCC GAGCGCCGCG CCAAGATGGA GGCACGCGAG
AACCCGTGGC AGCCGGTCGA CCGTGACCGC TACGTCTCGC CCGCGCTGCA GGCGTACGCG
GCGATGGCGA CCAGCGCGGA CCGCGGCGCG GTGCGCGACG TCAGCCGGAT CCGCCGGGGC
TGA
 
Protein sequence
MSRPLRSRTS THGRNMAGAR ALWRATGMGS EDFGKPIIAI ANSYTQFVPG HVHLKDMGDL 
VAGAIREAGG VSKEFNTIAV DDGIAMGHAG MLYSLPSRDL IADSVEYMVQ AHCADALVCI
SNCDKITPGM LNAALRLNIP VIFVSGGPME AGKAVVADGV ATTALNLINA INYSADDNVS
DAALASVEEN ACPTCGSCSG MFTANSMNCL TEALGLSLPG NGSTLATHTA RRELFLEAGR
TIVDLARRYY DDEDDSVAPR SIATRAAFSN AMALDVAMGG STNTVLHILA AAQEAEVDFT
LADIDAISRR VPCLAKVAPN HPDYHMEDVH RAGGIPALLG ELDRGGLLDH DVTSVHTPTL
RAWLDDWDIR GGKATQRAQD LFLAAPGGVR TTRAFSTSNV WESLDTDAAG GCIRDVAHAY
TVEGGLAVLR GNLAEDGAII KTAGIDPDVF HFVGTALVCE SQDEAVDKIL RKEVEPGHVV
VVRYEGPAGG PGMQEMLYPT SFIKGRGLGK VCALITDGRF SGGSSGISVG HVSPEAAAGG
TIGLIEDGDE IEIDVESRLI RVNVPDAVLA ERRAKMEARE NPWQPVDRDR YVSPALQAYA
AMATSADRGA VRDVSRIRRG