Gene Cfla_1501 details

Gene Information

Locus tag: Cfla_1501
Symbol:
ID: 9145387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1664384
End bp: 1665544
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
Protein accession: YP_003636598
Protein GI: 296129348
COG category:
COG ID:
TIGRFAM ID:
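
The length fields in this record agree with each other under two common assumptions for a complete bacterial CDS: the start and end coordinates are 1-based and inclusive, and the gene length counts one terminal stop codon that the protein length does not. A minimal Python sketch of that check follows; the function names are illustrative and are not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = span_length(1664384, 1665544)
protein_length = expected_protein_length(gene_length)

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

If either assumption did not hold (for example, a partial CDS without a stop codon), the two computed values would not be expected to match the record.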


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCC CGATCAGCCT CGGCATGCCG CAGGCGCCTG CCCCCGTGCT GGCCCCGCGG 
CGAGCGTCCC GCAAGATCCG CGTCGGCAAG GTCGAGGTCG GGGGCGACGC CCCGGTCTCG
GTGCAGTCGA TGACGACGAC CCCGACGACC GACGTCAACC GCACGCTGCA GCAGATCGCC
GAGCTGACGG CCTCCGGCTG CGACATCGTG CGCGTGGCCG TGCCGAGCCA GGACGACGCC
GAGGCGCTGC CTGCGATCGC GCGCAAGTCC CAGATCCCGG TGATCGCGGA CATCCACTTC
CAGCCGAAGT ACGTCTTCGC GGCCATCGAC GCCGGTTGCG CCGCGGTGCG GGTGAACCCT
GGCAACATCC GCAAGTTCGA CGACCAGGTC AAGGAGATCG CGCAGGCCGC CACCGACGCC
GGGGTCTCGA TCCGGATCGG CGTCAACGCC GGCTCGCTCG ACCCCCGCCT GCTCGCCAAG
TACGGCAAGG CGACGCCCGA GGCGCTCGTC GAGTCGGCCG TGTGGGAGGC GTCCCTGTTC
GAGGAGCACG GCTTCCGCGA CTTCAAGATC AGCGTCAAGC ACAACGACCC GGTCGTGATG
GTGCGCGCCT ACGAGCTGCT CGCCGAGCGG GGCGACTGGC CGCTGCACCT CGGTGTGACG
GAGGCGGGCC CGGCGTTCCA GGGCACCATC AAGTCGGCGA CGGCCTTCGG GGCCCTGCTG
AGCAAGGGCA TCGGCGACAC CATCCGCGTG TCCCTGTCGG CTCCTCCCGT CGAGGAGGTC
AAGGTCGGCA TCCAGATCCT GCAGTCGCTG AACCTGCGCC CGCGCAAGCT CGAGATCGTG
TCGTGCCCCT CGTGCGGGCG TGCTCAGGTC GACGTCTACA CGCTCGCCGA GAAGGTCACC
GCCGGGCTCG AGGGCATGGA GGTGCCGCTG CGCGTCGCGG TCATGGGGTG CGTCGTCAAC
GGGCCGGGTG AGGCGCGCGA GGCCGACCTC GGCGTCGCCT CCGGCAACGG CAAGGGCCAG
ATCTTCGTCA AGGGCGAGGT CGTCAAGACC GTGCCCGAGT CGATGATCGT CGAGACCCTC
ATCGAGGAGG CCATGCGCCT CGCCGAGACC ATGGACCCCG TCGAGGCGGG CGAGGGCGCG
CCCGTGGTGA GCGTCGGCTG A
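
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to clean such text and compute GC content in Python. The helper names are hypothetical, and the example input is only the final 21-base row of the sequence, so its result is for that fragment and is not a recomputation of the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final row of the gene sequence; 15 of its 21 bases are G or C.
fragment = clean_sequence("CCCGTGGTGA GCGTCGGCTG A")
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field.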
 
Protein sequence
MSTPISLGMP QAPAPVLAPR RASRKIRVGK VEVGGDAPVS VQSMTTTPTT DVNRTLQQIA 
ELTASGCDIV RVAVPSQDDA EALPAIARKS QIPVIADIHF QPKYVFAAID AGCAAVRVNP
GNIRKFDDQV KEIAQAATDA GVSIRIGVNA GSLDPRLLAK YGKATPEALV ESAVWEASLF
EEHGFRDFKI SVKHNDPVVM VRAYELLAER GDWPLHLGVT EAGPAFQGTI KSATAFGALL
SKGIGDTIRV SLSAPPVEEV KVGIQILQSL NLRPRKLEIV SCPSCGRAQV DVYTLAEKVT
AGLEGMEVPL RVAVMGCVVN GPGEAREADL GVASGNGKGQ IFVKGEVVKT VPESMIVETL
IEEAMRLAET MDPVEAGEGA PVVSVG
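
The gene starts with GTG while the protein starts with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is one of the permitted initiation codons and is read as methionine at the start position. The following Python sketch illustrates this on the first and last rows of the gene sequence. It hard-codes the codon table rather than using a library, and `translate_cds` is a hypothetical helper, not the tool that produced this record.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; an initiator in the first position is read as M.

    Translation stops at the first stop codon. Trailing bases that do not
    form a full codon are ignored. A non-ACGT codon raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "GTGAGCACCC CGATCAGCCT CGGCATGCCG CAGGCGCCTG CCCCCGTGCT GGCCCCGCGG"
last_row = "CCCGTGGTGA GCGTCGGCTG A"

print(translate_cds(first_row))  # MSTPISLGMPQAPAPVLAPR
print(translate_cds(last_row))   # PVVSVG
```

Both outputs match the corresponding ends of the protein sequence listed here; the last row ends in the stop codon TGA, which is not represented in the protein.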