Gene Cfla_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3720 
Symbol 
ID9147636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp4111292 
End bp4112542 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID 
ProductCobyrinic acid ac-diamide synthase 
Protein accessionYP_003638787 
Protein GI296131537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000821515 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000233932 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGCAT CGCCGTGGCG TTCCGAGCCG CAGCAGATCA CCGAGACGCG CGAGGGATCC 
CAGGAAGCTT CGCCCGCGCA GTCCAACGGC GGCGCCGCAG TGTCCGCACC GCCCGTCCAG
GCCGCGACCG ACGCCGGGTC CTCGTCGATT GCCGTGGGGT CGACGCTCGG TGCTGACGAG
CTCCCTGCCG TCGTGGACGA CGAGGCGGGC AGTGTGGCGG ATCTCCCGGG GCGGGGGAGC
GTGGCCGAGC CTACGCGCGG CGACGGCCAG GACGACGCGC ACCGCGCTGC GCTCGTGGAG
AGCCTTCCGC AGCCCACCCC GGACACGCCG CTGCTCGCCG AGCTCCAGAT CGACGCACGT
CGCCGCATCG AGCTGCGGGG CCGCAAGTTC CCGCGTCCCG AGCGCACCCG CGTGATCACG
GTCGCGAACC AGAAGGGAGG CGTCGGCAAG ACGACGACCA CGGTGAACCT GGCGGCAGCA
CTCGCGCAGG CAGGACTCCA GGTCCTCGTC CTCGACAACG ACCCGCAGGG GAACGCATCG
ACCGCCCTCG GCATCGAGCA CAGGGCCGGC ACGCCGTCGA TCTACGAGGT GCTCGTCGAC
GGTGCGCCCA TGCACGCGGC CGTTCAGGAG AGCCCCGATG TGCCGGGGCT GTGGTGCCTG
CCGGCGACGA TCGACCTCTC GGGTGCCGAG ATCGAGCTCG TCTCGATGGT GGCGCGCGAG
ACTCGACTTC GCAGTGCACT CGACTCGTAC CTCGAGTGGC GGGCGGAGCA GGGACTCAGC
CGGATCGACT ACGTCTTCGT CGACTGCCCG CCGAGTCTCG GGCTCCTCAC CGTCAACGCG
TTCGTCGTCG CCCGCGAGGT CCTCATCCCG ATCCAGTGCG AGTACTACGC GCTCGAAGGG
CTGTCCCAGC TCCTCAAGAC GATCGAGCTC ATCCAGGCGC ACCTCAACCC CGAGCTGACC
GTATCGACGA TCCTGCTCAC GATGTACGAC GCGCGGACCA ACCTGGCGCA GCAGGTGGCC
GAGGAGGTCC GGACGCACTT CCCCGAGCGC ACCCTGCGCA CCACCGTGCC GCGGTCCGTC
CGGATCTCCG AGGCGCCCAG CTACGGGCAG ACCGTCATGA CCTACGACGG CGGCTCCTCC
GGGGCCCTCG CCTACCTCGA GGCTGCTCGC GAGCTCGCGG AGCGTGCCCT TCCCCCCACC
ACCCCCACCC CCGCCGGCAC ACCGAGCGGC CCTGTTCAGG AGGACCAGTG A
 
Protein sequence
MGASPWRSEP QQITETREGS QEASPAQSNG GAAVSAPPVQ AATDAGSSSI AVGSTLGADE 
LPAVVDDEAG SVADLPGRGS VAEPTRGDGQ DDAHRAALVE SLPQPTPDTP LLAELQIDAR
RRIELRGRKF PRPERTRVIT VANQKGGVGK TTTTVNLAAA LAQAGLQVLV LDNDPQGNAS
TALGIEHRAG TPSIYEVLVD GAPMHAAVQE SPDVPGLWCL PATIDLSGAE IELVSMVARE
TRLRSALDSY LEWRAEQGLS RIDYVFVDCP PSLGLLTVNA FVVAREVLIP IQCEYYALEG
LSQLLKTIEL IQAHLNPELT VSTILLTMYD ARTNLAQQVA EEVRTHFPER TLRTTVPRSV
RISEAPSYGQ TVMTYDGGSS GALAYLEAAR ELAERALPPT TPTPAGTPSG PVQEDQ