Gene Cfla_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2101 
Symbol 
ID9145997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2342744 
End bp2344555 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content74% 
IMG OID 
Product2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase 
Protein accessionYP_003637195 
Protein GI296129945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.291221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGA ACGTGCAGCT TCCCGCGCTC GGCGAGTCCG TCACCGAGGG CACCGTCACC 
CGCTGGCTCA AGAACGTCGG TGACACCGTC GAGGTCGACG AGCCCCTGCT CGAGATCTCG
ACCGACAAGG TCGACACCGA GATCCCCTCG CCGGTGGCCG GCGTGCTCGA GCAGATCCTC
GTCCAGGAGG ACGAGACGGT CGAGGTCGGC GCCACGCTCG CCGTCATCGG CTCGGGCGAG
GGCGGCGGTG ACGCCGGCTC GGGCGAGCAG CAGGCACCCG CCGAGGAGCC CGTCGCCGAG
CAGGCACCCG CCGAGGAGCC CGCGGCGGAG CAGTCCGCGC AGCAGCCCGT CGAGGAGCAC
GAGGACGCCC CGGGACCCGC GCCGTCCACA GGTGGCGGTG GCAGTGGCCA GGAGGTCACT
CTCCCGGCGC TCGGCGAGTC CGTCACCGAG GGCACGGTCA CGCGCTGGCT CAAGGCCGTG
GGCGACGAGG TCGCGGTCGA CGAGCCGCTG CTGGAGATCT CCACCGACAA GGTGGACACG
GAGATCCCGT CGCCCGTCGC GGGCACGCTG CAGGAGATCC GCGTCCAGGA GGACGAGACC
GTCGAGGTCG GCGCCGTGCT GGCCGTCGTC GGATCGGGCG ACGCGGCTCC CGCCGCCGAG
CAGCCCGCCG CACCGCAGCA GCCCGAGGAG CAGGCGAGCG AGCCCGCCGC CGAGACCCCG
CAGGGCGCGG CGCAGGAGCC CGCCGGCTAC GAGGCGCCCG CGCCCGAGGC CGAGGCGGCG
CCCGCCGCCG AGCAGCAGGC TCCCGCCGCG CAGGAGGCCG CCGCCGCGCA GCCGACCGCC
ACGCAGACGC CGGCACCGTC CGCGCCCGCG CCGTCCGCCG GTGGCTCGTA CCTCACGCCG
CTCGTGCGCA AGCTCGCGGC CGAGAAGGGC GTCGACGTGT CGACCCTCAC GGGCTCGGGC
GTCGGTGGGC GCATCCGCAA GGAGGACGTG CTCGAGGCCG CGGCCAAGGC CGAGGAGGCC
CGCAAGGCCG CCGCGGCGCC GGCCGCCCCG GCCGCGGCTC CCGCGAAGGC CGCCGCCCCG
GCGCCGGTCT CGCCGCTGCG CGGCACGACC GAGAAGGCCA GCCGACTGCG GCAGATCATC
GCCGAGCGCA TGGTCGAGGC GCTGCACACG CAGGCGCAGC TCACCACGGT GGTCGAGGTC
GACGTCACGA GGATCGCCAA GCTGCGTGCC CGCGCCAAGG CGGACTTCGC GGCACGCGAG
GGCGTCAACC TCACGTACCT GCCGTTCTTC GTCCAGGCCG CGATCGAGGG GCTCAAGACC
TACCCGAAGA TCAACGGCGT GCTCGAGGGC ACCCAGATCA CGTACCACGG GCAGGAGAAC
GTGGCGATCG CGGTCGACAC CGAGCGCGGC CTGCTGACGC CGGTCATCCG CGACGCCGGC
GACCTCAACC TCGCGGGCAT CGCGCGCAAG ATCGCCGACC TCGCCTCGCG CACGCGTGCC
AACAAGGTGA CGCCGGACGA GCTGTCGGGC GCCACGTTCA CCGTGACGAA CACCGGCTCG
GGCGGCGCGA TCATCGACAC CCCGATCGTG CCTGGTGGCA CGTCGGCGAT CCTCGGCACC
GGGGCGATCG TCAAGCGCCC GGTCGTCGTC AAGGGGCCCG ACGGCGACGA GGTCATCGCC
ATCCGGTCGA TGTGCTACCT GTGCCTGTCG TACGACCACC GGCTCGTCGA CGGCGCCGAC
GCGTCGCGCT ACCTCACCGC GGTCAAGAAC CGCCTCGAGG AGGGCGCGTT CGAGGCCGAG
CTCGGCCTCT GA
 
Protein sequence
MSQNVQLPAL GESVTEGTVT RWLKNVGDTV EVDEPLLEIS TDKVDTEIPS PVAGVLEQIL 
VQEDETVEVG ATLAVIGSGE GGGDAGSGEQ QAPAEEPVAE QAPAEEPAAE QSAQQPVEEH
EDAPGPAPST GGGGSGQEVT LPALGESVTE GTVTRWLKAV GDEVAVDEPL LEISTDKVDT
EIPSPVAGTL QEIRVQEDET VEVGAVLAVV GSGDAAPAAE QPAAPQQPEE QASEPAAETP
QGAAQEPAGY EAPAPEAEAA PAAEQQAPAA QEAAAAQPTA TQTPAPSAPA PSAGGSYLTP
LVRKLAAEKG VDVSTLTGSG VGGRIRKEDV LEAAAKAEEA RKAAAAPAAP AAAPAKAAAP
APVSPLRGTT EKASRLRQII AERMVEALHT QAQLTTVVEV DVTRIAKLRA RAKADFAARE
GVNLTYLPFF VQAAIEGLKT YPKINGVLEG TQITYHGQEN VAIAVDTERG LLTPVIRDAG
DLNLAGIARK IADLASRTRA NKVTPDELSG ATFTVTNTGS GGAIIDTPIV PGGTSAILGT
GAIVKRPVVV KGPDGDEVIA IRSMCYLCLS YDHRLVDGAD ASRYLTAVKN RLEEGAFEAE
LGL