Gene Cfla_1980 details

Gene Information

Locus tag: Cfla_1980
Symbol:
ID: 9145875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2203684
End bp: 2205120
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: pyruvate kinase
Protein accession: YP_003637074
Protein GI: 296129824
COG category:
COG ID:
TIGRFAM ID:
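
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, the record does not state its coordinate convention) and that the last codon of the CDS is a stop codon, as the TGA at the end of the gene sequence in this record suggests. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2203684, 2205120)
residues = protein_length_aa(length)
print(length, residues)  # prints "1437 478"
```

Both results agree with the Gene Length and Protein Length fields of the record.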


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.237893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGAG CCAAGATCGT CTGCACCATC GGCCCGGTGA CGGAGTCCCC CGAGCAGGTC 
CAGGCCCTCG TCGACGCGGG CATGGACGTC GCCCGCCTGA ACCGCAGCCA CGGTGACACC
GAGGTGCACA AGCGCGTGTA CGACAACGTG CGCGCCGCGG CGAAGGCCTC GGGCCGTTCG
GTGGCCGTCC TCGTCGACCT CCAGGGCCCC AAGATCCGCC TCGGGCGGTT CGTCGAGGGC
AAGCACGACC TCGCGGTCGG CGACGTCTTC ACCATCACGA CCGACGAGAT CGAGGGCACC
AAGGAGCGCG TCTCGACGAC GTTCAAGGGC CTGCCGGGCG ACGTCAAGCC GGGCGACCCG
ATTCTCATCG ACGACGGCAA GGTCCTCGTG CGGGTCACCG AGGTGAACGG CAACGACGTG
GTGACCCGCG TCGAGGTGCC GGGGCCGGTG TCGAACAACA AGGGCCTCAA CCTGCCGGGC
GTCGCGGTGT CCGTCCCCGC GATGAGCGAG AAGGACGAGG ACGACCTGCG GTGGGCCCTC
AACGTGGGGG CGGACCTCAT CGCGCTCTCG TTCGTCCGCA CCGCGGCCGA CTACGACGAC
GTGCGCCGGA TCATGGAGGA GGAGGGCCGG GTCGTCCCGG TCATCGCCAA GATCGAGAAG
CCGCAGGCCG TCGAGAACCT CTCGGAGATC GTGCAGGCGT TCGACGGCAT CATGGTCGCC
CGCGGCGACC TGGGCGTCGA GCTGCCCCTG GAGCAGGTGC CGCTGGTGCA GAAGCGTGCG
GTCGAGCTCG CGCGCCGCAA CGCCAAGCCC GTCATCGTGG CCACCCAGGT GCTCGAGTCG
ATGATCACGA GCCCGCGCCC GACGCGCGCC GAGGCGTCCG ACTGCGCCAA CGCGGTGCTC
GACGGTGCCG ACGCGGTCAT GCTCTCGGGC GAGACGTCCG TCGGTGACTT CCCGATCGAG
GCCGTGCGCA CGATGGCGCG CATCATCGAG AGCACCGAGG AGCTCGGGCG CGAGCGCATC
GCGCCGCTCG GCTCGGTGCC CTCGACGCGC GGTGGTGCGA TCACGCGTGC CGCCGCCGAG
ATCGGCGAGC GCATCGGCGT GAAGTACCTG GTGACGTTCA CGCAGTCCGG CGACTCGGCG
CGGCGCATGT CGCGGCTGCG CTCGCCGATC CCGCTGCTGG CGTTCACGCC CGAGGAGGAC
GTGCGCAACC GTCTCTCGCT GTCGTGGGGC GTCCAGACGT ACCAGGTGCC GAAGGTCGAG
AGCACGGACT CGATGGTGAG CCAGGTGGAC CACACGCTGC GCGCCAACGG GCTGGCCGAG
GTCGGCGACT ACGTGGTCGT CGTCGCGGGC ACGCCGGTGG GTGTCGTCGG CTCGACCAAC
ACGGTCGTGG TGCACAAGCT CGGTGACGAG GAGGCGGCGC GCACGCGGAT CGCCTGA
 
Protein sequence
MRRAKIVCTI GPVTESPEQV QALVDAGMDV ARLNRSHGDT EVHKRVYDNV RAAAKASGRS 
VAVLVDLQGP KIRLGRFVEG KHDLAVGDVF TITTDEIEGT KERVSTTFKG LPGDVKPGDP
ILIDDGKVLV RVTEVNGNDV VTRVEVPGPV SNNKGLNLPG VAVSVPAMSE KDEDDLRWAL
NVGADLIALS FVRTAADYDD VRRIMEEEGR VVPVIAKIEK PQAVENLSEI VQAFDGIMVA
RGDLGVELPL EQVPLVQKRA VELARRNAKP VIVATQVLES MITSPRPTRA EASDCANAVL
DGADAVMLSG ETSVGDFPIE AVRTMARIIE STEELGRERI APLGSVPSTR GGAITRAAAE
IGERIGVKYL VTFTQSGDSA RRMSRLRSPI PLLAFTPEED VRNRLSLSWG VQTYQVPKVE
STDSMVSQVD HTLRANGLAE VGDYVVVVAG TPVGVVGSTN TVVVHKLGDE EAARTRIA
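
The sequences above are printed in groups of 10 characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of that clean-up together with a GC percentage calculation of the kind summarised in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an ungrouped DNA sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGTAGAG CCAAGATCGT CTGCACCATC GGCCCGGTGA CGGAGTCCCC CGAGCAGGTC"
sequence = clean_sequence(first_line)
print(len(sequence))  # prints 60
print(round(gc_percent(sequence), 1))
```

Applying the same two functions to the full gene sequence block gives a value that can be compared with the GC content field.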