Gene Cfla_1745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1745 
Symbol 
ID9145634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1943646 
End bp1944821 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content71% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003636841 
Protein GI296129591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.298658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00455743 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCGCAA CGCCCGAACC GACGGGGGCG CTCGTCACCG TCATCGTCCC GACGTTCAAC 
GAGGCCCCGA ACGTGGCCGA GCTGGTGCGC CGCGTGGGTG CGGCGACGCG CGGCCTCGGG
GTCGAGATGC TGTTCGTGGA CGACTCGACG GACGACACCG CCGACGTCGT GCGGGCCGTG
GCCCCCACGG CTGAGCTGCC CGTACGCGTG ATCCACCGTG ACGACCCGGT CGGCGGCCTC
GGCGGCGCCG TGCTCGAGGG TGTGCGGGCC TCGTCGACGC CGTACTTCCT CGTGATGGAC
GGTGACCTGC AGCACCCGCC CGAGCTCATC CCGAGCCTCG TCGCGCGGGT CCAGGAGGTC
GACGTGGACG TCGTCGTCGC GTCGCGCTAC ATCGGTGACG GGTCCAGCGC GGGGCTCTCC
GGCGCCGTGC GCCAGGCCGT CTCCTCGACG TCGACCGCCG TGACCCGCGC CATGTTCCCC
GTCCGGCTGC GTGACTGCTC CGACCCGATG ACCGGGTTCT TCGCGGTGCG CAGGGCGGCC
GTCGACCTCG ACTCGCTGCG CCCGCGCGGC TTCAAGATCC TGCTCGAGAT CCTCGCGCGC
CACCCCATGC GCGTCGTCGA GGTCCCGTTC GTGTTCGGCT CGCGCTACGC CGGGGAGTCC
AAGGCGAACC TCGCCCAGGG CATCCACTTC ATGTGGCAGC TCGCCGGCCT GCGGTTCGGT
CGCATGTCGC GGTTCGCGAT CATCGGCGGC ATGGGCGCGG TCGCGAACAT CGCGATCGTG
TGGCTGCTGA CGAGGTTGGG GGCGCCCTGG CTCCTCGCCG CGATCGTCGC CGCCGAGCTC
ACCATCGTCG GGAACTTCCT GCTGCAGGAG CGCTTCGTCT TCCGGGACCT CCGTCACGAG
GGCAAGGGTG TCTGGGCGAG GTTCGGGCAG TCCTTCACGT TCAACAACGT GGAGACGCTC
GTCCGCATGC CGGTCATGGC GCTGCTCGTC GAGACGATGC ACGTCGCCGC CGTCCTGGCC
ACGGCCATCA CGATCGCGAT CGCGTTCGTC GTCCGGTTCA CGTTCCACTC GCGGATCGTC
TACCGCCCGC GCCAGTCGAG CGTGCGGGCC CACCTCGTCG CGCGAGAGGC GGACAACGCC
GAGCCACCGC CCCTGCCGCG CGCGGAGACC GTCTGA
 
Protein sequence
MTATPEPTGA LVTVIVPTFN EAPNVAELVR RVGAATRGLG VEMLFVDDST DDTADVVRAV 
APTAELPVRV IHRDDPVGGL GGAVLEGVRA SSTPYFLVMD GDLQHPPELI PSLVARVQEV
DVDVVVASRY IGDGSSAGLS GAVRQAVSST STAVTRAMFP VRLRDCSDPM TGFFAVRRAA
VDLDSLRPRG FKILLEILAR HPMRVVEVPF VFGSRYAGES KANLAQGIHF MWQLAGLRFG
RMSRFAIIGG MGAVANIAIV WLLTRLGAPW LLAAIVAAEL TIVGNFLLQE RFVFRDLRHE
GKGVWARFGQ SFTFNNVETL VRMPVMALLV ETMHVAAVLA TAITIAIAFV VRFTFHSRIV
YRPRQSSVRA HLVAREADNA EPPPLPRAET V