Gene Cfla_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3026 
Symbol 
ID9146938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3358042 
End bp3359343 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID 
ProductPeptidase M23 
Protein accessionYP_003638108 
Protein GI296130858 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTTC GTCGCGCTGC AGTGACCGTC CTGGCCGCCG CCGTACTCAC CACGATGCTC 
GCCCCCGTCG CCACCGCGGC GCCGGCCGAC TTCGAGCTCC CGTTCCCGTG CGGGGAGACC
TGGAAGGGCC AGACCCGTGC CAACCACCCG ACGGGCTCGG GGACGTCGCT GGCGCTCGAC
CTCAACTGGG GCAGCGGCGC AGCCGACAAG GGACGTGCCG TCGTCGCGAG CGCGTCCGGC
TCGGTGCGGC TGACCGGGGG CTCCCTCGGC ACGGTCGTCA TCGACCACCC GGGCGGGTGG
GAGACGCGGT ACCTGCACAT GACGGACATC GCCGTCCGGT CCGGCCAGTC GGTGAGCAGA
GGCCAGCTCG TCGGGCGGGT CGGGGACGTC GGCTCCCCCG GCAGCTACCA CCTGCACTAC
GAGCAGCGTC TGCACGGTGG TCTGCGGGCC ATCACGTTCG GTGGTTCGCC GATCTCCTAC
TCGGCCAGCT ACACGAGCGC CAACTGCCAC AACAGCGCGC CGCAGCCTGC ACCGGCGCCG
CGGCCCGCCG GGACCGTCTA CGAGGCAGGG TTCACCAACG GATGGGCCAA CCTCGACCCG
GGGACGATCT CCGGCGCCAC CGACGTCGCC GCGATGACGG TGAACGGTGT GAAGTACATC
TACTCGATCA TCGGCGGCAC CGTGTACGAG GCGGCGAGCG ACAACGGCTG GCGCAACCTG
TCGACCACCA TCCCCGCCGA TGCGGTGGCC GTCACCCACA GCGGGGGTGT CAAGCTGGTC
TACACCCTGA AGGACGGGGT CGCGTACGAG GCCGCGAGCA ACAACGGGTG GAAGTCGTTG
CCGCTCGGCA CGATCGCGGG TGCGACGGAC ATCGCCGCGA CGACGCTCGG CAACGACCGC
CTCGTCTACA CCCTGATCAA CGGCTCGGCG CACGAGGCGG CGTTCAGCGG CGGCTGGCGG
AACCTGCCGC TGGGGACGGT TCACGGCGCG TCGCGGATCG CGGCGATGAG CGTCGACGGC
GTGAAGCTCG TCTACACCCT GCAGAACGGC ACCGTCCACG AGGCTGCCAG CAACAACGGC
TGGAAGAACC TCTCGACCGG CACCGTCACC GGCGCGTCCG AGCTCGCGGT GGCCGGCAGC
GGCAGCCTGA AGCTCGTCTA CACGCTCGTC GGCGGCGCAC CGTACGAGGC GGCCAGCGAC
AACGGGTGGC GCAACCTGTG GACCGGGAAC GCCGGCGGCG CCACGGGGCT CGCGGCAGCC
ACCGACAGCG CCGGCGGGCG GTTCGTCTAC ACGATCCGCT GA
 
Protein sequence
MRLRRAAVTV LAAAVLTTML APVATAAPAD FELPFPCGET WKGQTRANHP TGSGTSLALD 
LNWGSGAADK GRAVVASASG SVRLTGGSLG TVVIDHPGGW ETRYLHMTDI AVRSGQSVSR
GQLVGRVGDV GSPGSYHLHY EQRLHGGLRA ITFGGSPISY SASYTSANCH NSAPQPAPAP
RPAGTVYEAG FTNGWANLDP GTISGATDVA AMTVNGVKYI YSIIGGTVYE AASDNGWRNL
STTIPADAVA VTHSGGVKLV YTLKDGVAYE AASNNGWKSL PLGTIAGATD IAATTLGNDR
LVYTLINGSA HEAAFSGGWR NLPLGTVHGA SRIAAMSVDG VKLVYTLQNG TVHEAASNNG
WKNLSTGTVT GASELAVAGS GSLKLVYTLV GGAPYEAASD NGWRNLWTGN AGGATGLAAA
TDSAGGRFVY TIR