Gene Cfla_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0894 
Symbol 
ID9144768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp974873 
End bp977140 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content70% 
IMG OID 
Productparallel beta-helix repeat protein 
Protein accessionYP_003636002 
Protein GI296128752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAC ACGGTAGACA TGCGATGGCG GAAGCACGGA AGAGCCCGGC AGGGGCGGTC 
CGTCGATGGG GGGTCGGGAC GGCGGTCGTC GCCGTGGTCG TCGGTGTGAG CACCGCGTCC
GCGAGCCCCG CACAGGTCCC GGCGGCCGTT CCCGCCGCCA GCCCCGTCGT GGTCGCCTCG
AGCACCTCGA CGGAGAACAC CGCCCCGCTC GGCGGCGGCT CGGTCTTCAC GACCGTCGCC
CCGACGCGGC TGCTCAACCG CACCGTCCTC GGTCCGGGGT CCACGTACGA CGTGACGATC
CCCAACCTCC CGAACGGCAC CCGCTCGGTG ATCGTCAACG TCTCCGGCTC CGACGCGACG
GTGCCGACCA CGGTCACGGC GTGCACGGGC TCGTCCTGCT CGGTGGCCTC GGCCCTGTCG
CTGCGTCCCG GCATGCCGGC GTCGCGCCAG GTCGTCGCCC CCGTCACCAA CGGTCGGATC
ACGCTGCGCA ACTCGACCGG CCAGGTCGCC GTCCAGCTCG ACCTGTCGGC GTACGTGCGC
CCCTCGAGCG TCCAGGGTGG CGAGGTCTAC GTCCCGACGC CGCAGCGCCG CGTGCTCTCG
TGGCACCTGC TCGGCGCCGC CGCGACGACG ACGGTCCAGC TCACGGACGT CCCGGAGGGG
GCGAAGGCCG TCGTCCTGGA CCTCGGCTAC TCCTCGGCGA CCGCGAAGTC GTACCTGGCG
CTCTGCCCGG CCGACCAGTC GTCGACCACG TGCAACCGCA CCACGACCAT CCAGACGGTC
CCGACGCTCA ACCACTCGAA CGCCGTCGTC GTGCCGATCG ACTCGGCGGG GCGCGTCAAG
GTCTACAACA GCGCCGGCAG CGTGCGGCTG AACGCGGACG TGCAGGGCTG GTACGTCCAG
CGCGGCACCA CCGACGTCGG TGGTGAGCTC GTCGCCTCCT CGGGTGCGGT CGCGACGCGG
CTGTCCGCGG CCGGTGCGAC CCGCACCGTC ACCCTGCCGG ACGTGCCGAA GCACGCGACC
TCGGCCGTCG TGCTCGTCCG CTCGACCTAC GCCGCGCAGG GCACCGGTGT GTGGGCGTGC
CCCGCGGCCG CCGTGTCGGA CGCCTGCAAG GCCGCGTCCG TCATCAACCC GTACCCCGGC
TACATCACGG ACAACGTCGC GTACGTCGAG CTCGGTGGCG CGAACAACGA CCAGGTGACG
CTCGGCGCGA CCCTCGCGGC GACCGACATC ACGGTGTCCC CGCTGGCGTA CACGGTCGTG
ACCCCGGTCG CGACGCCGAC GCCCACCCCG ACGCAGACGC CGACGCCGAC GCCGACGGCC
ACACCGAAGC CGACCGCGTC GCCCACCCCG TCGCCGACGA GCGGTACCGG GACCTCGACG
GGCGCCAAGC CGGGTTCGAC CAACACCGGC GTCCCCGCGG GCACGAGCCT CACGCGTCAC
AACGGCGACA TCGTGGTGAC GCAGGCGGGA ACCGTCATCG AGAACATGGA CATCCACGGC
TTCATCACCG TGCGTGCGCC CGACGTCGTC ATCCGCAAGT CGATCGTCCG CGGCTCCGGC
CCGGGCACGA CCAACATGGG CCTGGTCAAC TGCAACCACA ACGCGTGCAG CAACCTGCTG
GTGGAGGACG TCACCCTCGT CCCCAAGTCG CCCTCGGTCT GGCTCAACGG GGTCTTCGGT
CACGACTACA CCGCCCGGCG CGTCAACACG TACCACGTGG TCGACGGCTT CCAGATCCAC
AACGTCCGCA ACAACGGCGG TCCGGTCAAC GTGGTCATCG AGAACTCCTG GTGCCACGAC
ATGAGCTACT TCGCCAAGGA CCCGAACCAC AACAACACCG AGACCCACAA CGACTGCATC
CAGATCCAGG GTGGGACGAA CATGAGGATC ACGGGGAACA ACCTCGAGTC CTTCATGGCG
ACGCAGGCCG GTGACCAGAC GTACGACGCG CGCAACCGCG GCTCGGCCCT GATGGTCACG
CCCAACGCGG CCCCGGTCTC GAAGGTGACG ATGACGGGCA ACTGGCTCGA CGGTGGGACG
GCCTCGGCGT TCTTCTCGAC GTCGAAGTTC GGTGCCTACA ACTTCGGCAC CTTCAGCGGC
AACATGTTCG GCCGCAACCA GTACGACCAC GGCCGTGGCT CGAAGTACCA GATCCGCATC
GCGAACGACG GCATCTCGTT CGACAAGCCG CTCACCACGA ACATGTGGGC GGACGGGTCG
GGCTACCTGG CCGAGGGGCG TGACGCCGGC ATCCGCTTCG GCTCCTGA
 
Protein sequence
MRRHGRHAMA EARKSPAGAV RRWGVGTAVV AVVVGVSTAS ASPAQVPAAV PAASPVVVAS 
STSTENTAPL GGGSVFTTVA PTRLLNRTVL GPGSTYDVTI PNLPNGTRSV IVNVSGSDAT
VPTTVTACTG SSCSVASALS LRPGMPASRQ VVAPVTNGRI TLRNSTGQVA VQLDLSAYVR
PSSVQGGEVY VPTPQRRVLS WHLLGAAATT TVQLTDVPEG AKAVVLDLGY SSATAKSYLA
LCPADQSSTT CNRTTTIQTV PTLNHSNAVV VPIDSAGRVK VYNSAGSVRL NADVQGWYVQ
RGTTDVGGEL VASSGAVATR LSAAGATRTV TLPDVPKHAT SAVVLVRSTY AAQGTGVWAC
PAAAVSDACK AASVINPYPG YITDNVAYVE LGGANNDQVT LGATLAATDI TVSPLAYTVV
TPVATPTPTP TQTPTPTPTA TPKPTASPTP SPTSGTGTST GAKPGSTNTG VPAGTSLTRH
NGDIVVTQAG TVIENMDIHG FITVRAPDVV IRKSIVRGSG PGTTNMGLVN CNHNACSNLL
VEDVTLVPKS PSVWLNGVFG HDYTARRVNT YHVVDGFQIH NVRNNGGPVN VVIENSWCHD
MSYFAKDPNH NNTETHNDCI QIQGGTNMRI TGNNLESFMA TQAGDQTYDA RNRGSALMVT
PNAAPVSKVT MTGNWLDGGT ASAFFSTSKF GAYNFGTFSG NMFGRNQYDH GRGSKYQIRI
ANDGISFDKP LTTNMWADGS GYLAEGRDAG IRFGS