Gene Cfla_1824 details


Gene Information

Locus tag: Cfla_1824
Symbol:
ID: 9145717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2035278
End bp: 2036507
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: Type II secretion system F domain protein
Protein accession: YP_003636920
Protein GI: 296129670
COG category:
COG ID:
TIGRFAM ID:
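
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2035278, 2036507)
print(length)                  # 1230
print(protein_length(length))  # 409
```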


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.608306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.597079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCCG GGACGAAGAC CTTCGAGTAC GCGGTCCGTG ACCGGTCGGG GAAGATCGTC 
AAGGGCCGCG TCGAGGCGAA CAACCAGGCC GCCGTCGCCA ACCGGCTGCG CGAGATGGGC
CTTGCGGCCG TCTCGATCTC GGAGGTCTCC ACGAGCGGCC TGCAGACCGA GTTCACGATC
CCCGGGCTGT CCAACAGGAT CTCCCTGAAG GACATCGCCA TCATGTCGCG CCAGCTGGCG
ACCATGATCG ACTCCGGCCT GTCCCTGCTG CGCGCGCTGG CGATCCTCGT CGAGCAGACC
GAGTCCAAGC CGCTCGCGAA GATCCTGTCG CAGGTCCGCA ACGACGTCGA GGTCGGGACC
GCGTTCTCGA CGGCGCTGGG CAAGCACCCG GAGACGTTCC CGCCGCTGAT GGTCAACATG
GTGCGCGCCG GCGAGGTCGG CGGCTTCCTC GACCAGACGC TCGTGTCCAT CGCCGACAAC
TTCGAGACCG AGGTCCGGCT GCGGGCCAAG ATCAAGTCCG CGATGGCCTA CCCCGTCATC
GTCCTGGTGA TCGCCGTCCT GGCCGTGGTG GGCATGCTGC TGTTCATCGT GCCGGTCTTC
GCCGAGATGT TCGCCGGGCT CGGCGGCGAG CTGCCGGGGC CGACCAAGTT CCTCATGTTC
CTGTCGGGCA TGCTGAAGTG GACGATCGGT CCCACGGTCG TCCTGCTGGT GCTGGCGGGC
GTGTGGTGGG GCAAGCACAA GAACGACAGG GCCCTGCGCG AGCGGATCGA CCCGCTGAAG
CTCAAGGTGC CGGTCTTCGG GCCGCTGTTC CGCAAGATCG CGGTGTCCCG GTTCACGCGC
AACTTCGGGA CGATGATCCA CGCGGGCGTC CCGCTGCTCC AGGCCCTGGA GATCGTCGGC
GAGGCCAGCG GGAACATCGT CATCGAACGC GCGGCCAAGG CCGTGCAGGA GTCCGTGCGG
CGCGGTGAGT CGCTGGCGGG GCCGCTGTCG CAGCACCCGG TCTTCCCGCC GATGGTCGTG
CAGATGATGG CGGTCGGTGA GGACACCGGC GCGCTGGACA CCATGCTCGG GAAGGTCGCC
GACTTCTACG ACCAGGAGGT CGAGGCGATG ACCGAGCAGC TCACGAGCCT CATCGAGCCG
CTCATGATCG TCGTCATCGG CGCGATCGTC GGCTTCATGG TGATCTCCAT GTACATGCCG
ATCTTCGGCG TCTTCGACCT CATCCAGTAG
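
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base grouped layout shown above; `gc_percent` is a hypothetical helper, and the example only uses the first two blocks of the sequence rather than asserting the record's 67% figure.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence: 14 of 20 bases are G or C.
print(gc_percent("ATGGCCGCCG GGACGAAGAC"))  # 70.0
```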
 
Protein sequence
MAAGTKTFEY AVRDRSGKIV KGRVEANNQA AVANRLREMG LAAVSISEVS TSGLQTEFTI 
PGLSNRISLK DIAIMSRQLA TMIDSGLSLL RALAILVEQT ESKPLAKILS QVRNDVEVGT
AFSTALGKHP ETFPPLMVNM VRAGEVGGFL DQTLVSIADN FETEVRLRAK IKSAMAYPVI
VLVIAVLAVV GMLLFIVPVF AEMFAGLGGE LPGPTKFLMF LSGMLKWTIG PTVVLLVLAG
VWWGKHKNDR ALRERIDPLK LKVPVFGPLF RKIAVSRFTR NFGTMIHAGV PLLQALEIVG
EASGNIVIER AAKAVQESVR RGESLAGPLS QHPVFPPMVV QMMAVGEDTG ALDTMLGKVA
DFYDQEVEAM TEQLTSLIEP LMIVVIGAIV GFMVISMYMP IFGVFDLIQ
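
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below assumes the compact codon string of the standard genetic code; translation table 11 assigns the same amino acids to codons and differs in the start codons it permits, which does not matter here because the gene begins with ATG. `translate` is a hypothetical helper that stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same codon
# assignments as translation table 11). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGCCGCCG GGACGAAGAC CTTCGAGTAC"))  # MAAGTKTFEY
```

Running the same function on the full gene sequence and comparing the result with the listed protein sequence is a quick way to check that the two blocks belong together.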