Gene Cfla_1822 details


Gene Information

Locus tag: Cfla_1822
Symbol:
ID: 9145715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2031987
End bp: 2033660
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: type II secretion system protein E
Protein accession: YP_003636918
Protein GI: 296129668
COG category:
COG ID:
TIGRFAM ID:
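
The length fields above follow from the coordinates. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2031987
END_BP = 2033660


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1674 557
```

Both results agree with the Gene Length and Protein Length fields of the record.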


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0186286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.89431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCAGC TCGGGGAGAT CCTGCTCGAC GAGGGGCTCG TCACGGAGGC GCAGCTCCTG 
GCGGCACTGG ACGAACAGAC CAGCCTGGGG ACGTCGCTCG GGCGGACGCT CGTCGAGCTC
GGCATCCTCA CCGAGGCGCA GCTGGTCCGG GCGCTCGCCG CGCAGGTGGG CATGGAGTTC
GTCGACCTCG ACGAGTACCC GGTGGACCGG ACCGCGGTCG CCCTGGTGCC CGGTGCCCTG
TGCCGGCGGC ACTCGGTGCT CCCGGTCGGT GTGCGCAACG GTGCGCTCGT GCTCGCCACG
CCGGACCCGG GCAACGTCGT CGCCGTCGAC GACGTCCGCA CGATCTCGGG CATGACCGTG
ATCTCGGTCG TCGCGACGCA CGACAACGTC CTGCGAGCCA TCGACCGGTA CTGCCGGGCC
GACGGCGAGA TGGAGGACCT CACCAACGCC TTCGAGGAGT CGCAGGAAGC CGAGGTCGAC
CTCTCGGCAC GCATGGGGGA CGTCCTCGAC GACGAGGCGC CGATCGTGCG GTTCGTCAAC
CTGCTGGTGA CGCAGGCGAT CACGGACCGC GCGTCGGACA TCCACATCGA GCCCAGCGAG
CACGACCTGC GCGTGCGCTA CCGCATCGAC GGTGTCCTGC ACGAGACGCA GCGGGCGCCG
AAGAACGTCA CCGGCGGCGT CGTCAGCCGC GTGAAGATCA TGAGCGACAT CGACATCGCG
GAGAAGCGCA AGCCGCAGGA CGGCCGGATG TCGGTCATGC ACAACGGGCG CAAGATCGAC
CTCCGTGTCG CGACCCTGCC GACGGTGTGG GGCGAGAAGA TCGTCATGCG CATCCTCGAC
AACTCCACGG CGAGCCTGGA CCTGCGTGAC CTGTCGTTCC TCGAGCACAA CTACGCGACG
TACAAGGAGT CGTACACCAA GCCGTACGGC ATGATCCTCG TCACGGGGCC CACGGGTTCG
GGGAAGTCCA CGACGCTGTA CGCGACGCTC AACGCCGTCT CCAAGCCGGA CATCAACGTC
ATCACCGTCG AGGACCCGGT CGAGTACCGG CTCGCGGGCA TCAACCAGGT GCAGGTGAAC
CCCAAGGCGG GTCTGACGTT CGCCGCGGCC CTGCGCTCGA TCCTGCGTTC GGACCCCGAC
GTCGTGCTCC TCGGTGAGAT CCGCGACCAC GAGACCGCGC AGATCGCGGT CGAGGCCGCC
CTCACCGGGC ACCTCGTGCT CTCGACGCTG CACACGAACG ACGCGCCCTC GGCGGTGACC
CGCCTGACGG AGATGGGTAT CGAGCCCTTC CTCGTGGGTT CGGCGCTCGA CTGCGTGGTC
GCGCAGCGGC TCGCGCGACG GCTCTGCCCG AAGTGCAAGG AGGCGTACCG CCCGACCCCG
CGGGAGCTGG AGGCCGCGCG CTTCCCGTGG GTCGAGGGCG AGCAGCTCCC CGAGTTCTTC
CGTCCGGCGG GGTGCGCGGC GTGCTCGCGC ACCGGGTACA AGGGGCGCCT CGCGCTGCAC
GAGGTGATGC GGGTCACCGA GGACATCGAG CGTCACGCCG TCGCTCACTC GTCGTCGGCC
GACATCGGGG CGACCGCGGT CAAGCAGGGG ATGATCACGC TGCGCGACGA CGGGTGGCAG
AAGGTGGCGT CCGGCCTGAC GTCGATCGAG GAGATCCTGC GCGTCGTGGC GTGA
 
Protein sequence
MKQLGEILLD EGLVTEAQLL AALDEQTSLG TSLGRTLVEL GILTEAQLVR ALAAQVGMEF 
VDLDEYPVDR TAVALVPGAL CRRHSVLPVG VRNGALVLAT PDPGNVVAVD DVRTISGMTV
ISVVATHDNV LRAIDRYCRA DGEMEDLTNA FEESQEAEVD LSARMGDVLD DEAPIVRFVN
LLVTQAITDR ASDIHIEPSE HDLRVRYRID GVLHETQRAP KNVTGGVVSR VKIMSDIDIA
EKRKPQDGRM SVMHNGRKID LRVATLPTVW GEKIVMRILD NSTASLDLRD LSFLEHNYAT
YKESYTKPYG MILVTGPTGS GKSTTLYATL NAVSKPDINV ITVEDPVEYR LAGINQVQVN
PKAGLTFAAA LRSILRSDPD VVLLGEIRDH ETAQIAVEAA LTGHLVLSTL HTNDAPSAVT
RLTEMGIEPF LVGSALDCVV AQRLARRLCP KCKEAYRPTP RELEAARFPW VEGEQLPEFF
RPAGCAACSR TGYKGRLALH EVMRVTEDIE RHAVAHSSSA DIGATAVKQG MITLRDDGWQ
KVASGLTSIE EILRVVA
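
The gene sequence starts with GTG while the protein starts with M. Under translation table 11, GTG is an accepted alternative start codon and is decoded as methionine at the initiation position. The sketch below illustrates this on the first 30 bases only. It uses a deliberately partial codon map covering just the codons in that stretch, and the function name is hypothetical; it is not a general translator.

```python
# Partial codon map: only the codons that occur in the first 30 bases
# of the gene sequence above. A full translator would need all 64 codons.
CODONS = {
    "GTG": "V", "AAG": "K", "CAG": "Q", "CTC": "L", "GGG": "G",
    "GAG": "E", "ATC": "I", "CTG": "L", "GAC": "D",
}


def translate_prefix(dna: str) -> str:
    """Translate a short CDS prefix, decoding the first codon as Met.

    Assumes the first codon is a valid start codon for the genetic
    code in use (GTG is one under translation table 11).
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODONS[dna[i:i + 3]]
        if i == 0:
            residue = "M"  # initiator codon is read as methionine
        residues.append(residue)
    return "".join(residues)


first_bases = "GTGAAGCAGC TCGGGGAGAT CCTGCTCGAC"
print(translate_prefix(first_bases))  # MKQLGEILLD
```

The output matches the first ten residues of the protein sequence listed above.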