Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1822 |
Symbol | |
ID | 9145715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2031987 |
End bp | 2033660 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003636918 |
Protein GI | 296129668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0186286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.89431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCAGC TCGGGGAGAT CCTGCTCGAC GAGGGGCTCG TCACGGAGGC GCAGCTCCTG GCGGCACTGG ACGAACAGAC CAGCCTGGGG ACGTCGCTCG GGCGGACGCT CGTCGAGCTC GGCATCCTCA CCGAGGCGCA GCTGGTCCGG GCGCTCGCCG CGCAGGTGGG CATGGAGTTC GTCGACCTCG ACGAGTACCC GGTGGACCGG ACCGCGGTCG CCCTGGTGCC CGGTGCCCTG TGCCGGCGGC ACTCGGTGCT CCCGGTCGGT GTGCGCAACG GTGCGCTCGT GCTCGCCACG CCGGACCCGG GCAACGTCGT CGCCGTCGAC GACGTCCGCA CGATCTCGGG CATGACCGTG ATCTCGGTCG TCGCGACGCA CGACAACGTC CTGCGAGCCA TCGACCGGTA CTGCCGGGCC GACGGCGAGA TGGAGGACCT CACCAACGCC TTCGAGGAGT CGCAGGAAGC CGAGGTCGAC CTCTCGGCAC GCATGGGGGA CGTCCTCGAC GACGAGGCGC CGATCGTGCG GTTCGTCAAC CTGCTGGTGA CGCAGGCGAT CACGGACCGC GCGTCGGACA TCCACATCGA GCCCAGCGAG CACGACCTGC GCGTGCGCTA CCGCATCGAC GGTGTCCTGC ACGAGACGCA GCGGGCGCCG AAGAACGTCA CCGGCGGCGT CGTCAGCCGC GTGAAGATCA TGAGCGACAT CGACATCGCG GAGAAGCGCA AGCCGCAGGA CGGCCGGATG TCGGTCATGC ACAACGGGCG CAAGATCGAC CTCCGTGTCG CGACCCTGCC GACGGTGTGG GGCGAGAAGA TCGTCATGCG CATCCTCGAC AACTCCACGG CGAGCCTGGA CCTGCGTGAC CTGTCGTTCC TCGAGCACAA CTACGCGACG TACAAGGAGT CGTACACCAA GCCGTACGGC ATGATCCTCG TCACGGGGCC CACGGGTTCG GGGAAGTCCA CGACGCTGTA CGCGACGCTC AACGCCGTCT CCAAGCCGGA CATCAACGTC ATCACCGTCG AGGACCCGGT CGAGTACCGG CTCGCGGGCA TCAACCAGGT GCAGGTGAAC CCCAAGGCGG GTCTGACGTT CGCCGCGGCC CTGCGCTCGA TCCTGCGTTC GGACCCCGAC GTCGTGCTCC TCGGTGAGAT CCGCGACCAC GAGACCGCGC AGATCGCGGT CGAGGCCGCC CTCACCGGGC ACCTCGTGCT CTCGACGCTG CACACGAACG ACGCGCCCTC GGCGGTGACC CGCCTGACGG AGATGGGTAT CGAGCCCTTC CTCGTGGGTT CGGCGCTCGA CTGCGTGGTC GCGCAGCGGC TCGCGCGACG GCTCTGCCCG AAGTGCAAGG AGGCGTACCG CCCGACCCCG CGGGAGCTGG AGGCCGCGCG CTTCCCGTGG GTCGAGGGCG AGCAGCTCCC CGAGTTCTTC CGTCCGGCGG GGTGCGCGGC GTGCTCGCGC ACCGGGTACA AGGGGCGCCT CGCGCTGCAC GAGGTGATGC GGGTCACCGA GGACATCGAG CGTCACGCCG TCGCTCACTC GTCGTCGGCC GACATCGGGG CGACCGCGGT CAAGCAGGGG ATGATCACGC TGCGCGACGA CGGGTGGCAG AAGGTGGCGT CCGGCCTGAC GTCGATCGAG GAGATCCTGC GCGTCGTGGC GTGA
|
Protein sequence | MKQLGEILLD EGLVTEAQLL AALDEQTSLG TSLGRTLVEL GILTEAQLVR ALAAQVGMEF VDLDEYPVDR TAVALVPGAL CRRHSVLPVG VRNGALVLAT PDPGNVVAVD DVRTISGMTV ISVVATHDNV LRAIDRYCRA DGEMEDLTNA FEESQEAEVD LSARMGDVLD DEAPIVRFVN LLVTQAITDR ASDIHIEPSE HDLRVRYRID GVLHETQRAP KNVTGGVVSR VKIMSDIDIA EKRKPQDGRM SVMHNGRKID LRVATLPTVW GEKIVMRILD NSTASLDLRD LSFLEHNYAT YKESYTKPYG MILVTGPTGS GKSTTLYATL NAVSKPDINV ITVEDPVEYR LAGINQVQVN PKAGLTFAAA LRSILRSDPD VVLLGEIRDH ETAQIAVEAA LTGHLVLSTL HTNDAPSAVT RLTEMGIEPF LVGSALDCVV AQRLARRLCP KCKEAYRPTP RELEAARFPW VEGEQLPEFF RPAGCAACSR TGYKGRLALH EVMRVTEDIE RHAVAHSSSA DIGATAVKQG MITLRDDGWQ KVASGLTSIE EILRVVA
|
| |