Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1949 |
Symbol | |
ID | 9145843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2167150 |
End bp | 2170023 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003637043 |
Protein GI | 296129793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0876147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCTTC CTCTGTCCGA CCGTCTCGTG GTCCGCGGCG CGCGGGAGCA CAACCTCCGC AACGTCGACC TGGACCTGCC GCGTGACGCG CTCATCGCGT TCACCGGCCT GTCCGGCTCC GGCAAGTCGT CGCTCGCCTT CGACACGATC TTCGCCGAGG GCCAGCGCCG CTACGTCGAG TCGCTGTCGG CGTACGCGCG GCAGTTCCTC GGGCAGATGG ACAAGCCCGA CGTCGACTTC ATCGAGGGAC TGTCGCCGGC CGTCTCGATC GACCAGAAGT CCACCAACCG CAACCCGCGT TCGACCGTGG GCACGATCAC CGAGGTGTAC GACTACCTGC GCCTGCTCTT CGCGCGTGCC GGCACGCAGC ACTGCCCGGT GTGCGGCGAG CGCGTCACGG CGCAGACACC GCAGCAGATC GTCGACCGGC TGCTCGAGCT GCCGGAGGGC ACCCGCTACC AGGTGCTCGC GCCGGTCGTG CGCGGCCGCA AGGGGGAGTA CACCGACCTG TTCCGCGAGC TGCAGGGCAA GGGCTTCTCG CGGGCGCGCG TCGACGGCGA GGTCGTGCAG CTCGCGTCGC CGCCGACGCT GGAGAAGAAG CTCAAGCACG ACATCGAGGT CGTCGTCGAC CGGCTCGTGT CGCGCGAGGG CGTGCAGCGG CGCCTGACCG ACTCGGTCGA GACCGCGCTC GGGCTGGCCG GCGGCCTGCT GGTCGTCGAG CTCGTCGACG CGGACGCGGA CGACCCGCAG CGCGAGCGGA GGTTCTCCGA GAAGCGCGCC TGCCCCAACG ACCACGTCCT CACGCTCGAG GAGGTGGAGC CGCGGACCTT CTCCTTCAAC GCCCCGTACG GCGCGTGCCC CGAGTGCACG GGTCTCGGAT CGCGTCTCGA GGTGGACCCG GAGCTCGTGA TCCCCGACGA CGAGCTGTCG TTGTCGCAGG GCGCGGTCGC CCCGTGGGCG CAGACGTCGT CCGAGTACTT CCAGCGGGTC CTCACCGCGC TCGCGGCCGA CCTGGGCTTC TCCATGGACA CCCCGTGGCG TGCGCTGCCG AAGCGTGCGC GCGACGCCGT GCTGCACGGG CAGAACCACG AGGTGCACGT GCGGTACCGC AACCGCTGGG GCCGGGAACG CCAGTACTCC ACCGGCTTCG AGGGCGTCAT CACGTTCCTG CAGCGCCGCC ACGCCGAGAC GGACTCCGAT TGGAGCAAGG AGAAGTACGA GGCCTTCATG CGTGAGGTCC CGTGCCCGAC GTGCGAGGGC ACGCGTCTCA AGCCCGAGGT GCTCGCGGTG AAGGTCGGCG GGTACTCGAT CGCCGACGTC TGCGCGCTGC CGATCGACGA GGCTCGCGCC TTCATCGACG GTCTCGAGCT GGGCGAGCGC GAGCGGGCGA TCGCGGCGCA GGTCGTCAAG GAGATCCAAG CCCGGCTCGG CTTCCTCCTG GACGTGGGGC TCGACTACCT CTCGCTGATG CGGCCCGCAG GGACGCTGTC CGGAGGTGAG GCGCAGCGCA TCCGGCTCGC CACGCAGATC GGCTCCGGGC TGGTCGGCGT CCTGTACGTG CTCGACGAGC CGTCCATCGG GCTGCACCAG CGCGACAACC GGCGGCTCAT CGACACCCTC ACCCGGCTGC GCGACCTCGG GAACACGCTC ATCGTCGTCG AGCACGACGA GGACACGATC CGCACGGCCG ACTGGATCGT CGACATCGGT CCGGGAGCGG GCGAGCACGG TGGCCGCGTC GTCCACTCGG GCGACCTCGA CGGTCTGCTG GCGTCGGCGG AGTCCGTGAC CGGCGCCTAC CTGTCGGGGC GTCGCACGAT CCCGATGCCC GCGCAGCGGC GCCCCGTCGA CCGGTCGCGG CAGGTCACGG TCGTCGGCGC GCGTGAGAAC AACCTGCGCG GCATCGACGT GTCGTTCCCG CTGGGCGTGC TCACCGCGGT GACCGGCGTG TCCGGGTCCG GCAAGTCGAC CCTCGTCAAC TCGATCCTCT ACACGGTCAT GGCCAACGAG CTCAACGGTG CCCGCCAGGT CGCGGGGCGG CACCGCCGGG TCACCGGGCT CGAGCAGCTC GACAAGGTGG TCCACGTCGA CCAGGGACCC ATCGGGCGCA CGCCGCGGTC GAACCCCGCC ACGTACACCG GCGTGTGGGA CCACGTGCGC AAGCTGTTCG CCGAGACCAC CGAGGCGAAG GTGCGCGGCT ACACGCCCGG GCGCTTCTCG TTCAACGTCA AGGGTGGGCG CTGCGAGGCA TGCTCCGGGG ACGGCACGCT GAAGATCGAG ATGAACTTCC TGCCGGACGT CTACGTGCCG TGCGAGGTGT GCCACGGGGC GCGGTACAAC CGTGAGACGC TCGAGGTGCA CTTCAAGGGC AGGACCGTGG CGGACGTGCT GGCGATGCCG ATCGAGGAGG CCGCCGAGTT CTTCGCCGCG GTCCCGGCGA TCGCGCGTCA CCTGCGTACT CTCGTCGACG TCGGCCTCGG CTACGTGCGG CTCGGTCAGC CGGCGCCGAC GCTGTCCGGC GGCGAGGCGC AGCGCGTCAA GCTCGCCTCA GAGCTGCAGC GCCGCTCCAC GGGCCGCACG ATCTACGTGC TCGACGAGCC GACGACGGGC CTGCACTTCG AGGACATCCG CAAGCTGCTC GGCGTCCTGC AGTCGCTGGT CGACAAGGGC AACAGCGTGC TGGTGATCGA GCACAACCTC GACGTGATCA GGAACGCCGA CTGGGTCATC GACATGGGCC CGGAGGGCGG CTCCGGGGGT GGCACCGTGG TTGCGGAAGG GACCCCCGAG CACGTCGCCA CCGTGCCGGA GAGCCACACG GGCCGTTTCC TCGCCGAGGC CTTCGGCCCG GCGCGGGTCG AGGTGCCCGC CTGA
|
Protein sequence | MSLPLSDRLV VRGAREHNLR NVDLDLPRDA LIAFTGLSGS GKSSLAFDTI FAEGQRRYVE SLSAYARQFL GQMDKPDVDF IEGLSPAVSI DQKSTNRNPR STVGTITEVY DYLRLLFARA GTQHCPVCGE RVTAQTPQQI VDRLLELPEG TRYQVLAPVV RGRKGEYTDL FRELQGKGFS RARVDGEVVQ LASPPTLEKK LKHDIEVVVD RLVSREGVQR RLTDSVETAL GLAGGLLVVE LVDADADDPQ RERRFSEKRA CPNDHVLTLE EVEPRTFSFN APYGACPECT GLGSRLEVDP ELVIPDDELS LSQGAVAPWA QTSSEYFQRV LTALAADLGF SMDTPWRALP KRARDAVLHG QNHEVHVRYR NRWGRERQYS TGFEGVITFL QRRHAETDSD WSKEKYEAFM REVPCPTCEG TRLKPEVLAV KVGGYSIADV CALPIDEARA FIDGLELGER ERAIAAQVVK EIQARLGFLL DVGLDYLSLM RPAGTLSGGE AQRIRLATQI GSGLVGVLYV LDEPSIGLHQ RDNRRLIDTL TRLRDLGNTL IVVEHDEDTI RTADWIVDIG PGAGEHGGRV VHSGDLDGLL ASAESVTGAY LSGRRTIPMP AQRRPVDRSR QVTVVGAREN NLRGIDVSFP LGVLTAVTGV SGSGKSTLVN SILYTVMANE LNGARQVAGR HRRVTGLEQL DKVVHVDQGP IGRTPRSNPA TYTGVWDHVR KLFAETTEAK VRGYTPGRFS FNVKGGRCEA CSGDGTLKIE MNFLPDVYVP CEVCHGARYN RETLEVHFKG RTVADVLAMP IEEAAEFFAA VPAIARHLRT LVDVGLGYVR LGQPAPTLSG GEAQRVKLAS ELQRRSTGRT IYVLDEPTTG LHFEDIRKLL GVLQSLVDKG NSVLVIEHNL DVIRNADWVI DMGPEGGSGG GTVVAEGTPE HVATVPESHT GRFLAEAFGP ARVEVPA
|
| |