Gene Information |
Locus tag | Cfla_0032 |
Symbol | |
ID | 9143897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 36945 |
End bp | 38582 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003635151 |
Protein GI | 296127901 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
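The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates and assuming the stop codon is counted in the gene length but not in the protein length. Both assumptions are consistent with the listed values but are not stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Cfla_0032.
length_bp = gene_length(36945, 38582)
print(length_bp)                  # 1638
print(protein_length(length_bp))  # 545
```

Both results agree with the Gene Length and Protein Length fields listed for this locus.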
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCGC GCCGCGCCGG GCAGTCCCGG CCCTTGATGT CGTCCACCGG CCTCAGCCGC ATCATCTACA AGCCGCCGCC GCTCACCTCC GCGCAGAAGG TCCGACGCGC TCGAGCCTGG GCCGAGAGCG ACGTCGTCTC CCGGCTCGTC ACGGCTCGGA CGCACAATGT CGGCCGCCAC CGCGCCTACC CGCTGGACGC CATCGCCGCA GGGATATTCC TGCATGCTCT GAGTAGCCCG ACGGCGTTCA CCCTCGCGGG GGTCACCCGC ACCCTGCTGT CAGCGTCGCC CAGCGACAGG TTCGCGCTGG GCTTCCCGCC AAGGACCGTG ATCCGGTACC GGCCCATCCT GTCCGGGTTC CACGCCCTGG AGGACGCCTT GGGCGTGGGC TTCTTGGTCC CCCACGACCA CGAGCTGACC GCGGACCCGG AGACCGGCGA GGTCGACGAG TGCCCCGAGG GCTGCCCCTT CGCGCCGGCG AGCCTCGACG AGGTCGCCAC CGGACTGGTC CAGGCCAGCA TTCCGTCCAC GTGGACGCCG CCGAGCGTCG TGGCGATCGA CGGGACCGAC GTCGAGTCCG CTTACAGGGC TCGCACCGCG AACACCGGGA CCGACGCTGG CGATCGGCCG GCCGCTGACC CCGATGCACG GTGGGCCAAG CGCACCGGGA CCGACCGTCG GCCAAACGAG CTGTACTTCG GGTACGAGGC GCACATCGCG ACCTACGCAC CCGAGCCCGC AGGTGAACCG CTGCCCCAGC TGGCCGCCGG CCTGGCCATG CGCCCCGGGG TCAAGGACAG AGCAGGAGCA GCCCTCGGGA TCCTCGACGC GTTGCGGCAC GTCAAGGAGG TCCTGCTCGA CCGCGGCTAC ACCACCTCCC GTGCGGAGAA GCTCGCACGC CCACTACGGG AGCGTGGGAT CGCAGTGACG ATGGACCTGC ACTCCACCCA GCGCGGCGTT CGCCCCGGTC CGATCACCGG GTCACTGTGG ATCGACGGCG CTTTGTACTC CGACGCGCTA CCGGATCCCC TGAGGCAGCT CGAGCCTCCG AAGATCGGCG ACACCGCCAC TCAGAAGGCA CGCGCCCGCG AGCTGTTCGA CAAGCGCGAG GCCTACCGGT TCACCGCGCT GGCGCGTCGT CGCGACGAGC TCGGCAGCCA GCGGTGGAAG GGTCCGGCCC TCGCCGGGCA CCTGCGCTGC CCGAACACGC CGGCCTCGAT GCGCCTGGGA CGCCACCTTC CGACCGCCCA GTGCCTGCGC GGCGGGAACT GCGCATGCGG CAAGACCGTC ACCGTGCGCG ACACCGACCT CGAGCGCGAC CGGCAGCCGT TGCCCTGGCA GTCCACGCGG TGGGCGCTGT CGTTCAACCG CCGCAGCCTG ATCGAGGGAC TCAACGCCCA GCTGCGCTAC CAGGTCCGCA ACGTCAACCG CGGGTACTTC CGCGTCGCCG GTCTCGTCGC GACCACCCTC CTGATGGCGG TGACGCTCGC CGCGCACAAC ATCGAACGGC TCCGCGGCTG GCACCTCGCC CGGCACCAGC CCGACCCGTG GCAGGTGAAG CTCGGCGAAG AGCCGTCCGA CGAACCGATG GACCGCCACA CCTGGACGAA AGGCCGGCGC CGCCCGTACC GCGGGTAG
|
Protein sequence | MKARRAGQSR PLMSSTGLSR IIYKPPPLTS AQKVRRARAW AESDVVSRLV TARTHNVGRH RAYPLDAIAA GIFLHALSSP TAFTLAGVTR TLLSASPSDR FALGFPPRTV IRYRPILSGF HALEDALGVG FLVPHDHELT ADPETGEVDE CPEGCPFAPA SLDEVATGLV QASIPSTWTP PSVVAIDGTD VESAYRARTA NTGTDAGDRP AADPDARWAK RTGTDRRPNE LYFGYEAHIA TYAPEPAGEP LPQLAAGLAM RPGVKDRAGA ALGILDALRH VKEVLLDRGY TTSRAEKLAR PLRERGIAVT MDLHSTQRGV RPGPITGSLW IDGALYSDAL PDPLRQLEPP KIGDTATQKA RARELFDKRE AYRFTALARR RDELGSQRWK GPALAGHLRC PNTPASMRLG RHLPTAQCLR GGNCACGKTV TVRDTDLERD RQPLPWQSTR WALSFNRRSL IEGLNAQLRY QVRNVNRGYF RVAGLVATTL LMAVTLAAHN IERLRGWHLA RHQPDPWQVK LGEEPSDEPM DRHTWTKGRR RPYRG
|
| |
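The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do this in plain Python, under stated assumptions: the gene sequence is taken to be given in coding orientation (it begins with ATG even though the strand is listed as "-"), and the amino-acid assignments of translation table 11 are taken to match the standard genetic code, since table 11 differs from it in the start codons it permits. The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, not part of any database tool.

```python
# Standard amino-acid assignments in NCBI order (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAGGCGC GCCGCGCCGG GCAGTCCCGG"))  # MKARRAGQSR
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.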