Gene Information |
Locus tag | Cfla_1135 |
Symbol | |
ID | 9145014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1270752 |
End bp | 1272491 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003636238 |
Protein GI | 296128988 |
COG category | |
COG ID | |
TIGRFAM ID | |
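The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for locus tag Cfla_1135.
length = gene_length_bp(1270752, 1272491)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Both results agree with the Gene Length (1740 bp) and Protein Length (579 aa) fields listed for this locus.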
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00673541 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000103654 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACGAG GAACCACAGC CGCGGCGGTC GCCGCGGTGC TCGTGACGGG GGTGCTCGGC AGCCCCGCGC GGGCAGCCGA ACCGGTGGTG GAGGAGGTGC TGGGCGCCAA CCTGCCGGTG CACGCGACCG ACCACGCGCT GGACTGGGAC GCGGGCGACG AGACGACGAT CGCGCTGTCG GGTGCGTCGG CGCAGGTCAC GGGGAGCGGT GCGAGCGCGC AGGGTGGCAC GGTCACCATC TCCGCACCGG GGACGTACCG GGTCAGCGGC ACGCTCACCG ACGGGGCGGT CGTCGTGGCG TCGGCGGGCG AGGGAGTCGT GCGGGTGGTG CTCGACGGGG CGTCGATCAC GTCCTCGACG ACCGCACCGC TGCAGGTGCA GGACGCCGAC GAGGTCGTGG TCGTGCTCGC CGAGGGCTCG ACGAACAGCC TCACGGACCC GGCGACGTAC CAGTACCCCG AGGGCCAGGA CGAGCCGAAC GCGGCGCTGT TCTCGACGGC CGACCTCACC ATCGCGGGCA GCGGGGCGCT GACGGTGACG GGCAGCGCGA ACGACGGGAT CGCCTCCAAG GACGGCCTCG TGGTCGCGGG TGGCCGGATC ACGGTGACCG CGGCGGACGA CGCCGTGCGC GGCAAGGACT ACCTCGTGGT CACCGGCGGC ACGCTCGAGC TCACCGCGGC CGGCGACGGT CTGAAGGCCG ACGACGACAC CCCCGAGGGC GGCTTCGTGC ACGTCGCGGG CGGCGCCACC CGCGTGACGT CGGGCGACGA CGGCGTGACG GCGGCGTCCG ACGTGGTGGT CTCGGACGGC TACCTGCAGG TGCGGGCCGG TGGCGGTGCC GGTGCGGGCG GCGACTCGGG CGCCAAGGGG CTCGTGGGTG ACGTGTCGCT CGTGCTCGGC GGGGGCTCGC TCGCCGTGGA CGCGATCGAC GACGCCCTGC ACTCCGACGG CACGATCACC GTCGCCGGGG GCAACGCGAC GCTCGCCACG GCGGGCGACG GCGCGGACGC GGGCGAGCGG CTGACGATCA CCGGCGGTGC GCTCATCGTC ACGCAGTCCT CGGAGGGCCT CGAGGCGAAG GTCGTCCAGA TCGCGGGGGG CCTGATCGAG GTCACGGCGG CCGACGACGC GATCAGCGCC TCGGACCCCT CGCAGCCGGA CGCGATGGGT GCGATCCCGG GTGTCGACGT CGCCGTCAGC GGCGGCCTGA CCGTGCTGCA CGCCGCGACG GGCGACGGGC TGGACTCCAA CGGCACCGCC CAGATGAGCG GCGGCACGCT CGTGGTCGAC GGTCCGACGG AGTTCATCAA CAGCGCGGTC GACACCAACG GCGCCTTCAC GGTGACGGGG GGCACGCTCA TCGGCGTCAG CTCCGCCGGG CTGCTCGGCA CCCCGACCGT CCAGTCGCCC CAGACGTGGG TGTCGCTCGG CTCGGAGCAG CCGGCCGGGA CGCTGCTGCA CGTGCTCGCC CCCGACGGCA CCGTGCTCGC GTCGTTCCGC ACGACGAAGG CCTCGGGCAA CCTGCTCTAC TCCCACGCGT CGCTGCAGCT GGGCTCGCAG TACCGGCTCG CGGTGGGCGG CACGGCCGAC GGGCCGGTCA CGGGGGGGTT CCACCAGCGG CCCGGTGACG CGTCCGGTGC CACGGTCGTC GCGACGGCGC AGGCTGCGAC CGCGCCGAGC GGCTGGGGCG GCGGCGGGGG CTGGCCGCCG CCGGGCGGCG TCCGCCCGCC TGCGGGCTGA
|
Protein sequence | MRRGTTAAAV AAVLVTGVLG SPARAAEPVV EEVLGANLPV HATDHALDWD AGDETTIALS GASAQVTGSG ASAQGGTVTI SAPGTYRVSG TLTDGAVVVA SAGEGVVRVV LDGASITSST TAPLQVQDAD EVVVVLAEGS TNSLTDPATY QYPEGQDEPN AALFSTADLT IAGSGALTVT GSANDGIASK DGLVVAGGRI TVTAADDAVR GKDYLVVTGG TLELTAAGDG LKADDDTPEG GFVHVAGGAT RVTSGDDGVT AASDVVVSDG YLQVRAGGGA GAGGDSGAKG LVGDVSLVLG GGSLAVDAID DALHSDGTIT VAGGNATLAT AGDGADAGER LTITGGALIV TQSSEGLEAK VVQIAGGLIE VTAADDAISA SDPSQPDAMG AIPGVDVAVS GGLTVLHAAT GDGLDSNGTA QMSGGTLVVD GPTEFINSAV DTNGAFTVTG GTLIGVSSAG LLGTPTVQSP QTWVSLGSEQ PAGTLLHVLA PDGTVLASFR TTKASGNLLY SHASLQLGSQ YRLAVGGTAD GPVTGGFHQR PGDASGATVV ATAQAATAPS GWGGGGGWPP PGGVRPPAG
|
| |