Gene Information |
Locus tag | Cfla_1147 |
Symbol | |
ID | 9145026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1287057 |
End bp | 1289996 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Fibronectin type III domain protein |
Protein accession | YP_003636250 |
Protein GI | 296129000 |
COG category | |
COG ID | |
TIGRFAM ID | |
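
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1287057, 1289996)
print(length)                     # 2940
print(protein_length_aa(length))  # 979
```

Under these assumptions the computed values agree with the Gene Length (2940 bp) and Protein Length (979 aa) fields.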
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.478266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000177205 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGCCACA AGACCGCATC GCCCGGCCTG CTGCGGCGCC TGGTCGGAGC TCCCGGCGAC CGCCAGCGCG GCTTCGCCGA GGCAGGCGTC GCCGTGACCG GTGCCGCTCT CGTGCTGGGG GCCGCGCTCG GCTCCGGCGT CGCCTCGACG GTCGTGTCGA TGTCCGACGG CGTCACCTGG CTCCCCGACG AGGAGACCGG TCAGGTCGTC CAGATCAACC CCGCCACGGG GCGCGCGGAG CGGCGCCTGC AGGTCGCCGC ACCGGGCAGC GAGCTCGCGA TCAGCCAGGC CGACGGCCGC CTCGTCGTCA CCGACGTGGG CGCCGGCACC GCCACGACGA TCGACCTCGC GACGCTCCTG GCCGGGGGCC AGCGCCGGAC CGAGGAACCC GCGCGCGTCC TCGTCGGCGG CGGTCAGGTC TACCTCGTGA GCGTCGGTAC CGGTGTCGTG CGCGCCGTCG ACCCCCTCAC GCTCCAGGAC CTCGGCTCGC CGTACCGTGC CGCCGTCCCG CTGGCCGACG CCGTCGTGGA CGCGCGGGGC GCCGTGTGGG TCGTGACGAC CGAGGGCGAC GTGCGTTCGG TGACGTGGAA GCCCGGCGGC TCGCGCTTCG AGGTGGGTGA GCCGCGGCCC GTGCGCGGCG CGGGCCGCGG CACCCGGCTG CTGCCGCACG CGCGCGGCGT CACCGTCTTC GCTCCCGACG GGGGAGCGAT CGTGCAGGTC GGCGTGGGTC GCGACCTCGC CGTCGCGGTG CCCGCGCTCT CCGGCGAGGT GCTGCCCGCC GCGCAGGCCC CGACCGACCT CGCCCCGGCG GGGGTGCCCG CGCGGTCCGC CGTCGTCATG CTCTCGGGAG ACCGCCTGCT CGAGGTGGGC GTCGGCACCC TCGGCTGCGC GCGGCCCGGG CGTCCGGCGG TCTTCTCCGG GCTGGTCTAC GTGCCGTGCA CCGGCTCCGG GCGCGTCCTC GTGCTCGGCA CCGACGGCAG CCGTGCGCGT CCCGACGTCG TCGCGCCCGC CGGGCGCGAC CCGCGACTGC TCGTGGACGA CGGACGTCTC GTCGTGCACA CCGAGGACGG TTCGCAGGCG GTCGTCGTCG AGGCCGACGG CGGCACGCGC GTCGTCGACA CGGGCCGCGC GGGGACAGCG GTGCACGATC CGCGCAACGC CGCCTCGGCG CCCGTCGCCG TCCCCGCCCC CCGGCCGCCG CACCGCGGCG CGGGCGCTCC CGCAGGCTCG CACCAGCGCG GCCCGCACCA GCAGGACGCC GGGCGGCCGC AGCACGAGCA GCCGCAGCAG GAGCAGGAGC AGGAGGAGGA GCAGGAGCAG GGTCCCGACG GGGAGGCGGT CGCGCCGGAG CCGACCGCGG CACCGAGCAC ACCGACCGGC CCGGTCCCGA CCGCCCGCCC GCGCCCCCCG GCGATCACCC GGCCGCCGGC CGGTCCGGGA GGCGCGGCGT CCACGGCACG CCCGACGTCG ACGGCCCGAC CCACGTCGAC CCCGGGGCGC GCGCCCGACG CACCGACGCA GGTCGAGGCG ACGCTCGGCG AGTCGGACGG GTACGACGAC GACGTCACCG TCACCTGGAG CCCGGTGTCC CCGCAGCCCG AGGCGTACGT CGTGCGCGCG TCGCTGTCGG ACGGCGTCAT CTCGGCCGAC CGCGACCCGA CGCCCGATCC TGTGGAGGTC GGCGGTGCCG CGACCTCCGC CACCATCCGT GTGGCGTGCA ACACCCGGTG GTCCTTCTCC GTCGTGGCGG TCGCCGACGG GGCCACGTCC GAGCCGGCCG TGGGCCCGTC GCTGCGGGGG 
ACGTCGTGCA CGGGCGCGCA GCGGGCGCCG TCGGCGCCCA CGGGCGTGAC GGCCGTCGCC CACCCCGACG GGACCGTGAC GGTGTCGTGG ACGCGGTCGC ACTCCGGCGC CGAGGGGTAC CTGGTGGGAC CCGTCGGCGG GTCCACCACG GCGACCGACG AGGGGGCCAG GTCGGTGGAG CTGCGGGACG TGCCCCCCGG CCAGGGTGTC CGGTTCGTCG TCGAGGCGTT CCGCGGGGAC CTGCGCACGC CGTCCGAGCC GTCCGCTCCC GTGGCGGTGG TCGGCGTCCC CGGGGAGGTG CGGTTCGCCG GGGGGATGCG TACCGGGTGG TCCGGCTCGA CGTACCAGTT CGCGATCGGG TGGGACGTCC CCGTCGACAA CGGCTCTCCC GTCGAGTCGT ACCACGTGGT GTGGAGCGGC GGGGACTACA GCGGCGAGGA GGTCACCACG CAGCCGTCCT TCGAGGTCGA CCTCAGCTGT GGCGGACGGA CCTCGTGCGT GAACGGGGGT GAGGTCACGC TGACGGTCAC GCCGCGCAAC GCCGTCGGTG CCGGCCCGGC CGCCACGTTC GAGCACCGCT TCTCCGGCCC CGCCGGGCCC TCCGCGGGCG AGGTCGTCGT CGCGTCCGTG ACGCCCCGCA CGCCCGCACT CGAGGACCCC CTGGTCGAGA TGATCGCCAC GCTCGTCCCC CCGCAGGGGT GGGTGGCGCA CAGCGGCGGC TGCACGCTCG TGACGACCTG GGAGGGTGTC GAGCGGACAC GCCCCGTCGG GTGCTCGGCC GGGGAGGTCT CCGCGGGCAC CTACGCGTCG GGTGGCGGGC AGCTCACGGT CGCGCTGCGC GCCGACGGCA ACGGGGCGCT GTCCGCACCC GTGACGGTCA CCGTGCCCGA CCGTTCGGCG TGGCCGTACT GCGACCCGAC CCTCCCGCGC TGCACGGTGA CCGAGCTGCA GTCCGGCCCG GAGCCCGGCG GTGTGCCGGA GGTGCTCATC GGGAACCCGG ACCTGGACCC CTGGCGTGCG TCCCAGGCCG CGGCCGGAAC GTTCCTCTTC CTCGGCGCAG GAGCGCTCAG GGCCTTGCGC CGACGCCGCA GGCCCGACAT GGTGCACGTC CTGACCCCCA CGGAGGAGAG ACCGGGTTGA
|
Protein sequence | MSHKTASPGL LRRLVGAPGD RQRGFAEAGV AVTGAALVLG AALGSGVAST VVSMSDGVTW LPDEETGQVV QINPATGRAE RRLQVAAPGS ELAISQADGR LVVTDVGAGT ATTIDLATLL AGGQRRTEEP ARVLVGGGQV YLVSVGTGVV RAVDPLTLQD LGSPYRAAVP LADAVVDARG AVWVVTTEGD VRSVTWKPGG SRFEVGEPRP VRGAGRGTRL LPHARGVTVF APDGGAIVQV GVGRDLAVAV PALSGEVLPA AQAPTDLAPA GVPARSAVVM LSGDRLLEVG VGTLGCARPG RPAVFSGLVY VPCTGSGRVL VLGTDGSRAR PDVVAPAGRD PRLLVDDGRL VVHTEDGSQA VVVEADGGTR VVDTGRAGTA VHDPRNAASA PVAVPAPRPP HRGAGAPAGS HQRGPHQQDA GRPQHEQPQQ EQEQEEEQEQ GPDGEAVAPE PTAAPSTPTG PVPTARPRPP AITRPPAGPG GAASTARPTS TARPTSTPGR APDAPTQVEA TLGESDGYDD DVTVTWSPVS PQPEAYVVRA SLSDGVISAD RDPTPDPVEV GGAATSATIR VACNTRWSFS VVAVADGATS EPAVGPSLRG TSCTGAQRAP SAPTGVTAVA HPDGTVTVSW TRSHSGAEGY LVGPVGGSTT ATDEGARSVE LRDVPPGQGV RFVVEAFRGD LRTPSEPSAP VAVVGVPGEV RFAGGMRTGW SGSTYQFAIG WDVPVDNGSP VESYHVVWSG GDYSGEEVTT QPSFEVDLSC GGRTSCVNGG EVTLTVTPRN AVGAGPAATF EHRFSGPAGP SAGEVVVASV TPRTPALEDP LVEMIATLVP PQGWVAHSGG CTLVTTWEGV ERTRPVGCSA GEVSAGTYAS GGGQLTVALR ADGNGALSAP VTVTVPDRSA WPYCDPTLPR CTVTELQSGP EPGGVPEVLI GNPDLDPWRA SQAAAGTFLF LGAGALRALR RRRRPDMVHV LTPTEERPG
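
The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any calculation. The following is a rough sketch of how the GC content field could be recomputed and the reading frame sanity-checked, assuming the stop codons of NCBI translation table 11 (TAA, TAG, TGA); the helper names are hypothetical, and the short inputs are fragments copied from the start and end of the gene sequence rather than the full record.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group residues in tens."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First 20 bases of the gene sequence: 11 of 20 are G or C.
print(gc_percent("TTGAGCCACA AGACCGCATC"))  # 55.0
# Last 9 bases of the gene sequence, ending in TGA.
print(ends_with_stop("CCGGGT TGA"))         # True
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.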