Gene Cfla_1147 details


Gene Information

Locus tag: Cfla_1147
Symbol:
ID: 9145026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1287057
End bp: 1289996
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: Fibronectin type III domain protein
Protein accession: YP_003636250
Protein GI: 296129000
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.478266
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000177205
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAGCCACA AGACCGCATC GCCCGGCCTG CTGCGGCGCC TGGTCGGAGC TCCCGGCGAC 
CGCCAGCGCG GCTTCGCCGA GGCAGGCGTC GCCGTGACCG GTGCCGCTCT CGTGCTGGGG
GCCGCGCTCG GCTCCGGCGT CGCCTCGACG GTCGTGTCGA TGTCCGACGG CGTCACCTGG
CTCCCCGACG AGGAGACCGG TCAGGTCGTC CAGATCAACC CCGCCACGGG GCGCGCGGAG
CGGCGCCTGC AGGTCGCCGC ACCGGGCAGC GAGCTCGCGA TCAGCCAGGC CGACGGCCGC
CTCGTCGTCA CCGACGTGGG CGCCGGCACC GCCACGACGA TCGACCTCGC GACGCTCCTG
GCCGGGGGCC AGCGCCGGAC CGAGGAACCC GCGCGCGTCC TCGTCGGCGG CGGTCAGGTC
TACCTCGTGA GCGTCGGTAC CGGTGTCGTG CGCGCCGTCG ACCCCCTCAC GCTCCAGGAC
CTCGGCTCGC CGTACCGTGC CGCCGTCCCG CTGGCCGACG CCGTCGTGGA CGCGCGGGGC
GCCGTGTGGG TCGTGACGAC CGAGGGCGAC GTGCGTTCGG TGACGTGGAA GCCCGGCGGC
TCGCGCTTCG AGGTGGGTGA GCCGCGGCCC GTGCGCGGCG CGGGCCGCGG CACCCGGCTG
CTGCCGCACG CGCGCGGCGT CACCGTCTTC GCTCCCGACG GGGGAGCGAT CGTGCAGGTC
GGCGTGGGTC GCGACCTCGC CGTCGCGGTG CCCGCGCTCT CCGGCGAGGT GCTGCCCGCC
GCGCAGGCCC CGACCGACCT CGCCCCGGCG GGGGTGCCCG CGCGGTCCGC CGTCGTCATG
CTCTCGGGAG ACCGCCTGCT CGAGGTGGGC GTCGGCACCC TCGGCTGCGC GCGGCCCGGG
CGTCCGGCGG TCTTCTCCGG GCTGGTCTAC GTGCCGTGCA CCGGCTCCGG GCGCGTCCTC
GTGCTCGGCA CCGACGGCAG CCGTGCGCGT CCCGACGTCG TCGCGCCCGC CGGGCGCGAC
CCGCGACTGC TCGTGGACGA CGGACGTCTC GTCGTGCACA CCGAGGACGG TTCGCAGGCG
GTCGTCGTCG AGGCCGACGG CGGCACGCGC GTCGTCGACA CGGGCCGCGC GGGGACAGCG
GTGCACGATC CGCGCAACGC CGCCTCGGCG CCCGTCGCCG TCCCCGCCCC CCGGCCGCCG
CACCGCGGCG CGGGCGCTCC CGCAGGCTCG CACCAGCGCG GCCCGCACCA GCAGGACGCC
GGGCGGCCGC AGCACGAGCA GCCGCAGCAG GAGCAGGAGC AGGAGGAGGA GCAGGAGCAG
GGTCCCGACG GGGAGGCGGT CGCGCCGGAG CCGACCGCGG CACCGAGCAC ACCGACCGGC
CCGGTCCCGA CCGCCCGCCC GCGCCCCCCG GCGATCACCC GGCCGCCGGC CGGTCCGGGA
GGCGCGGCGT CCACGGCACG CCCGACGTCG ACGGCCCGAC CCACGTCGAC CCCGGGGCGC
GCGCCCGACG CACCGACGCA GGTCGAGGCG ACGCTCGGCG AGTCGGACGG GTACGACGAC
GACGTCACCG TCACCTGGAG CCCGGTGTCC CCGCAGCCCG AGGCGTACGT CGTGCGCGCG
TCGCTGTCGG ACGGCGTCAT CTCGGCCGAC CGCGACCCGA CGCCCGATCC TGTGGAGGTC
GGCGGTGCCG CGACCTCCGC CACCATCCGT GTGGCGTGCA ACACCCGGTG GTCCTTCTCC
GTCGTGGCGG TCGCCGACGG GGCCACGTCC GAGCCGGCCG TGGGCCCGTC GCTGCGGGGG
ACGTCGTGCA CGGGCGCGCA GCGGGCGCCG TCGGCGCCCA CGGGCGTGAC GGCCGTCGCC
CACCCCGACG GGACCGTGAC GGTGTCGTGG ACGCGGTCGC ACTCCGGCGC CGAGGGGTAC
CTGGTGGGAC CCGTCGGCGG GTCCACCACG GCGACCGACG AGGGGGCCAG GTCGGTGGAG
CTGCGGGACG TGCCCCCCGG CCAGGGTGTC CGGTTCGTCG TCGAGGCGTT CCGCGGGGAC
CTGCGCACGC CGTCCGAGCC GTCCGCTCCC GTGGCGGTGG TCGGCGTCCC CGGGGAGGTG
CGGTTCGCCG GGGGGATGCG TACCGGGTGG TCCGGCTCGA CGTACCAGTT CGCGATCGGG
TGGGACGTCC CCGTCGACAA CGGCTCTCCC GTCGAGTCGT ACCACGTGGT GTGGAGCGGC
GGGGACTACA GCGGCGAGGA GGTCACCACG CAGCCGTCCT TCGAGGTCGA CCTCAGCTGT
GGCGGACGGA CCTCGTGCGT GAACGGGGGT GAGGTCACGC TGACGGTCAC GCCGCGCAAC
GCCGTCGGTG CCGGCCCGGC CGCCACGTTC GAGCACCGCT TCTCCGGCCC CGCCGGGCCC
TCCGCGGGCG AGGTCGTCGT CGCGTCCGTG ACGCCCCGCA CGCCCGCACT CGAGGACCCC
CTGGTCGAGA TGATCGCCAC GCTCGTCCCC CCGCAGGGGT GGGTGGCGCA CAGCGGCGGC
TGCACGCTCG TGACGACCTG GGAGGGTGTC GAGCGGACAC GCCCCGTCGG GTGCTCGGCC
GGGGAGGTCT CCGCGGGCAC CTACGCGTCG GGTGGCGGGC AGCTCACGGT CGCGCTGCGC
GCCGACGGCA ACGGGGCGCT GTCCGCACCC GTGACGGTCA CCGTGCCCGA CCGTTCGGCG
TGGCCGTACT GCGACCCGAC CCTCCCGCGC TGCACGGTGA CCGAGCTGCA GTCCGGCCCG
GAGCCCGGCG GTGTGCCGGA GGTGCTCATC GGGAACCCGG ACCTGGACCC CTGGCGTGCG
TCCCAGGCCG CGGCCGGAAC GTTCCTCTTC CTCGGCGCAG GAGCGCTCAG GGCCTTGCGC
CGACGCCGCA GGCCCGACAT GGTGCACGTC CTGACCCCCA CGGAGGAGAG ACCGGGTTGA
 
Protein sequence
MSHKTASPGL LRRLVGAPGD RQRGFAEAGV AVTGAALVLG AALGSGVAST VVSMSDGVTW 
LPDEETGQVV QINPATGRAE RRLQVAAPGS ELAISQADGR LVVTDVGAGT ATTIDLATLL
AGGQRRTEEP ARVLVGGGQV YLVSVGTGVV RAVDPLTLQD LGSPYRAAVP LADAVVDARG
AVWVVTTEGD VRSVTWKPGG SRFEVGEPRP VRGAGRGTRL LPHARGVTVF APDGGAIVQV
GVGRDLAVAV PALSGEVLPA AQAPTDLAPA GVPARSAVVM LSGDRLLEVG VGTLGCARPG
RPAVFSGLVY VPCTGSGRVL VLGTDGSRAR PDVVAPAGRD PRLLVDDGRL VVHTEDGSQA
VVVEADGGTR VVDTGRAGTA VHDPRNAASA PVAVPAPRPP HRGAGAPAGS HQRGPHQQDA
GRPQHEQPQQ EQEQEEEQEQ GPDGEAVAPE PTAAPSTPTG PVPTARPRPP AITRPPAGPG
GAASTARPTS TARPTSTPGR APDAPTQVEA TLGESDGYDD DVTVTWSPVS PQPEAYVVRA
SLSDGVISAD RDPTPDPVEV GGAATSATIR VACNTRWSFS VVAVADGATS EPAVGPSLRG
TSCTGAQRAP SAPTGVTAVA HPDGTVTVSW TRSHSGAEGY LVGPVGGSTT ATDEGARSVE
LRDVPPGQGV RFVVEAFRGD LRTPSEPSAP VAVVGVPGEV RFAGGMRTGW SGSTYQFAIG
WDVPVDNGSP VESYHVVWSG GDYSGEEVTT QPSFEVDLSC GGRTSCVNGG EVTLTVTPRN
AVGAGPAATF EHRFSGPAGP SAGEVVVASV TPRTPALEDP LVEMIATLVP PQGWVAHSGG
CTLVTTWEGV ERTRPVGCSA GEVSAGTYAS GGGQLTVALR ADGNGALSAP VTVTVPDRSA
WPYCDPTLPR CTVTELQSGP EPGGVPEVLI GNPDLDPWRA SQAAAGTFLF LGAGALRALR
RRRRPDMVHV LTPTEERPG