Gene Information |
Locus tag | Caul_1010 |
Symbol | |
ID | 5898465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1069627 |
End bp | 1071297 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641561492 |
Product | hypothetical protein |
Protein accession | YP_001682638 |
Protein GI | 167644975 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE; [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0819939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.629928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCA ACAGCGCCAT GCTCGCCGGG GTTTCCGGCC TGATCTCCAA CTCGTCGGCC CTGGCCGCGA TTTCGGACAA CATCGCCAAC GTCAACACGG TCGGCTACAA GCGCAGCTCG GCCAACTTCT CGACGCTGGT CACCGCCCAG AGCAAGAGCG CCACCTACAG CGCCGGCGGC GTGAAGGCCC AGACCCACCA GTTCGTCAGC CAGCAGGGCC TGACCCAGTC GACGACCTCG AACCTCGACC TGTCGATCGC CGGCTCGGGC TTCTTCGTCG GCACCGAGAA GCCGGAAGGC CTGACCGCCA CCGACACCCG CTCGTTCACC CGCGCCGGTT CGTTCCAGCT GGACAACCTG GGCTATCTGA AGAACGACGC CGGCCTCTAC CTGCAGGGCT GGCTGGCCGA TCCGGTGACC GGCACCATCA CCCCCGACCC GTCGGACCTG ACCCAGCTGT CGTCGATCAA TGTCGGCACG GTCGGCGGCA CGGCCGAGAA GACCACCCGG ATCGGCGTCA ACGCCAACCT GCGCTCGGAG CAGCCGGTGT CGGCCGCGGC CAACGCCGTG GCGACCAAGA CAGCCGTCAT CGACAGCGGC GGCGCGACCA ACAACTACTC AGTCTATTAC AGCCCCACGG GCACGGGCAA CCAGTACCAG GTCGAGATCC GCAAAGCCGG CGTGGCCGTG TCGACCGGCA CCGCGACCTT CGATCCGGTC ACCGGAAACC TGCTCTCGAC CACCCTGCCG GGCACGCCGC CCAACCTCAA CATCGGCGGC GGCAACACCG TCACCCAGAC CCAGTTGGGC CTGAACAACA AGACCGACGC CGTCTCCAGC GGCGCCTACG ACCCGACGAC CCGCTCGATG TCGGACTACG CCCTGGACAA CACCACGGGC GTGAAGCCGG ACTTCGAGAT CCAGATCCCG GTCTCGGACT CCAAGGGCGG CCAGCGCACC ATCACCCTGT CGCTGCTGAA GGGCCCGGGT CCCAACGAAT GGTTCGCCGA ACTGCGCGCC AAGCCGGGCG ACCTGGACAA CAACGCCAAC GGCCAGATCG CCTCGGGCAA GGTGACCTTC ACCACCGACG GCAAGCTGGC CTCGGTCGGC AACCTGTTCG GCGGCGTCAC CCCGACCGCG ATCAGCATCG GCGCCTCCGA TCCGCTGGCG GTCGGCACGG CCCCGCGCTG GGCCGACGGC TTGGGCATCG ACGCACAGAA CCTGCAGGTC GACCTGGCCA GCGCCTCTGG CGGCCTGACC CAGTACAACA GCCAATCCGT CGTCCAGTCG GTCAACACCA ACGGCACGGC CTTCGGCAAC CTGACCAACA TCGAAGTCGA TGACAAAGGC TACGTCTCGG CGATCTTCGA CAACGGCGTG ACCCGCCGGA TCGCACAGGT AGCGATCGCG ACCTTCTCCA ACCCCAATGG ATTGAAGGGG GTGAACGGAA ATGCATATCG CGTCACCAAC GAAAGCGGCA CCTATAGCCT GAAGACTCCG GGTGGCGGCG GCGCGGGCTC GATTGCTCCG TCCACGCTGG AAGCTTCGAC GGTCGACTTG TCGACTGAGT TCACCGGCTT GATCACGACG CAGAGAGCCT ATTCGGCCTC GTCGAAGATC ATCACTACCG CTGACCAGAT GCTAGAAGAG CTTCTGAGCA TTAAGCGGTA A
|
Protein sequence | MSINSAMLAG VSGLISNSSA LAAISDNIAN VNTVGYKRSS ANFSTLVTAQ SKSATYSAGG VKAQTHQFVS QQGLTQSTTS NLDLSIAGSG FFVGTEKPEG LTATDTRSFT RAGSFQLDNL GYLKNDAGLY LQGWLADPVT GTITPDPSDL TQLSSINVGT VGGTAEKTTR IGVNANLRSE QPVSAAANAV ATKTAVIDSG GATNNYSVYY SPTGTGNQYQ VEIRKAGVAV STGTATFDPV TGNLLSTTLP GTPPNLNIGG GNTVTQTQLG LNNKTDAVSS GAYDPTTRSM SDYALDNTTG VKPDFEIQIP VSDSKGGQRT ITLSLLKGPG PNEWFAELRA KPGDLDNNAN GQIASGKVTF TTDGKLASVG NLFGGVTPTA ISIGASDPLA VGTAPRWADG LGIDAQNLQV DLASASGGLT QYNSQSVVQS VNTNGTAFGN LTNIEVDDKG YVSAIFDNGV TRRIAQVAIA TFSNPNGLKG VNGNAYRVTN ESGTYSLKTP GGGGAGSIAP STLEASTVDL STEFTGLITT QRAYSASSKI ITTADQMLEE LLSIKR
|
| |
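The Start bp, End bp, Gene Length, and Protein Length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1069627
end_bp = 1071297
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the values listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, both comparisons agree with the listed 1671 bp and 556 aa.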