Gene Caul_1010 details


Gene Information

Locus tag: Caul_1010
Symbol:
ID: 5898465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1069627
End bp: 1071297
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 65%
IMG OID: 641561492
Product: hypothetical protein
Protein accession: YP_001682638
Protein GI: 167644975
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
        [COG4786] Flagellar basal body rod protein
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0819939
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.629928
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCATCA ACAGCGCCAT GCTCGCCGGG GTTTCCGGCC TGATCTCCAA CTCGTCGGCC 
CTGGCCGCGA TTTCGGACAA CATCGCCAAC GTCAACACGG TCGGCTACAA GCGCAGCTCG
GCCAACTTCT CGACGCTGGT CACCGCCCAG AGCAAGAGCG CCACCTACAG CGCCGGCGGC
GTGAAGGCCC AGACCCACCA GTTCGTCAGC CAGCAGGGCC TGACCCAGTC GACGACCTCG
AACCTCGACC TGTCGATCGC CGGCTCGGGC TTCTTCGTCG GCACCGAGAA GCCGGAAGGC
CTGACCGCCA CCGACACCCG CTCGTTCACC CGCGCCGGTT CGTTCCAGCT GGACAACCTG
GGCTATCTGA AGAACGACGC CGGCCTCTAC CTGCAGGGCT GGCTGGCCGA TCCGGTGACC
GGCACCATCA CCCCCGACCC GTCGGACCTG ACCCAGCTGT CGTCGATCAA TGTCGGCACG
GTCGGCGGCA CGGCCGAGAA GACCACCCGG ATCGGCGTCA ACGCCAACCT GCGCTCGGAG
CAGCCGGTGT CGGCCGCGGC CAACGCCGTG GCGACCAAGA CAGCCGTCAT CGACAGCGGC
GGCGCGACCA ACAACTACTC AGTCTATTAC AGCCCCACGG GCACGGGCAA CCAGTACCAG
GTCGAGATCC GCAAAGCCGG CGTGGCCGTG TCGACCGGCA CCGCGACCTT CGATCCGGTC
ACCGGAAACC TGCTCTCGAC CACCCTGCCG GGCACGCCGC CCAACCTCAA CATCGGCGGC
GGCAACACCG TCACCCAGAC CCAGTTGGGC CTGAACAACA AGACCGACGC CGTCTCCAGC
GGCGCCTACG ACCCGACGAC CCGCTCGATG TCGGACTACG CCCTGGACAA CACCACGGGC
GTGAAGCCGG ACTTCGAGAT CCAGATCCCG GTCTCGGACT CCAAGGGCGG CCAGCGCACC
ATCACCCTGT CGCTGCTGAA GGGCCCGGGT CCCAACGAAT GGTTCGCCGA ACTGCGCGCC
AAGCCGGGCG ACCTGGACAA CAACGCCAAC GGCCAGATCG CCTCGGGCAA GGTGACCTTC
ACCACCGACG GCAAGCTGGC CTCGGTCGGC AACCTGTTCG GCGGCGTCAC CCCGACCGCG
ATCAGCATCG GCGCCTCCGA TCCGCTGGCG GTCGGCACGG CCCCGCGCTG GGCCGACGGC
TTGGGCATCG ACGCACAGAA CCTGCAGGTC GACCTGGCCA GCGCCTCTGG CGGCCTGACC
CAGTACAACA GCCAATCCGT CGTCCAGTCG GTCAACACCA ACGGCACGGC CTTCGGCAAC
CTGACCAACA TCGAAGTCGA TGACAAAGGC TACGTCTCGG CGATCTTCGA CAACGGCGTG
ACCCGCCGGA TCGCACAGGT AGCGATCGCG ACCTTCTCCA ACCCCAATGG ATTGAAGGGG
GTGAACGGAA ATGCATATCG CGTCACCAAC GAAAGCGGCA CCTATAGCCT GAAGACTCCG
GGTGGCGGCG GCGCGGGCTC GATTGCTCCG TCCACGCTGG AAGCTTCGAC GGTCGACTTG
TCGACTGAGT TCACCGGCTT GATCACGACG CAGAGAGCCT ATTCGGCCTC GTCGAAGATC
ATCACTACCG CTGACCAGAT GCTAGAAGAG CTTCTGAGCA TTAAGCGGTA A
 
Protein sequence
MSINSAMLAG VSGLISNSSA LAAISDNIAN VNTVGYKRSS ANFSTLVTAQ SKSATYSAGG 
VKAQTHQFVS QQGLTQSTTS NLDLSIAGSG FFVGTEKPEG LTATDTRSFT RAGSFQLDNL
GYLKNDAGLY LQGWLADPVT GTITPDPSDL TQLSSINVGT VGGTAEKTTR IGVNANLRSE
QPVSAAANAV ATKTAVIDSG GATNNYSVYY SPTGTGNQYQ VEIRKAGVAV STGTATFDPV
TGNLLSTTLP GTPPNLNIGG GNTVTQTQLG LNNKTDAVSS GAYDPTTRSM SDYALDNTTG
VKPDFEIQIP VSDSKGGQRT ITLSLLKGPG PNEWFAELRA KPGDLDNNAN GQIASGKVTF
TTDGKLASVG NLFGGVTPTA ISIGASDPLA VGTAPRWADG LGIDAQNLQV DLASASGGLT
QYNSQSVVQS VNTNGTAFGN LTNIEVDDKG YVSAIFDNGV TRRIAQVAIA TFSNPNGLKG
VNGNAYRVTN ESGTYSLKTP GGGGAGSIAP STLEASTVDL STEFTGLITT QRAYSASSKI
ITTADQMLEE LLSIKR
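
The listed coordinates, lengths, GC content, and protein sequence can be cross-checked against the gene sequence. The following is a minimal Python sketch of such a check, not part of the original record. The helper names (clean, span_length, gc_content, translate) are hypothetical. It assumes that start and end coordinates are 1-based and inclusive, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which start codons are permitted, which does not matter here because the gene begins with ATG).

```python
# Amino acids of the standard codon table in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(span_length(1069627, 1071297))         # 1671, the listed gene length
print(span_length(1069627, 1071297) // 3 - 1)  # 556 codons excluding the stop
print(translate("ATGAGCATCA ACAGCGCCAT G"))  # MSINSAM, start of the protein
```

Passing the full gene sequence to gc_content and translate would allow the listed GC content and the complete protein sequence to be compared in the same way.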