Gene Caul_1007 details


Gene Information

Locus tag: Caul_1007
Symbol:
ID: 5898462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1064603
End bp: 1066717
Gene Length: 2115 bp
Protein Length: 704 aa
Translation table: 11
GC content: 68%
IMG OID: 641561489
Product: flagellar hook-associated protein FlgK
Protein accession: YP_001682635
Protein GI: 167644972
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
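
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
start_bp = 1064603
end_bp = 1066717

# Inclusive span: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# so it does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2115 704
```

Both results agree with the Gene Length and Protein Length fields listed in the record.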


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.159823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.701147
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTGA ACACCATCTT GAACGTTGCG ACCTCGGGGA TGATGGCGGC CCAGACGGGC 
CTGCGCGTCG TGTCCGACAA CATCGCCAAC ATCAACACCG CCGGTTACGT CCGCAAGACG
ATCGCCCAGT CGAACCTGGT GTCGAACGGC ATGGGCGTGG GCGTCAACAT CGACGCCATC
AAGCGGGCCA CCGACAAGTA CATGGCCGCC GCCAGCCTGA ACGGCGCCTC CGAGGCTGGC
CGTTCGGGCT CCATCGCGAC CGCCATGGAC AACGCCCAGA AGCTGTTCGG CGACCCCAGC
GCCAAGACCA ACTATTTCGC GACCCTGGAC GACGTCTACG CCGCCTTCTC CCTGGCGGTC
GATGATCCCT CCTCGACCCT GCTGCGCAGC GGCGCCGTGA CCAAGGTCGA GGACTTCCTC
AGCGAGAGCA AGCGCATCAC CGCGTCGCTG TCGGGCCAGA TCAAGGACAC CGACAACCGC
ATCGCCGGCG ACGTCGATCG CGTCAACGAT CTGCTCAAGC AGATCGACAG CCTGAACGTC
GACATCACCC GCGCCAAGAT GGCGGGCGCC GACGGCACCG GTTCGGAGAA CGTCCAGAGC
GGCCTGATCG ACGAGCTGTC GACCTATATG AACATCCAGG TCAGCGACCG CGCCAACGGC
GGCGTCATGG TGCGCTCGGC CGAGGGCGTC ACCCTGGCCG GCAACGGTCC CGCCACCGTC
ACCTACAACC AGTCGGGTCC GGCCAACGGC TATCTGACGG TCACCACCGC CAACAGCGGC
GGCCAGGTCA TGCCGCTGGC CGTCACCAAC GGTGAGATCC AGGGCCTGCT GCAGCTGCGC
AACAAGGATC TCCCGAACCT CTCCGATCAG CTGGGCGAGT TCGTCAGCCG CGCCGCCGAG
GAACTGAACC GCGCCTCCAA CGCCGCCAGC TCGGTCCCCG CGCCGAACGT CATGACCGGC
AAGGACACCG GCCTGGACGC CAACACGGCG TTCGCCAACT TCACCGGCAA GACCACCATC
GCCGTCACCA ACGCCGCCGG CGCCGTCGTG CAGCGGGTCG ATGTCGATTT CGGCGCCGGG
ACCATGACCG TCAACGGCGC GCCCGGCCCG GCCTTCACCA ACACCAACTT CCTCACCCAG
CTGAACACCG CCCTGGGCGG CGCGGCCACG GCCAGCTTCG CCAACGGCGC GATGAGCCTG
AACACCGCGA CCGCCACCAA CGGGGTCGCC ATCGTCGACG ACGCCACCAC GCCCTCGACC
AAGGCCGGCA AGGGCTTCAG CCAGTTCTTC GGCCTCAACG ACATCATCCG CTCGGACGGC
TACTCGCCCT ACGAGACCGG GATGACCGCC ACCGACCCCA GCGGCTTCAC CCCCGGCCAG
ACCATCACCC TGCGCTTGAC CGACACGGAC GGCAGCCGCA TCCGCGACAT CGCGGTGGCC
ATCCCGACCG GGACCGGCTC GATGCAGGAA ACCATCGACG CCCTGAACTC GCGCAATTCG
GGCGTCGGCC TCTACGGGTC GTTCGCCCTG GGCTCCAAGG GCGAGCTGAC CTTCACCCCC
AGCGGCGCGG CGCCCGTCAC CCTGTCGGTG GTCGAGGACA CCACCCAGCG CGGAACCTCG
GGTCCGTCGC TCAGCCAGCT GTTCGGCGTG GGCATGAGCG AGCGCAGCAC GCGCGGCGGC
CTGTTCCATG TCGATCCGGC CATGGAAGCC GATCCCTCCA AGTTGCCGTT CGCCAAGCTC
GACCTGACCG CGGCGGCCGG CACGCCCGCC CTGGCGACCG GCGACGGCCG CGGCGCCCTG
GCCCTGGCCA AGGCCGGCGA CGTCACGACC AGCTTCGCCA ACGCCGGCGA CGCCGCCGCC
GCCAAGAAGA GCGTGTTGAG CTACGGCGCC GACTTCAGCG GCTCGATCGC CCGCAAGGCC
GCCTCGGCCA CCAGCCGCAA GGAAGCCGCC GACTCCGTCC AGACCGAGGT CAACGCCCAG
CGCCAGTCGC AGGAGGGGGT CAACCTCGAC GAGGAACTGG TCAACCTGAC CACCTATCAG
CAGGCGTTCA ACGCCTCCGC CCGCCTGATC CAGGCGACGA AGGACATGTT CGATGTCCTC
ACCAATATCG TTTGA
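
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it needs to be joined before any computation such as a GC fraction. The following is a minimal sketch, not the database's own method; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGTCGTTGA ACACCATCTT GAACGTTGCG ACCTCGGGGA TGATGGCGGC CCAGACGGGC"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases after joining
print(f"{gc_fraction(seq):.0%}")       # GC fraction of this line only
```

Passing the full 2115 bp listing to the same helpers would give a value to compare against the GC content field of the record.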
 
Protein sequence
MSLNTILNVA TSGMMAAQTG LRVVSDNIAN INTAGYVRKT IAQSNLVSNG MGVGVNIDAI 
KRATDKYMAA ASLNGASEAG RSGSIATAMD NAQKLFGDPS AKTNYFATLD DVYAAFSLAV
DDPSSTLLRS GAVTKVEDFL SESKRITASL SGQIKDTDNR IAGDVDRVND LLKQIDSLNV
DITRAKMAGA DGTGSENVQS GLIDELSTYM NIQVSDRANG GVMVRSAEGV TLAGNGPATV
TYNQSGPANG YLTVTTANSG GQVMPLAVTN GEIQGLLQLR NKDLPNLSDQ LGEFVSRAAE
ELNRASNAAS SVPAPNVMTG KDTGLDANTA FANFTGKTTI AVTNAAGAVV QRVDVDFGAG
TMTVNGAPGP AFTNTNFLTQ LNTALGGAAT ASFANGAMSL NTATATNGVA IVDDATTPST
KAGKGFSQFF GLNDIIRSDG YSPYETGMTA TDPSGFTPGQ TITLRLTDTD GSRIRDIAVA
IPTGTGSMQE TIDALNSRNS GVGLYGSFAL GSKGELTFTP SGAAPVTLSV VEDTTQRGTS
GPSLSQLFGV GMSERSTRGG LFHVDPAMEA DPSKLPFAKL DLTAAAGTPA LATGDGRGAL
ALAKAGDVTT SFANAGDAAA AKKSVLSYGA DFSGSIARKA ASATSRKEAA DSVQTEVNAQ
RQSQEGVNLD EELVNLTTYQ QAFNASARLI QATKDMFDVL TNIV