Gene Caul_2797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2797 
Symbol 
ID5900252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3035566 
End bp3037923 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content64% 
IMG OID641563289 
Productouter membrane protein assembly complex, YaeT protein 
Protein accessionYP_001684422 
Protein GI167646759 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0843475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0789319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGTC CCATGAACAA AATTCGCGCC CATAGCGCCG CGTTCGCCAC CGGCGTTGCG 
CTCCTTCTCG GTTCCACGGC GCTGACCGCG CCTCAGGCCG CCTTCGCCCA GGCCCAGTCG
GGCGTGGTCC AGCGCATCGT GGTGCAGGGC AACGAGCGGA TCGAACAGGG GACCGTGCTG
TCCTACCTGC CGATCCAGCC CGGCGAGAGC GTCGATCCGC AGCGCCTCGA CCTGGCGCTG
AAGACCCTGG CCCGCACCGA CCTGTTCGCC GACGTGAAGA TTGAGTTGGT GGGCGGCGAC
CTGATCGTCA AGGTCGTCGA GAACCCGATC ATCAACCAGG TGGTTTTCGA GGGGAACTCG
GCGCTCAAGG AAGACAAGCT GAAGGACGAG GTCCAGATCC GTCCGCGCGG CATCTTCACG
CGCTCCAAGG TCCAGGCCGA CGTTCAGCGC ATCATAGAGC TCTATCGTCG CTCGGGCCGC
ATCTCGGCGA CCGTCACGCC CAAGGTGGTC GAGCTGCCGC AAAAGCGCGT CGACCTGGTG
TTCGAGATCA ATGAAGGCCC CAAGAGCGGC GTGCTGGGCG TCAACTTCCT GGGCAACACC
GAGTACTCGG ACAACGACCT GAAGGACGTC ATCGTCACCA AGGAGTCGCA CTGGTACAAG
TTTCTGACCA GCAACGACAA TTACGACCCC GACCGTATCG AGTACGACCG CGAGCAGCTG
CGCAAGTTCT ACCGCAACCG CGGCTATTTC GACTTTCGCG TCGTGGCCTC GGTCGCCGAA
CTGGCCACCG ACAAGAACGG CTTCGCGGTG ACCTACACGC TTGACGAAGG TCCCAAGTAC
AAGTTCGGCA AGATCACCGT CGAGACCGAG CTGAAGAAGC TGGATGGCAA CCTGCTGGCC
CAGATCCTGC CGGTCCGCAC CGGCCAGCTC TATGAAGACG AGAAGATCGA GCAGGCCACC
GACGCCCTGA CCTTCGCGGC GGGCGCCGCC GGCTTCGCCT TCGTGGACGT GCGTCCGCGC
TATGTGCCCA ACCACGAGAC CAACACCGTG GACGTGGTGT TCTCGGTCCG CGAAGGCCCG
CGCGTTTACG TCGACCGCAT CGACATCGTC GGCAACACCC GCACGCTCGA CTATGTCGTC
CGTCGCGAAC TGGAAGTGGC CGAGGGCGAC GCCTACAACC GCGTGCTGGT CGATCGCTCG
AAGAACAACA TCCGCGCCCT GGGCTTCTTC AAGGACGTCA ATATCGAGGA AGTGCCCGGC
GCTCAGCCGG ACCGCACCGC CCTGCGGGTC AAGACCGAAG AGCAGCCGAC CGGCGAATTG
TCGTTCAGCG CCGGCTACAG CTCGGTCGAC AAGCTGGTGC TCGACGTCGG CATCACCGAA
CGCAACTTCC GTGGCCGGGG CCAGAACCTG CGGGCCCGCG TCTCGGTCGG TTCGCTGCGT
CAGCAGATCG ACTTCGGCTT CACCGAGCCG CGCTTCCTGG GCCGCGACCT GCGCGCGGGC
CTCGACCTCT ATTCCTACCG CTACGACCTG AGCGACTATG CGTCCTACGA CACCCAGTCG
ACGGGCGGCA CGCTGCGGCT GGGCTTCCCG CTGACCCAGA ACGCCTCGAT GGGCCTGCGC
TACCAACTCC GCCAGGACAA GGTCAGCGTC GCTGACAGTC TGTGCACGAG CGGCTCGGTG
TCTCAAATCC TCTGCCTGCA GCGCGGCGCC TACATGACCT CGCTAATCGG CTACAACCTG
CGCATCGACA AGCGCAACGA TCCGATCCAG CCGACGCGCG GCTGGTTCGC CGACCTGAGC
CAGGATCTGG CCGGCTTCGG CGGTGACGTG AAGTACCTGA AGACCGATAC CGACATCGGT
TGGTATTGGG GCTTCAACAA GGACTTCATC TTCAGCGCCA CCGGTTCGGC GGGCTATATC
GAAGGCTGGG GCGGCGACAA CATCCGCATC AACGACCGCT ACTATAAGGG CGGGTCCTCG
TTCCGCGGCT TCGAGATCGC CGGTATCGGC CCGCGCGACA CCACGACCCA GACCAACGCC
CTGGGCGCCA AGCTCTACGC CATCGGCACC TTCGAACTGA CGGTCCCGAC CCTCCTGCCC
GAGCAGTACG GCATCAAGGC CGCGGTGTTC ACCGACTTTG GTACGGCGGG TCAGCTCGAC
GACATCGATC GTTTGGGCGC CGACGGCAAA CCCAACCCGC TCATCAAGGA CGACCTGGGT
CTACGGGCCT CGGCGGGGTT GAGCATCGAC TGGAAATCGC CCATGGGCCC CATCCGGTTC
GATTTCAGCC ACATTCTCGC TAAAGACAGC TATGACAGAA CCGAAACCTT CCGGTTCTCC
ACCTCCACAA GGTTTTAA
 
Protein sequence
MIGPMNKIRA HSAAFATGVA LLLGSTALTA PQAAFAQAQS GVVQRIVVQG NERIEQGTVL 
SYLPIQPGES VDPQRLDLAL KTLARTDLFA DVKIELVGGD LIVKVVENPI INQVVFEGNS
ALKEDKLKDE VQIRPRGIFT RSKVQADVQR IIELYRRSGR ISATVTPKVV ELPQKRVDLV
FEINEGPKSG VLGVNFLGNT EYSDNDLKDV IVTKESHWYK FLTSNDNYDP DRIEYDREQL
RKFYRNRGYF DFRVVASVAE LATDKNGFAV TYTLDEGPKY KFGKITVETE LKKLDGNLLA
QILPVRTGQL YEDEKIEQAT DALTFAAGAA GFAFVDVRPR YVPNHETNTV DVVFSVREGP
RVYVDRIDIV GNTRTLDYVV RRELEVAEGD AYNRVLVDRS KNNIRALGFF KDVNIEEVPG
AQPDRTALRV KTEEQPTGEL SFSAGYSSVD KLVLDVGITE RNFRGRGQNL RARVSVGSLR
QQIDFGFTEP RFLGRDLRAG LDLYSYRYDL SDYASYDTQS TGGTLRLGFP LTQNASMGLR
YQLRQDKVSV ADSLCTSGSV SQILCLQRGA YMTSLIGYNL RIDKRNDPIQ PTRGWFADLS
QDLAGFGGDV KYLKTDTDIG WYWGFNKDFI FSATGSAGYI EGWGGDNIRI NDRYYKGGSS
FRGFEIAGIG PRDTTTQTNA LGAKLYAIGT FELTVPTLLP EQYGIKAAVF TDFGTAGQLD
DIDRLGADGK PNPLIKDDLG LRASAGLSID WKSPMGPIRF DFSHILAKDS YDRTETFRFS
TSTRF