Gene Caul_4091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4091 
Symbol 
ID5901553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4440617 
End bp4443376 
Gene Length2760 bp 
Protein Length919 aa 
Translation table11 
GC content66% 
IMG OID641564611 
Productouter membrane insertion C-terminal signal 
Protein accessionYP_001685713 
Protein GI167648050 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.798393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC AATTCATGCC GCGCCGGGCG CGATTGATGA GCGGCGCGGC CACCGGCCTG 
GCGCTGCTGC TGACGACCGC GGGCCTCGCC CACGCTCAGG ACAAGGCCGC TCCCGCCAAC
GACACCGTCG AGGAAATCAC GGTCACCGGC ATCCGCGCGG GCATCGAGAA CGCCATCGCG
CTGAAGAAGT CCTCCAGCTC GATCGTCGAG GCCGTGTCGG CCGAAGACAT CGGCAAGCTG
CCCGACACCT CGATCGCCGA GTCCCTGGCC CGCCTGCCCG GCCTGACCGC CCAGCGCCTG
GACGGTCGCG CCCAGTCGAT CTCGATCCGC GGCCTGGGCC CCGACTACAA CACCGTGCTG
CTGAACGGCC GCGAGCAGGT CTCGACCGGC GACAACCGCG GCGTCGAGTT CGACCAGTAT
CCTTCGGAAA TCCTCAGCGG CGTGCTGGTC TACAAGACCC CGGACGCGGC CCTGATCGGC
CAAGGCCTGG CCGGCACCGC CGACCTGCAG ACCATCCGTC CGCTGAAGTA CGGCAAGAAG
GTGCTGTCGG CCAACGCCCG CTACGAGTTC AACAGCGAAG ACAAGCTGAA CCCGGACGCC
AAGGACAAGG GCCACCGCTT CAGCGCCACC TATGTCGACC AGTTCATGAA CGACACGCTG
GGCGTCGCCA TCGCGGTCTC CGACATCTCG ACCCCGACCC AGAGCCGCCG CTTCAACGCC
TGGGGCTATC CGACCACCGG CACGGGCGAC CTGGTCATCG GCGGCGCCAA GCCTTACGTC
CAGTCCAACA ACCTCAAGCG CACCGGCGTG ATCGGCGTGC TGGAATACAG CCCGACCGAC
AAGTTCCACA CCTCGCTCGA CCTCTACTAT TCGAAGTTCC GCGAGAAGCA GATCCTGCGC
GGCATCGAGC TGCCGCTGTT CTGGAGCTCG GCCACGCTGC AGCCCGGCTC AACCGCCAAC
GGCGGCCTGA TCACCAACGG CGTCTATTCC GGCGTGAAGG GGGTCATGCG CAACGACCTC
AACACCCGCC ACACCACCCT GAAGTCAGCC GGCTGGAACA TCGCCTACGA CACCGACAAC
GGCTGGACGC TGGCGGCCGA CCTCAGCCAC TCCGAAGCCA AGCGCACCGA CGTGATCTTC
GAGTCCTACG CCGGCACGGG TCCGTCCGGC GTCGGCGCCA CCGACACCAT GACCTTCCAC
ACGACGCCCG GCGCGGGCAC CACCTTCGGC TCGACGCTCG ACTACACCAA TTCGACCCTG
TTCATGCTGA CCGACCCGCA GGGCTGGGGC GCGGGCGCGG CCGGCGGGGC CCTGACCCAG
GCCGGCTTCT ACAACACCCC CTCGATCAAG GACGAACTGA ACGCCGTCCG TCTTAGCGCC
AAGCGCGACC TGGCCTGGGG TCCGATCAAC AAGGTCGAGT TCGGCTACAA CGCCAGCCGC
CGCGAGAAGT CCAAGCAGGT GCACGAAAGC TTCCTGACCT TCGGCGGCCG CATCGCCGAC
GGCGCGCCTC AGAGCCGCGC CATCCCGAAG GAAGCCCTGC TGGGCACGGT CAGCCTCGAG
CTGATCGGCA TCAAGGCCAT GCTGGCCTAT GACCCGACCT ACCTGCTCGA CAACGGCTAC
TACACGCTGA TCGCCGACCA GAACCCGGCC GTTCAGACCC GCAACTGGGC GGTGAAGGAA
GACGTCCAGA TCGCCTACGC CAAGTTCAAC ATCGACAGCA CGGTCGGTTC GATCCCCGTC
ACCGGCAACA CCGGCCTGCA GGTTGTCCAT ACCGACCAGT CCTCGACCGG CACGCGCATC
AACCCCGCCG ACACCGCCCA CCCGGCCAAC AACGACGGCG GCGCCAAGTA CACCTATGTG
CTGCCCAGCC TGAACCTAAC GTTCGACCTG AGCAACGAGA CCTTCCTGCG GTTCGGCGCC
GCCCGCACCC TGGCGCGGGC CCGGATGGAC GAACTGCGCG CCAGCCAGTC GTTCAACATG
AACGCCGGCA ACCTCACCTC GACCGACCCG AACAACGCGT ACTTCAGCAC CGACGGCGGC
AATCCGCAGT TGCGCCCCTA CATCGCCGAC GGCGTCGACG TCTCGCTCGA GAAGTACTTC
GGCCGCTCGG CCTATATCTC GGCGGCCGGC TACTACAAGA AGATGTCGAA CTTCGTGAAC
TCCAGCTCTT CGCACCTGGA GGACTTCTCG GCCTTCAAGC CGCTGCTCAG CCCGGCGCAA
CAGGCGGCCC TAGGCACGAC CCAAGGCGTG GCCAAGGGTC CCGAGAACGG CAAGGGCGGC
TACATCCGCG GGATCGAACT GTCGGCCTCG ATCCAGGGCG ACATCTTCTA CGAGCCCCTG
CGCAACTTCG GCCTGATCAT CAGCGGCTCG TACACCGACA GCTCGGTCAA GCTGGACGAC
AACTTGCCGG CGATCGACAT GCCCGGCCTG TCGAAGAAGG TGATCAACAC CACCTTCTAC
TACGAGAACA ACGGCTTCAA CGCCCGGATC AGCAACCGCT ATCGCAGCAA ATTCCTGGGC
GAAGTGGCCG GCCTCAGCGC CGCGCGGATC TATCGGACCG TCGACACCGA GTCGGTGCTC
GACGCCCAGA TCGGCTACGA GTTCCGCCAG GGACCGCTCG AGGGCCTGTC GATCCTGCTG
CAGGCCAACA ACATCACCGA CGAGCCGTTC AAGACCTACG AGAACGGCGA TCCCCGCCGG
ACCATCGACT ACCAGAAGTA CGGCTCCACC TACATGGTCG GGGCGTCCTA CCGGTTCTAG
 
Protein sequence
MTNQFMPRRA RLMSGAATGL ALLLTTAGLA HAQDKAAPAN DTVEEITVTG IRAGIENAIA 
LKKSSSSIVE AVSAEDIGKL PDTSIAESLA RLPGLTAQRL DGRAQSISIR GLGPDYNTVL
LNGREQVSTG DNRGVEFDQY PSEILSGVLV YKTPDAALIG QGLAGTADLQ TIRPLKYGKK
VLSANARYEF NSEDKLNPDA KDKGHRFSAT YVDQFMNDTL GVAIAVSDIS TPTQSRRFNA
WGYPTTGTGD LVIGGAKPYV QSNNLKRTGV IGVLEYSPTD KFHTSLDLYY SKFREKQILR
GIELPLFWSS ATLQPGSTAN GGLITNGVYS GVKGVMRNDL NTRHTTLKSA GWNIAYDTDN
GWTLAADLSH SEAKRTDVIF ESYAGTGPSG VGATDTMTFH TTPGAGTTFG STLDYTNSTL
FMLTDPQGWG AGAAGGALTQ AGFYNTPSIK DELNAVRLSA KRDLAWGPIN KVEFGYNASR
REKSKQVHES FLTFGGRIAD GAPQSRAIPK EALLGTVSLE LIGIKAMLAY DPTYLLDNGY
YTLIADQNPA VQTRNWAVKE DVQIAYAKFN IDSTVGSIPV TGNTGLQVVH TDQSSTGTRI
NPADTAHPAN NDGGAKYTYV LPSLNLTFDL SNETFLRFGA ARTLARARMD ELRASQSFNM
NAGNLTSTDP NNAYFSTDGG NPQLRPYIAD GVDVSLEKYF GRSAYISAAG YYKKMSNFVN
SSSSHLEDFS AFKPLLSPAQ QAALGTTQGV AKGPENGKGG YIRGIELSAS IQGDIFYEPL
RNFGLIISGS YTDSSVKLDD NLPAIDMPGL SKKVINTTFY YENNGFNARI SNRYRSKFLG
EVAGLSAARI YRTVDTESVL DAQIGYEFRQ GPLEGLSILL QANNITDEPF KTYENGDPRR
TIDYQKYGST YMVGASYRF