Gene Caul_0591 details


Gene Information

Locus tag: Caul_0591
Symbol:
ID: 5898046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 650054
End bp: 653152
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 63%
IMG OID: 641561073
Product: TonB-dependent receptor
Protein accession: YP_001682222
Protein GI: 167644559
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
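
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caul_0591.
start_bp, end_bp = 650054, 653152
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 3099 1032
```

Both results match the Gene Length (3099 bp) and Protein Length (1032 aa) fields listed in the record.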


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.880997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAAAT ATTGGCTGTG CAGCGCGGCT GCAGTCTTCA TCGCGGGGGC GGCCATGGCA 
CAGACCGCGC CGATCGATTC GGTCGAGGAA GTCGTGGTGA CCGGTTCGCG GATCGCGCGC
GCCGGTTTTG ACACGCTTGA GCCAGCAACT ACCCTTTCCG TCGAGCAGCT GCAGAACCGC
AATGCGACGA ATGTCGGCGA AGTGCTTCTG CAGGTGCCTG GCTTTTCGAT GGGTCAGACG
ACGACAGGCG CGCAGTCGGA ATTTTCGCCC GGCGCGTCCT ATGTCGATCG GTTCGGGCTC
GGGTCTTCGC GCACCCTCAC GCTGGTGAAC GGTCGACGTT TCGTCTCGAC CAATCCGCCG
TCGACCCAAA ACGGTCAGAC CCCCGGCAAT CAGGTCGACC TCAACGTCAT CTCGCCGCTG
ATGGTCGAGC GGATCGAGAA CCTGGCGATC GGCGGCGCGC CAACCTATGG CACTGACGCC
ATCGCCGGCG TGGTGAACAT CATCCTGCGC AAAAAGTACG ACGGCGCCAT GGCCAACGTT
CAGGCCGGCG TCACCGAGCT GGGCGACAAC GAGCGAATTG CGTTTGCGGG CCTAATCGGC
CGCAACTTCG CCGACGGCCG CGGCAACATC ATGCTGGCGG CATCCTATGA CAAGTCCGAT
CCCGCCGACT ATCGCAAATG GCAGAGCCCT CAGAACTTCT TCGTCGGTAA CCCCCTGGCG
ACGTCGGCGG CGGCCAACAT TCCCGGTCGT ACGCCCGGGA ACGATGGCCG GATCAACCCG
AACGTGCCCT TCAACACCGG ACCGGCCGAC GGGATTCCCA ATTCCGTCCT GATCCACGAC
TACACCATTC CCTCCGTGAG CCGCGGCGGG ATCATCCTGC CGGTGGGCAC GATCCAGCAG
GCGAATTTCT ATCCGACGGG GTTTGGCGCC AACGGTCAGA CACTCATCCA GTTTGGCCCG
AACGGCGATA TCGTACCCTT CAATCCCGGA TCGCCCTTCG CGCCGACCTT TGCCTCCGGG
GGCGACGGCT ATCGTAACGC GATCGCCAAC GTGGTGGCCG GCACGAAGCG GAAGACCGTC
AACCTCAACG CGTCCTACGA TGTCACCGAT AAGATCTCGG CCTTCTTCGA GGGGAGCTAC
TTCAACGGCG GCGGCACGCT CAGCAACGCG ACGCCGAAAT ATTTCTCGCG CATCTTCGGC
GGCTCGCAAG TCGTCATGGG CCCGTTGTTG GCCTCGATCA ATGACCCGCG TCTGACGCCC
CAAGCCAAGG CGACGCTGCA GTCTCTGGGC GTCCAGAATT TCGAGATCTC GAAGGTCACG
CAGGAAATCG GCCAAGCGTA TCCGTCGACC GACAACTCGG TCTATCGCGG CGTCGTCGGC
CTGAGCGGCC AGTTTGCCGC GGTGGGCCGG ACCTTCCGGT TTGACGCGTC GCTAAATCGC
GGCCGGACCG AGGGCCACGC GTTCAAGACG AGCGTGATCC AGCAGAACTT CGTCAACGCG
ATGAACGTCA AGCTCGACGC GTCCGGCAAG ATCGTCTGCG ATCCCAATCC CACGCAACTG
GCCACGGGCG GCGCCGTCAA ACCTATCGCC GATGCGGCCT GCGTGCCTCT AAACCTGTTC
GGCGCCAATC AGGTGACCGA CGCCGCCCGC GCCTACGTCG AAGCCCGCGC CGAGGCCAAG
TCCCAGCTTG ACCAAACCGA CGTCCTGGTC AACTTCGGCT CGTCGAATCT GTTCTCGCTC
TGGGGGGCTG AGCCGGTCGG CTTCAGTGTC GGCCTGGAGT ACCGCAAGGA GTCTGGCGAG
TTTAATCCGG ATCCGCTGCA GGCCTCCGGG CGGACGCAGG AGGCGCTCGT CAGGCCGGTC
GCAGGCGAAT ACACGACCAA GGAGGTCTTC GGCGAAGTGC TTGTGCCCCT GGCCTCGCCT
GGGCAGTCCA TTCCGCTGAT CGACACCCTC GAGTTCGAAG GCCGCATACG CTACGTCGAC
AACTCGCTCA CCAACGGCTT CACCGCCTAC ACCTACGGCG GGCGTTATCG GCCGGTTCCA
GACATCGAAT TGCGGGGCAA CTTCACCAAG TCGCTGCGCG CGCCGTCCAT CGCCGAACTG
TTCACGCCGG CCTCGGTGGG CTCAGGCTTC TTCCCCGATC CCTGCGACGT CCGCAACATC
ACCTCTGGTC CAAACCCGAC GGTCCGCCAG AAGAATTGCG CGGTGTTCTT CAAGGCCTAC
GGGATCACCG ATCCGACGAG CTTTTTCTCA ACGACCGTCG GTGTCGCCAT CCCGATCCAG
CTTGGCGGCA ATCCCAAGCT CCAGAATGAG ACCGCCAAGT CCTACACCTA CGGCGTCGTG
CTGCGCCCGC GCTTCCTGCC GAAGTTCCAG GCGGCCATCG ACTGGAATCG TATCCTGGTG
AACGGCAATA TCACCGCCCT GACCTCGCTG GACATCGCTC AAGGGTGTTA CGACGACCCG
GACTTCAACG CGGCCAATCC GGACGCGGGC AACGCTTTCT GCTCGCTCTT CAGGCGGACG
AAAGGCGGCC CACAGAACGG TCAGCTCGTG GTGGATCCGC AAAATCCCGG GCTCTCGAAC
CAGTTCGTCA ACGGGGCTTC AATTCGCTTC CAGGGGCTGA CGGTCGACGC CGCCTATCGC
GACATTCCGA TCAAGGCGGC GTTCGTGGAG GGCTCGCTTC GGATCGACGC CAGATTCTAC
TACCTCGACA AGCTCTGTAC CTCCAACAAT GGGGTCACCA CCATCTGCCT GCAGGGCACG
CACACGCAAC CGCGCTACAC GGCGCAAGTC GACGCTACCT ATGTTCAAGA GCGCTTCGCT
CTGAACCTCC AGGCTAACTA TCGGCCCTCG ACGCAATACG ACCTGCTTTT CACCGAGGAG
AATCAGGATG TCCTGAAGCG GGGTTCTCAG GTCCTGTTCA ATCTTGGGGC CAGCTACAGG
CTCGGTGAGA ACACTCAGAT CCGGGGCGCC ATTCAGAACC TGCTGGATTC GTCTCCGCCC
GGCCCGATCG CCGGTTTCAA CAACTCCTTC GGCAACACCA CGGCGGTCGG AGACATCCTG
GGCCGGCGGT ACTCCGTGGC CGTGACCCAC ACCTTCTAA
 
Protein sequence
MRKYWLCSAA AVFIAGAAMA QTAPIDSVEE VVVTGSRIAR AGFDTLEPAT TLSVEQLQNR 
NATNVGEVLL QVPGFSMGQT TTGAQSEFSP GASYVDRFGL GSSRTLTLVN GRRFVSTNPP
STQNGQTPGN QVDLNVISPL MVERIENLAI GGAPTYGTDA IAGVVNIILR KKYDGAMANV
QAGVTELGDN ERIAFAGLIG RNFADGRGNI MLAASYDKSD PADYRKWQSP QNFFVGNPLA
TSAAANIPGR TPGNDGRINP NVPFNTGPAD GIPNSVLIHD YTIPSVSRGG IILPVGTIQQ
ANFYPTGFGA NGQTLIQFGP NGDIVPFNPG SPFAPTFASG GDGYRNAIAN VVAGTKRKTV
NLNASYDVTD KISAFFEGSY FNGGGTLSNA TPKYFSRIFG GSQVVMGPLL ASINDPRLTP
QAKATLQSLG VQNFEISKVT QEIGQAYPST DNSVYRGVVG LSGQFAAVGR TFRFDASLNR
GRTEGHAFKT SVIQQNFVNA MNVKLDASGK IVCDPNPTQL ATGGAVKPIA DAACVPLNLF
GANQVTDAAR AYVEARAEAK SQLDQTDVLV NFGSSNLFSL WGAEPVGFSV GLEYRKESGE
FNPDPLQASG RTQEALVRPV AGEYTTKEVF GEVLVPLASP GQSIPLIDTL EFEGRIRYVD
NSLTNGFTAY TYGGRYRPVP DIELRGNFTK SLRAPSIAEL FTPASVGSGF FPDPCDVRNI
TSGPNPTVRQ KNCAVFFKAY GITDPTSFFS TTVGVAIPIQ LGGNPKLQNE TAKSYTYGVV
LRPRFLPKFQ AAIDWNRILV NGNITALTSL DIAQGCYDDP DFNAANPDAG NAFCSLFRRT
KGGPQNGQLV VDPQNPGLSN QFVNGASIRF QGLTVDAAYR DIPIKAAFVE GSLRIDARFY
YLDKLCTSNN GVTTICLQGT HTQPRYTAQV DATYVQERFA LNLQANYRPS TQYDLLFTEE
NQDVLKRGSQ VLFNLGASYR LGENTQIRGA IQNLLDSSPP GPIAGFNNSF GNTTAVGDIL
GRRYSVAVTH TF
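
The gene and protein sequences above are linked by translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to do both, under stated assumptions: the sequences are pasted in as strings with the 10-character spacing shown above, and the codon assignments are those of the standard genetic code, which table 11 shares (table 11 differs in its set of permitted start codons; this gene starts with ATG, so that difference is not handled here). All names are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments (assumption: alternative start codons are not handled).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAGGAAAT ATTGGCTGTG CAGCGCGGCT GCAGTCTTCA TCGCGGGGGC GGCCATGGCA"
print(translate(first_line))  # prints: MRKYWLCSAAAVFIAGAAMA
```

The printed residues match the first two blocks of the protein sequence (MRKYWLCSAA AVFIAGAAMA). Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.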