Gene Caul_2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2400 
Symbol 
ID5899855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2609677 
End bp2612691 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content65% 
IMG OID641562891 
ProductTonB-dependent receptor 
Protein accessionYP_001684025 
Protein GI167646362 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.204664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCAA CGGGCAGTGA GCCTTCGCGC CCCAACGCCA AGGGCGCCGA GGTCTCCATG 
CAGGACGCCG CGGCGGCAAC GAACAGAGGA CGTTTAGTCC GGCAAATGAG ACGCGAGGCT
TACGGCTCGG CCGGAGCGGC TGCTCTGCTC GTCAGCCTGA TCGCCACACC CGGTCTGGCG
CAGACCGTTC CCGCGTCCAA GCCGGAAGCT GTCACGGTCG GCGAGATCGT CGTCACCGGA
ACGCGCATTC AGACCACGGG CTTCACCGCG CCGACCCCGA CCTCCGTGAT CGGCGAATCG
CAGATCCAGA ACAACGCCCA ACCCAACGTC TTCGCCACCA TCGTCCAGTT GCCGTCCTTG
CAAGGCTCAA GCGGATCGGC GACGAACACC TTCAGCACCT CCAGCGGCCA ACAGGGACTC
AGCTCGTTCT CGCTTCGCGG CCTGGGCACG ATCCGCACCC TGACCCTGCT CGACGGCCAG
CGCGTGGTCG GGGCCTATTA CACCGGCGTC ACCGACGTCA GCCTGTTCCC GCAACTTCTG
ATCCAGCGCG TGGACGTGGT GAGCGGCGGC GCCTCGGCCT CCTATGGGTC CGACGCGGTG
GGCGGCGTCG TGAACTTCAT CACCGACACC CGCTTCAATG GGTTCAAGGG CAACGTGCAG
GCCGGTGTCA CGACTTACGG CGACGACGAG CAAGGTCTCG TCCAACTGGC GGCGGGCCGC
AGCTTCTTCA ATGATCGCCT GCACGTGGTC GGCAGCGGCG AATGGGCCAA GGAGGACGGC
GTTGGTCCTG GCGGCTTTGG CCTGGACTTG GCCGGCGATC GCGACTGGTT CACCCAGACG
ACGATGATAA ATCGCAACGT CAACAACGAC GGCGCGCCGC AGTACGTGAT GCGCGACTTC
GCTCAACCCT ACAACTACAC CAAATACGGC CTGATCTCGG CGGGTCCGCT GCAGGGCACA
GCCTTCGACC AGAGCGGCCA ACCGTTCCAA TTCCAGTACG GCTCCAACGG CGTCCCCACC
AAGAACGCCT CGGGCGCCGT CACCGGTTGC TTCCCCGGCT TCTGCGTCGG CGGCGACCTG
TCTGGCAATG TCGACAGCGG TCGCACGCTC CAATCGGCGA TCGAACGCCG AGTGGCTTAT
GGGCGGGTCG GCTATGACTT CGCCGAGAAC AACGAGGCCT ATGTCTCCTT CAACCTGGGA
CAGGTCAAAA CCAGCAATCA GCCCGTAAAT GGGGAAAATC GCCCGGGCCT CACCTTGCAG
TGCGCCAACC CGTACGTGCC CGCCTCGGTG CAGGCCGCCT GCGCGACCGC GGGCGTCACC
AGCTTCCAGT TCGGCACGAG CAACGCGCTG CTGCCCAATA CCGAGGTCCA CACCGACCGG
CGCCAGTATC GCGTGGTCAC CGGGCTGAAG GGGAAGTTCG CCCTCCTCAA CTCCGACTGG
ACCTACGACG CCTATTACGA GCACGGTGTG AACAAGACGG CGATCGACGT CGATCACATC
CTTCTCACCC CGCACTACAA CCAAGCGATC CAGGCGATCA CGCTCAACGG CGTGATCGCT
TGCGCCGATC CCGTGGCGCG GGCCAGCGGC TGCCAGCCGC TCAACATCAT TGGCGGCAAG
CCCCCGTCAG CGGCGGCTCT GGCCTATGTC CAGCCGGAGA ATGGCCCGTT CCAGCGTCTG
CGCATGACCC AGGACGTGGC CAGCCTCGCC TTCTCGGGCG CGCCGCTCAA CCTGTGGGCC
GGTCCGCTGT CGGTGGCGTT CGGGGCCGAA TATCGCCGGG AGTTCTACAC CGTTCGCGCC
GACGCCTACG GCGCTGGCGT GTCGGGCCGC AGCCCCAACA CCGCCGAGTT CCCGGCCGAC
CCTGTGCTCC TGCCGGGCGG CAACAACTGG TACGCGGGCA ACTACAAGAA CGGCAACGGC
GCCTACAACG TCAAGGAAGC CTTCCTCGAG CTTGATCTTC CGCTGTTCGA CTCCGACGCC
CTGGGTCGCG CCAACCTCAA CGGCGCGGCG CGGGTGACCG ATTACAGCAC CTCGGGGACC
ATCTGGACAT GGAAGGCGGG CGGCACCTGG GACACGCCGA TCAAGGGCCT GCGCCTGCGT
GGCGTGACCT CGCGAGACGT GCGCGCGCCC AACCTGTCGG AACTGTTCGC CGCGCCGGTG
ACGACCACGC TGCCCAACTT CCTCGACCCT GTCCGCAACG TGAACGTGGT GGCGATCCAG
AACGCGGTCG GCAACCCCGA CCTGACGCCG GAAATCGCGC GCAACACCTC GTTCGGGGTG
GTCCTGGCCA ACCCGTCGTG GCTGCCCGGC TTCAGCGCCT CGTTCGACTA CTACAAGATC
AAGGTCGATG ACGTGATCTC CAGCCTGGGC GCGGCTCAGA TCGTCGACCT ATGCTACCGC
AACATCCTGC CGGAAACCTG CGGGGCCTAT AACCTCAACA ACACCAGTGG CCCCAACTAC
ATCAACGTCC AGGCGTTCAA CCTGGCTTCG ATCAAGACAA GCGGCTTCGA TATCGAGGCC
AGCTATCGCT GGCGGCAGCC GCTGGGCCTG CCGGGCGCCT TCACCGTGCG CGCGCTGGCG
ACGCATATCC GTGAATTCAT CACCGATACG GGTCTGCCGG GCACGGCTCC CACCGACTCG
GCCGGCGTCA ACACCGGTGC GACGCCGGAC TGGAAGTGGC TGGCGATCCA GACCTATGAG
GGCGACCGGT TCAGCCTGAC GGTGCAGGAA CGTTGGTTCA GCGATGGCAA TTACGGCAAC
CAGTATGTCG TCTGCGCCGC GGGCAGTTGC CCCGTCTCGA CGGCGATCGC GCCCACCATC
GACAGCAACT CCATGCCGGG GGCGTTCTAT CTGGATGTCG GCGGCACCTA TAATATCCGC
AAGGACGTCA CGGCCTATTT CAAGGTCGAC AACGTCTTCG ATCACGACCC CGCCAAGTCG
CCGCAGTACG CCAATCCGGC GCTCTACGAC ATCGTCGGCC GCATCTATCG CGGCGGCGTT
CGCTTCCGCT TCTAG
 
Protein sequence
MFSTGSEPSR PNAKGAEVSM QDAAAATNRG RLVRQMRREA YGSAGAAALL VSLIATPGLA 
QTVPASKPEA VTVGEIVVTG TRIQTTGFTA PTPTSVIGES QIQNNAQPNV FATIVQLPSL
QGSSGSATNT FSTSSGQQGL SSFSLRGLGT IRTLTLLDGQ RVVGAYYTGV TDVSLFPQLL
IQRVDVVSGG ASASYGSDAV GGVVNFITDT RFNGFKGNVQ AGVTTYGDDE QGLVQLAAGR
SFFNDRLHVV GSGEWAKEDG VGPGGFGLDL AGDRDWFTQT TMINRNVNND GAPQYVMRDF
AQPYNYTKYG LISAGPLQGT AFDQSGQPFQ FQYGSNGVPT KNASGAVTGC FPGFCVGGDL
SGNVDSGRTL QSAIERRVAY GRVGYDFAEN NEAYVSFNLG QVKTSNQPVN GENRPGLTLQ
CANPYVPASV QAACATAGVT SFQFGTSNAL LPNTEVHTDR RQYRVVTGLK GKFALLNSDW
TYDAYYEHGV NKTAIDVDHI LLTPHYNQAI QAITLNGVIA CADPVARASG CQPLNIIGGK
PPSAAALAYV QPENGPFQRL RMTQDVASLA FSGAPLNLWA GPLSVAFGAE YRREFYTVRA
DAYGAGVSGR SPNTAEFPAD PVLLPGGNNW YAGNYKNGNG AYNVKEAFLE LDLPLFDSDA
LGRANLNGAA RVTDYSTSGT IWTWKAGGTW DTPIKGLRLR GVTSRDVRAP NLSELFAAPV
TTTLPNFLDP VRNVNVVAIQ NAVGNPDLTP EIARNTSFGV VLANPSWLPG FSASFDYYKI
KVDDVISSLG AAQIVDLCYR NILPETCGAY NLNNTSGPNY INVQAFNLAS IKTSGFDIEA
SYRWRQPLGL PGAFTVRALA THIREFITDT GLPGTAPTDS AGVNTGATPD WKWLAIQTYE
GDRFSLTVQE RWFSDGNYGN QYVVCAAGSC PVSTAIAPTI DSNSMPGAFY LDVGGTYNIR
KDVTAYFKVD NVFDHDPAKS PQYANPALYD IVGRIYRGGV RFRF