Gene Caul_2123 details

Gene Information

Locus tag: Caul_2123
Symbol:
ID: 5899578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2287471
End bp: 2290248
Gene Length: 2778 bp
Protein Length: 925 aa
Translation table: 11
GC content: 64%
IMG OID: 641562612
Product: TonB-dependent receptor
Protein accession: YP_001683749
Protein GI: 167646086
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
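
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2287471, 2290248)
    print(length)                  # 2778
    print(protein_length(length))  # 925
```

Both results match the Gene Length (2778 bp) and Protein Length (925 aa) values listed in the record.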


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.154655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC AGAACACGAA CGGCCGCCGC GCGCGCCGCA TGGCCTGGCT GATGACCGGG 
TGCGCCGCCA TCGGGCTCTC CGCCGCGACC GGCGCTCAAG CCCAGACGTC GCGGGCCGAG
CCGAACGATA GCGTGGAGGA AGTCGTCGTC ACGGGCAGCT ACCGCCGCAG CCTTGAGAAG
GCCGTGGATA TCAAGCGCGA CACTGTCGGC TTTTCGGACT CGATCGTGGC GACCGACGTC
GCCAACTTCC CGGATCAAAA CCTGGCCGAA GCGCTGCAGC GCATTCCCGG CGTGACGATC
GAGCGCAACA AGGGCCTGGG CGGCCGTGTC AGCGTTCGCG GCCTGCCCAG CGAGTTCACC
TTCGTCACCA TCAACAATCT GGCGACCGCC TCGGGCAGCG GCGGTCGTGA CGTCGAGTTC
GACATCTTCG CCTCGGAAAT CATCCAGCAG GTCACGGTCC AGAAGTCGCC CCGGGCGGCG
GACGAAGAAG GCGGTATCGC CGGCGCGATC AACATCTCGA CAACCCGTCC GTTCGACTAC
AGTGGCCGCA AGCTGATCGC CTCGACCGAG GGGGCCTATA ACTCGATCTC CAAGAAGACA
GACCCCAAGG TCTCGTTTCT GGCCAGCGAC ACGTGGGGGG ACTGGGGCGG CCTGGTGTCG
TTCTCGGCGG CGCGCCGCAC GAACCGCACC GACTCCAACT CGGGCATCAA CTTCCGCCCG
ATGTTCCGCT TCCTCGAAGC GGGCGGTGCG CGCGCCTCGC AGGCGGCCGC CGTCCTGGCC
CGCGACGCCG GCGTGATCGT CAAGAGCAAC ACCGACCGCA ACGAGACTGG CCGCATCATC
TTCCAGGACA AGGTCGGCGA CCGCGCCTAC CTGAACACCC AGGACCAGTG GGGCGGCACC
GCTTCGCTGC AGTACAAGCC CTCGGCCAAT TTCGACATCG CCTTCGACCT GATGCTGGGC
GGCTATGACG CGACCGAGGA CCAGTACGAC GCGGCCGCTT ATTCGGCCTC AAGCAAGAGC
ACGTTGGAGA CCATCCACAG CTACGACAAG ACCACCCTGG CCGACTACAA CATGGTCGTG
CTGCGCGACG TCTCCTACAC CGCGACCCAG CACGAGATGC TCAGCAAGGA GCAGATCAAC
AAGACCGACT ACGCCCAGTT CGGCTCGGAC CTGAACTGGC GCGGCGAGAC CTGGAAGCTG
CACGCCCTGG CCGGCTATTC GGGCGCCAAG AAGACGCTCG ACTATTCGAA CCTGAAGCAC
GTGGCCTACG CCCCGTCGCG CACCCGCTGG ACGGCCACCG GCGGCGAGAC GATCAAGAGC
GCCAACCCGG CCTCGATCGA CATGTACAAC TCGCCTTCGA AATATCTGTT CGAGGCCTAT
GAGACGACCC TCGAGAAGAT CACCGATGAC AAGTACGCGG CTCAGGTAGA CTTCACCAAG
GACTTCGCCT TCGACTTCTT TCCCGCGCTC AAGACCATCC AGATCGGCGC TCGCCACACC
GACAAGTCGA AGGAGCGCCA GTACGGCGCC CTGAACATCC AGGGGCCGGG TCCGGGCAGC
ACCGCCTATC TCAACACCCG CACCATGGCC GACAGCCCGC TGACCCCGAT CGGCGATCTG
GTGCCGGGCG GCGACTACAC GGTCCGCGAT ATCACCTGGA GCCAGATCTC GAACGATTAC
GCGCGCAAGA CCTTCCGCTA CGCCGGCTTC ACCACGCCGT TCACGCCGGG CGACTACTAC
AAGGTCGATG AGAAGGTCAC GGGCCTGTAC GCCATGGCCG ACCTGGGCTT CGACGTCGGT
CCCGTGCCGG TGGCGGTGAA CGGCGGCGTT CGCTACGTCG ACACCTCGAT CACCTCGTCG
GGCTATCATC AGATCCAGAA GCCGAATGGC TCGACGGGCT ACACCCAGGC GCCGGTGTCA
AGCGACGGCA GTTATAACAA GTTGCTGCCC AGCCTCAACG TCACCGCCGA GCTGACTGAT
AGCATCGTGC TGCGCGCCGC GGCGTCCAAG ACCCTGATGC GTCCGGCCCT GACGGACCTG
GCCTACAAGC GTACGGCCAG CTTCAACTCG TTCCGCTTCA CCGACGGCAA CCCGAACCTC
AAGCCGACCT TCGCCGAGCA GTATGAAGTC GGCCTTGAGA AGTACCTGCC GGAAGGCGGC
CTGCTGGCCG TCTCGTACTT CAAGAAGAAG ATCGAGGGCG TCGTCCGCCA GGCCCTGACG
GGCACGGTCA AGGGCGTCAC CAAGTACAAC GCCAACGGCA CGATCGACGG CGTCTACGAC
TTCGACGTCT ACCAACCGAT CAACGCCGCG GGTTCGTACA ATGTCGACGG CGTCGAGCTG
GTCGCCATAG TGCCGTTCGG CCTGCTGTGG GAGCCGGCCA AGGGCTTTGG CGTCAACGCC
AACTACACGA TCCTGGACAG CTCGCTGAGC GGCCAATCGA TCATCGGCGT CCCGACCCCG
CCGGTGGGCC TGGCCGACAA GGCTTACAAC TTCACGCTCT ACTACGAGAA CGACAAGTTC
CAGGCCCGCG TGTCCTATAG CTACAAGGGC AAGTATGTCG AAGGTATCGG CTACGAGATG
TATCCGATCT GGCGCTCGGG CTTCGGCCAG ACCGACATCT CGGTCAGCTA TAACATCAAC
GAGCGCCTTC AGTTGAGCCT GGAAGGGATC AACGTCACCG ACGAGGTCAC CAAGGGCTAC
ACGATGGATC CGTCGTTCCC GACCATGTAC GAGAAGTCCG GACGGCGCTT CTCGCTTGGC
CTACGGATGA ACTTCTGA
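
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below is one possible way to do that and to compute the length and GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the input contains only the letters A, C, G, and T.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = "ATGACCGCAC AGAACACGAA CGGCCGCCGC GCGCGCCGCA TGGCCTGGCT GATGACCGGG"
    print(len(clean_sequence(first_line)))  # 60
    print(gc_fraction(first_line))
```

Applied to the full gene sequence, the cleaned length should equal the 2778 bp given in the record, and the GC fraction can be compared with the listed GC content of 64%.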
 
Protein sequence
MTAQNTNGRR ARRMAWLMTG CAAIGLSAAT GAQAQTSRAE PNDSVEEVVV TGSYRRSLEK 
AVDIKRDTVG FSDSIVATDV ANFPDQNLAE ALQRIPGVTI ERNKGLGGRV SVRGLPSEFT
FVTINNLATA SGSGGRDVEF DIFASEIIQQ VTVQKSPRAA DEEGGIAGAI NISTTRPFDY
SGRKLIASTE GAYNSISKKT DPKVSFLASD TWGDWGGLVS FSAARRTNRT DSNSGINFRP
MFRFLEAGGA RASQAAAVLA RDAGVIVKSN TDRNETGRII FQDKVGDRAY LNTQDQWGGT
ASLQYKPSAN FDIAFDLMLG GYDATEDQYD AAAYSASSKS TLETIHSYDK TTLADYNMVV
LRDVSYTATQ HEMLSKEQIN KTDYAQFGSD LNWRGETWKL HALAGYSGAK KTLDYSNLKH
VAYAPSRTRW TATGGETIKS ANPASIDMYN SPSKYLFEAY ETTLEKITDD KYAAQVDFTK
DFAFDFFPAL KTIQIGARHT DKSKERQYGA LNIQGPGPGS TAYLNTRTMA DSPLTPIGDL
VPGGDYTVRD ITWSQISNDY ARKTFRYAGF TTPFTPGDYY KVDEKVTGLY AMADLGFDVG
PVPVAVNGGV RYVDTSITSS GYHQIQKPNG STGYTQAPVS SDGSYNKLLP SLNVTAELTD
SIVLRAAASK TLMRPALTDL AYKRTASFNS FRFTDGNPNL KPTFAEQYEV GLEKYLPEGG
LLAVSYFKKK IEGVVRQALT GTVKGVTKYN ANGTIDGVYD FDVYQPINAA GSYNVDGVEL
VAIVPFGLLW EPAKGFGVNA NYTILDSSLS GQSIIGVPTP PVGLADKAYN FTLYYENDKF
QARVSYSYKG KYVEGIGYEM YPIWRSGFGQ TDISVSYNIN ERLQLSLEGI NVTDEVTKGY
TMDPSFPTMY EKSGRRFSLG LRMNF
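
The relation between the gene and protein sequences can be illustrated with a short translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the sketch builds the standard codon table directly. The function name `translate` is illustrative, and the code assumes an in-frame sequence containing only A, C, G, and T.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence shown above.
    print(translate("ATGACCGCAC AGAACACGAA CGGCCGCCGC"))  # MTAQNTNGRR
    print(translate("CTACGGATGA ACTTCTGA"))               # LRMNF
```

The two fragments reproduce the first ten and last five residues of the protein sequence above.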