Gene Caul_0516 details


Gene Information

Locus tag: Caul_0516
Symbol:
ID: 5897971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 562872
End bp: 565193
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 63%
IMG OID: 641560999
Product: TonB-dependent receptor
Protein accession: YP_001682148
Protein GI: 167644485
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
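
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(562872, 565193)
print(length_bp)                  # 2322
print(protein_length(length_bp))  # 773
```

Both values agree with the Gene Length and Protein Length fields of the record.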


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.756879
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTTC GTAATTGGCT CATGGTCACG GGCGGCGCAT CGGCGCTGCT GGCCTGCGCA 
TCGACGGCCC TCGCCCAGAC TTCGGCGGGC GAGCGGGCTG CGGCGAGCGA CCAGATCGAA
GAGATCGTCG TGACCGCCCA ACGCAGGGCG CAGTCGATCG ATGACGTCGG GATGACCCTG
AATGTGGTGA GCAACCAACA ACTGCAGCGT CAAGGCGTCA CCCAGGCCTC GGATCTCGTC
AAGGTCGTTC CAGGCTTCAC CGTCGCGACC AACAGCGACG GCACACCGAT CTATACCTTG
CGCGGCGTGA GCTTCAATTC GCTCAACATC GGCACCTCGC CCACCGTCAG CGTCTACATC
GACGAGGCCT CGATCCCCTT CTCGATCATG ACCCAGGGCG GCCTGCTGGA TCTGGAACGG
GTCGAGGTCC TGAAGGGGCC CCAGGGCACC CTTTATGGAC AAAACGCCAC GGGCGGCGCG
ATCAACTACA TCACCGCCCA GCCCACCGAA ACGCTCGCTG GCGGCGTCAA GGCGAGCTAC
AGCCGGTTCG ATACCTTCCA GACGGAGGCC TTTGTCAGCG GTCCCGTCAC CGACACGCTG
AACGCCCGCC TGGCGGTGAG CGGCGTGCGG TCAGGGCCCT GGCAGAAAAG CGCGACGCGG
GACGACGAGC TGGGCGATCA GCGCAAGCTG GCCGCGCGCC TGATCCTCGA CTGGCGACCT
GTCGAGGCCC TTCGCTTCAA TCTCAATGTC AATGGATGGG GCGATTGGTC CGACACCCAG
GCCCCGCAAT TTATCCAAGC CAAGCCAAAC GCCCCGGCCT TCGCGTCTCC CCTCCTCAAC
ACCATCGCGC CCAACGACAC CAGTGCTCGG CTAGCTGATT GGGATGCGGA CAAGACGTAT
CGCAAGAACA ACAAATTCTA CCAGGCAGCC TTGCGCGGCG AGCTAAAGAT TTCCGATCAC
GTGCAGCTGA CCTCGCTGAC CGACTATACC TATATCAGCG TGCACTATCG CAACGACAAT
GACGGATCGT CCTATTTCGT TTCGAACATG GAGAACGACG GCTTTGCGCG CGCCTGGAAT
CAGGAACTTC GCCTGTCCGG AGACGTGGTG GACGGTCGCC TCAAGTACAT CGTCGGCGCC
AGCTACCAAC AAGACGACTC CCTGGAGGAC AACACCTTCA CGGGCGCCTT GATTTCGTCG
CAAGAGACGC CATTCGGTCG CTTCGCCGCC GCCCAGGCGC ACGGCCGCCA AGCCAATCGG
GCGCAAGCCG GATTCGCCAA TGTGGATTTC GAGCTCACCA AGCGCCTGAC GCTTTCGGGC
GGCGCCCGCT ACACCGAGGT CAAGCATGAC ACCGAAAGCT GCACACGCGA CGCCGGCGAT
GGCGAGCTGG CGGCGGTCGC AACCCAGCTT AGCGCCTATC TGCGCAGCCT CGCGGGCCTA
CCGCCGGGCG CGACGATCCC TCCGGGCGGC TGCGTGAGCA CCGGCCCGGA CCTTCTCCCC
TATCACCAGG TGGAGTCGTT CAAGGAACAC AACGTCTCTT GGCGTCTGAA CCTGAACTAC
GAACTTAATC CCGACGCCCG CCTCTACGCC ACCGCCTCGC GCGGCTACAA GGCCGGCAAC
TATCAGGTGG CCGTGAACTC CAGCTACGTA TCCTTCCCCC CGGTTCACCA GGAAGAGCTG
ACCGCCTATG AAGTCGGCGC GAAACTGAAG CTTCTGGATC GCCGGCTGGC TCTGAACGCT
GCGGTTTACT ACTACGACTA CCAAGATAAG CAGTTGCTGA CGCTGACGCA GGATCCGGTC
TTTGGCCTGC AGTTCTCGCT GGTCAACGTG CCCAAGTCGT CCGTCAAAGG CTTCGACGCC
GACATCACCT GGCTCCCCGT CCGTGGCCTG ACGATCCGCG CCGCGACGAC CTATGCCGAC
AGCGCGATCG ACAAGTTCCA GGGCTTCGAC GTGTTTGGCG CCCCCGCGGA CCTTTCGGGC
AAGGCGTTCA ACCTCACTCC CAAATGGATC GGCGTCGGCG ACGTCGAATA CCGCCACGAC
CTCAGCGGCG AATATGAAGG CTTCGTCGGC GCCAGCTTGA ACTACAACAG CAAGACCTAC
GCCGATATCG CCGGCAGCGA TGTCCTGGGC ATTGGGGCCT TCACCCTGGT TGACCTTCGC
GCCGGGGTCT CCTCCACGGA CGGACGTCGC GAGGCCATGG TGTTCGTCAA GAACGCCACT
GACAAATATC ACTGGAGCTA CGCCCAGCCC GGCGGCGACA GCGTCCTGCG CTATGCATCC
CAGCCCAGGA CCTTCGGGAT CACGCTCTCC CAGCACTTCT GA
 
Protein sequence
MKLRNWLMVT GGASALLACA STALAQTSAG ERAAASDQIE EIVVTAQRRA QSIDDVGMTL 
NVVSNQQLQR QGVTQASDLV KVVPGFTVAT NSDGTPIYTL RGVSFNSLNI GTSPTVSVYI
DEASIPFSIM TQGGLLDLER VEVLKGPQGT LYGQNATGGA INYITAQPTE TLAGGVKASY
SRFDTFQTEA FVSGPVTDTL NARLAVSGVR SGPWQKSATR DDELGDQRKL AARLILDWRP
VEALRFNLNV NGWGDWSDTQ APQFIQAKPN APAFASPLLN TIAPNDTSAR LADWDADKTY
RKNNKFYQAA LRGELKISDH VQLTSLTDYT YISVHYRNDN DGSSYFVSNM ENDGFARAWN
QELRLSGDVV DGRLKYIVGA SYQQDDSLED NTFTGALISS QETPFGRFAA AQAHGRQANR
AQAGFANVDF ELTKRLTLSG GARYTEVKHD TESCTRDAGD GELAAVATQL SAYLRSLAGL
PPGATIPPGG CVSTGPDLLP YHQVESFKEH NVSWRLNLNY ELNPDARLYA TASRGYKAGN
YQVAVNSSYV SFPPVHQEEL TAYEVGAKLK LLDRRLALNA AVYYYDYQDK QLLTLTQDPV
FGLQFSLVNV PKSSVKGFDA DITWLPVRGL TIRAATTYAD SAIDKFQGFD VFGAPADLSG
KAFNLTPKWI GVGDVEYRHD LSGEYEGFVG ASLNYNSKTY ADIAGSDVLG IGAFTLVDLR
AGVSSTDGRR EAMVFVKNAT DKYHWSYAQP GGDSVLRYAS QPRTFGITLS QHF
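
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It assumes the sequences are stored in blocks of ten characters separated by whitespace, as on this page, and it uses the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and this is not necessarily how the database computed its values.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGAAGCTTC GTAATTGGCT CATGGTCACG"
print(translate(fragment))  # MKLRNWLMVT
```

The output matches the first ten residues of the protein sequence. Applying `gc_content` to the full gene sequence would give a value to compare with the GC content field. Note that this sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the initiation position; the gene here starts with ATG, so that case does not arise.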