Gene Caul_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4016 
Symbol 
ID5901478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4349536 
End bp4352583 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content68% 
IMG OID641564537 
ProductTonB-dependent receptor 
Protein accessionYP_001685639 
Protein GI167647976 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG GGCTTACGGG CGTATCGAGC GTCGCGCTGA TGATCGGGGT CGCCGGCGTG 
GCGCTGCCGG CATGGGGCCA GGACGCGCAA CGGCCGGCGG ACACGGCCAC GCAGGTCGAA
GAGGTCGTCG TCACCGGTTC CAACATCCGC GGTTCGGCGC TCGACAACGC CCTCCCGGTG
GAGGTCTATT CTCAACAGGA TCTGGAAAAG CAGGGCTCGC CCACGGCGCT GGAGTTCGCC
AAGAGCCTGA CCATTTCGGG TCCGACGACG GGCGAGTCGT ACTATTTCGG CGGTCCGGCC
CTGGTGGGTT CGGTGAACTA CAATCTGCGC GGCCTGGGCG CCGACAAGAC CCTGGTCCTG
CTCAACGGCC GGCGGATGAA CCAGAACACC GCCAACGTGC CGTCGATGGC CCTGGCCCGC
ACCGAGATCC TCAAGGACGG CGCGGCGGTG ATCTACGGCG CCGACGCCAC CGGCGGCGTC
GTCAACTTCA TCACCCTGGA CCACTTCACG GGCCTGCAGG CTCAGGGCCA GTACAAGCAG
ATCAAGGGCT CCAAGGGCGA CTATTCGGTC GGCGTCATGG CCGGGATCGG CGAGGATCGG
GTCAACCTGC TGGTGTCGGC CGAGTACGAG CACCGCTCGC GGCTGGGCAC GCTGGAGCGC
GACTTCACCA AGCCGTCGCT GACGCCGGGG GCGGGCTATA ATCCGGCGCC CTGGTCGACC
CTGACCAACC TGACGGGCTG GCTGCCGCGC GGCGCCCTGC CCGCCGTTCC CAGCGCCACG
GATGTCGGCG AGTGGGGCGC GGCGGTCGGC GGGATCGTCT CCGACTTCAC CGCCTCCAGC
TGCGCGGCGG TCGGCGGTCG GCCCGACAAC GCCTTCACCT GCGCCTACAA CTACATTCCC
TACTACCGGC TGGTGGAGAA CCAGGACACC TATCGGCTCT ACGCCCAGCT GAAGGCCGAC
ATCACCGACA AGATGAAGTT CCACGCCGAC GCCTCCTACG GGCGCGTGAC GCTGCCGCAG
GTGATGGGCT CGCCGGCCCA GCCCGTGACC CGCGGCCCCG CCCTGACCAC CGGGGCGGTG
AACCAGTTCT ACGTGCCAAT CACCAACCCG TTCGCGGCCG AGTTCGCCGC CGCGAACGGC
ATCGTCGGCG CCCAGGGCTT CACGCCGATC ACCTACCGCC TGTTCGGCCA CGGCGGTAAT
CCCTACTATT CGGGCGGGGA CGGCTTCGGC GTCGCCGACC GGATCGACAA CAAGGTCTGG
CGCATCTCGG GCGGGATCAC CGGCGACCTC GGCGACCTGG CGACCTTCGC CAAGAAGGTC
GGCTACGACT TCGCCCTGAC CTATAACGAC GCCTACAACT ACAACACCCA TGCCGATACG
ATCGGCTATC GCCTGGAAGA GGCGCTGAAC GGCTTCGGCG GCCCCAACTG CCATGCCGTC
GACCTGGACC CGTCGCGCTT CGGAACCCAG AACGCCGCCG CCGCGGGCAA GAACGGCTGC
ATGTGGTGGA ACCCCTTCTC CAGCTCGTTC AAGGGCCAGC CGGTCCGCGG CCTGGCCAAC
CCCAACTACA TCGCCGGCCA CGAGAACCCG CAGGACCTGA GCCTGTGGAT GTTCGATCCG
CGCGCCGTCG AGACCCGGAG CAACAACTTC ACCGCCGACC TGGTGTTCAA CGGCATGTCC
GGCCTGACGC TGCCTGGCGG CGAAGTGGGC TGGGCCCTGG GCGCCCAGTA CCGGACGTTC
AAGAGCCGCC AGACCGTCAC CAGCCTGTTC AACAACGGCA CGGTTCAGTG CGAATGGCCG
CACGGCACCA CCAGCGCCAA CGGCGCGGGC TCGCCGAACC TGGAGGCCAA CCCTACGCCG
ACGAATGACC CGAACTTCCG GGGCTGCACG CCCGACGCCC CGGGTCCGTT CGTCCTGTTC
GCGCCCAGCA TTCCGGCCCA GGCCGACCAG AGCCAGTATT CGCTGTTTGG CGAACTGCAG
GTGCCGGTGC TGTCGAACCT CAGCTTCCAG CTGGCCGCCC GCCGCGAGCG GTTCTCCAAC
GATCTGGGCG CGACGGTCTA CAAGGTGTCG GGCAAGTGGA ACGTCTGGGG TCCGCTGACC
TTGCGCGGGT CGTACGGCAC CAACTACCAG ACCCCGCCCC TGGGCGTGAC GCCCGGCGCC
GTGACCATCG CCGCGCGCAC CTACACGGTG GCGGCCAGCA ACTGGCTGGC GGCTCAGTTC
GTCACCGACG CCGACCTCAA GCCCGAGACC GCCAAGACCT CGAACCTCGG CGCCATCTGG
CAAAGCCGGG GCTTGGCGGA TGATCACAAC TTCCGCCTGA TCATCGACTA TTTCGACATC
CGGACGAAGG ACCAGATCGG CCAGGTCGCC GACCCCAACC AGATCGCCAG CCTGGTGTTC
AACGGCGCGG GCGGCACGAT CACCACCTGC GACCCGGCCA AGCAGCCCCT GCTGGCCCGC
ATCACCTTCA ACGCCGGCTG CGCGGTGGGG ATGAGCGGCG TCGGGACCTT CTCCGCCGTT
TCCACGCGCT ACGGCAACGG GCCGGGCCAG ACGACCAAGG GCTTCGACAT CCAGGCCAAT
TACGGCCTGC CGCTGGGTCC CGGCGATCTG GACGTCAACC TGACCGCCAC CCGGGTGACC
GAGTTGCGCA CCGGGGCCAC CACCCTGGAC GGCGTCGTGA TCTCCACCGG CGACGACCGC
CTGGGCACGC TGAACTTCGC GACCTTCGCG CAGGCGGCTC CGAAGTGGCG CGCCAACCTG
GGCGTCAACT ATCGCCTGAA CCGCCAGAAC TTCCGCCTCG GCGTGAACTT CGTCTCGGCC
GTCCAGGACG AGCGGGCCGG CGTCCAGTAC GGCGAGGACG GCGAGGACTG GGTCACCGCC
GACTTCACCT ACCGGATCGA GCTGAACGGC GACATGGCCC TCACCGCCAC GGTCGCCAAC
ATGTTCGATC GCGACCCGCC GCCGGCCCAG GAAGAGTTCG GCTACGATCC GTGGACCGGC
AATCCGCTAG GCCGGACCTT CGAGATCGGC TTCAAGAAAT CGTTCTAA
 
Protein sequence
MKIGLTGVSS VALMIGVAGV ALPAWGQDAQ RPADTATQVE EVVVTGSNIR GSALDNALPV 
EVYSQQDLEK QGSPTALEFA KSLTISGPTT GESYYFGGPA LVGSVNYNLR GLGADKTLVL
LNGRRMNQNT ANVPSMALAR TEILKDGAAV IYGADATGGV VNFITLDHFT GLQAQGQYKQ
IKGSKGDYSV GVMAGIGEDR VNLLVSAEYE HRSRLGTLER DFTKPSLTPG AGYNPAPWST
LTNLTGWLPR GALPAVPSAT DVGEWGAAVG GIVSDFTASS CAAVGGRPDN AFTCAYNYIP
YYRLVENQDT YRLYAQLKAD ITDKMKFHAD ASYGRVTLPQ VMGSPAQPVT RGPALTTGAV
NQFYVPITNP FAAEFAAANG IVGAQGFTPI TYRLFGHGGN PYYSGGDGFG VADRIDNKVW
RISGGITGDL GDLATFAKKV GYDFALTYND AYNYNTHADT IGYRLEEALN GFGGPNCHAV
DLDPSRFGTQ NAAAAGKNGC MWWNPFSSSF KGQPVRGLAN PNYIAGHENP QDLSLWMFDP
RAVETRSNNF TADLVFNGMS GLTLPGGEVG WALGAQYRTF KSRQTVTSLF NNGTVQCEWP
HGTTSANGAG SPNLEANPTP TNDPNFRGCT PDAPGPFVLF APSIPAQADQ SQYSLFGELQ
VPVLSNLSFQ LAARRERFSN DLGATVYKVS GKWNVWGPLT LRGSYGTNYQ TPPLGVTPGA
VTIAARTYTV AASNWLAAQF VTDADLKPET AKTSNLGAIW QSRGLADDHN FRLIIDYFDI
RTKDQIGQVA DPNQIASLVF NGAGGTITTC DPAKQPLLAR ITFNAGCAVG MSGVGTFSAV
STRYGNGPGQ TTKGFDIQAN YGLPLGPGDL DVNLTATRVT ELRTGATTLD GVVISTGDDR
LGTLNFATFA QAAPKWRANL GVNYRLNRQN FRLGVNFVSA VQDERAGVQY GEDGEDWVTA
DFTYRIELNG DMALTATVAN MFDRDPPPAQ EEFGYDPWTG NPLGRTFEIG FKKSF