Gene Caul_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1080 
Symbol 
ID5898535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1142135 
End bp1144948 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content63% 
IMG OID641561562 
ProductTonB-dependent receptor 
Protein accessionYP_001682708 
Protein GI167645045 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.72185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.584922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAAT CTCTGCGTAT GCGGAGCCTG CTGGCCATGG GCGCGTCCAT GACAGTCTTG 
GCCGCCGTCG CCGCCCCAGC CTTCGCCCAG ACCACGCCGG CGCCCCAGCA AGGCGGCGGC
AATGTGCTCG AGGAACTGGT CGTCACCGCC CAGAAGAAGG AAGAAGCCCT TCAGGACGTG
CCGATCGCGG TGTCGGCCTT CAGCCAGAAC AGCCTCGAGG CGCAGAAGAT CGACGGCGGT
CCCAACCTGC AGCAGGCGAT CCCCAACGTC TCCTTCGCCA AGAGCAACTT CACCAACAGC
TTCAACTTCG CGATCCGGGG CATCGGCAAC AAGGCCGTCG GCGTCTCGAC CGACGGCGGC
GTCGGCGTCC ACGAGAACAA CGCCCCGCTG CAGTCGGGCA ACCTGTTCGA CGCCGAGTTC
TTCGACGTGG AGCGCGTCGA AGTGCTGCGC GGGCCGCAAG GCACGCTGTA CGGCCGTAAC
GCCACCGGCG GCGTGGTCAA TATCATCACC GCCAAGCCTG TCGACACCTT CGAGGCCAAT
GTCCGAGCCG AATACGGCAA CTACAACTCG CAAAAGGTTC GCGGGATGAT CAACATCCCG
ATCCTGGGCG ACAAGCTGGC GATCCGCGCG GCCGGCAACT ACCTCAAGAG AGACGGCTTC
GTCACCAACA CCTTCAACAA CCACAAGGTC GACGACCGGG ATCTGTATTC GACCCGCGTT
TCGGTGATGT TCAATCCCAT CGACTCGCTG CGCACCAACT TCATGTGGGA GCACTTCAAG
GAAGACGACA GCCGGGCCCG CGTGGGCAAG CAGCTCTGCA CCAAGGACCT CGGCCCGGCC
ACGGTGGGCG GCGTCGCCTC CGGTTCGGCC CGCAACTTCC TGACCCAGGG GTGCCTGTCC
GCCTCGCTCT ACGGCGACAG CGCCTACGGC ACGGTCAACA CCTCGGGCAC CCTGACCGGC
GAACTGGGCA ACCTGGTGGG CTTCACCAGC GGCGACGCCA ACGCCGGCGA CACCGCCAGC
CATAACCTGC GCGAAATCGA GTCGCTGCTG GACCCGATCT ACCGGTCCAA GTCCAACATC
TATCAGTTCA ACCTGGCCTA TGACCTGACC GAGAACCTGA CCCTCACGGC CATGACCTCG
TACAGCAAGG GCGATGTCTA CACCAAGCTG GACTACAACC GGAACGTCTC GACGGTGCCG
TTCAACAGCA CCCCCTTCAC GCCGGGCGGC TTCTTCGCCG ATCCGCAGGT GGGCGCCACC
AACAAGTTCA CCACCCTGGA CGTCTCGTCG GGCTGGAGCA AGCAGTGGAG CCAGGAAGTT
CGACTGCAGT CGAACTTCGA CGGCCCGCTG AACTTCAACG TCGGCGGCAT CTGGTTCGAC
TACAAGACCG TGACCGACTA CTACGTGATC GGCAATTCGC TGACCCTCTC GGCCCTGGCC
CTGAACTACC AGAACACCGG CAACCCCAGC TGCAACCCGG TGACCTTGCC GGCCAGCTGT
ATCGGCATCG ACGCCAACGC CACGCCGGAC GGCAGCGGCC ATAACTACTA TGACAATCGC
TCGCCCTATC ACCTGAAGTC CAACGCCATC TTCGGCGAAC TGTACTGGCA GGCCAACGAG
AAGCTGAAGT TCACCCTGGG CCTGCGCCGC ACCCACGACG ACAAGATCCA GAAGAACTAC
GAGACCGTGC TTCTGGCGCC AGGCATCGGC CTGGAGCAGG ACCCGACCAC GCCACAGAAC
CGCACGGTGT TCAACGAGCT GACCGGCCGC TTTGGCTTCG ACTACAAGCT CAGCGACGAC
AACCTGCTCT ACGCCTTCTA TTCCAAGGGC TATAAGGGCG GCGGCTCGAA CCCGCCGGCG
GCGGCGGGTG GCGCTGGCAC GCAGGCCACC TTCGCGCCCG AGTTCGTCAA CGCGTTCGAG
CTCGGTTCGA AGAACACCCT GATGGGCGGC AGCGTGATGC TGAACGCCAC CGGCTTCTTC
TATGACTACC AGGGCTACCA GATCTCCAAG ATCGTCAACC GCACCTCGGT CAACGAGAAC
GTCAACGCCA CGGTCTACGG CCTTGAGCTG GAATCGGTCT GGTCGCCGAT CCACAATCTG
AAGCTCAACG CCAACATAGG CTATCTGCAC ACCAAGATCG GCAGCGGCGT CAGCTCGATC
GACACCATGA ACCGCACCCA GAGCAACCCC GCTTACCAAG TGGTCAAGGC GGGTCCCAGC
ATCCCCGGCG TTGCGGTCGG CTCCAACTGC GTGGTCACCG CGGCGGGCAT CGCCACCGTC
CTCAGCATCA ACCCCGCCTT GGGGGCGACG GTGCCGTTCG CCTGCGGCGG CAAGGCCTTC
TACCAGGGCT TCCTGCAACT TCAGGGCGTG CCAGCTCCGT TCGCGGCGGC CGCCGCCAAC
GCCATGTTCA ACTACGGATC CGGCTACTCG ATCGAAGGCA AGGCCGCGGA CCTGTCGGGT
AACGAACTAC CCAACTCGCC GCACCTGACC GCCTCGGTCG GCGCCCAATA CACCTGGGAT
TTCGCGGACG GCTGGTCGGC CACTCTGCGC GGCGACTACT ATCGCCAGAG CAAGCAGTAC
ACGCGGGTCT ACAACACCAC CTACGACCAG TTGAAGCCCT GGAACAACGC CAACATCACC
CTGAAGATCG AAAAGCCCGA GTGGGGCCTG CAGATCGACG CCTACGTCAA GAACCTGTCG
AACAAGACCC CGATCACCGA CGCCTACACG ACGGACGACA GCTCGGGCCT GTTCACCAAC
CTGATCACCC TGGAACCGCG CCTCTACGGC GTCAGCATCC AGAAGTCGTT CTAA
 
Protein sequence
MSQSLRMRSL LAMGASMTVL AAVAAPAFAQ TTPAPQQGGG NVLEELVVTA QKKEEALQDV 
PIAVSAFSQN SLEAQKIDGG PNLQQAIPNV SFAKSNFTNS FNFAIRGIGN KAVGVSTDGG
VGVHENNAPL QSGNLFDAEF FDVERVEVLR GPQGTLYGRN ATGGVVNIIT AKPVDTFEAN
VRAEYGNYNS QKVRGMINIP ILGDKLAIRA AGNYLKRDGF VTNTFNNHKV DDRDLYSTRV
SVMFNPIDSL RTNFMWEHFK EDDSRARVGK QLCTKDLGPA TVGGVASGSA RNFLTQGCLS
ASLYGDSAYG TVNTSGTLTG ELGNLVGFTS GDANAGDTAS HNLREIESLL DPIYRSKSNI
YQFNLAYDLT ENLTLTAMTS YSKGDVYTKL DYNRNVSTVP FNSTPFTPGG FFADPQVGAT
NKFTTLDVSS GWSKQWSQEV RLQSNFDGPL NFNVGGIWFD YKTVTDYYVI GNSLTLSALA
LNYQNTGNPS CNPVTLPASC IGIDANATPD GSGHNYYDNR SPYHLKSNAI FGELYWQANE
KLKFTLGLRR THDDKIQKNY ETVLLAPGIG LEQDPTTPQN RTVFNELTGR FGFDYKLSDD
NLLYAFYSKG YKGGGSNPPA AAGGAGTQAT FAPEFVNAFE LGSKNTLMGG SVMLNATGFF
YDYQGYQISK IVNRTSVNEN VNATVYGLEL ESVWSPIHNL KLNANIGYLH TKIGSGVSSI
DTMNRTQSNP AYQVVKAGPS IPGVAVGSNC VVTAAGIATV LSINPALGAT VPFACGGKAF
YQGFLQLQGV PAPFAAAAAN AMFNYGSGYS IEGKAADLSG NELPNSPHLT ASVGAQYTWD
FADGWSATLR GDYYRQSKQY TRVYNTTYDQ LKPWNNANIT LKIEKPEWGL QIDAYVKNLS
NKTPITDAYT TDDSSGLFTN LITLEPRLYG VSIQKSF