Gene Caul_0484 details

Gene Information

Locus tag: Caul_0484
Symbol:
ID: 5897939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 524875
End bp: 527046
Gene Length: 2172 bp
Protein Length: 723 aa
Translation table: 11
GC content: 66%
IMG OID: 641560967
Product: TonB-dependent receptor
Protein accession: YP_001682116
Protein GI: 167644453
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
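
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(524875, 527046)
print(length_bp)                  # 2172
print(protein_length(length_bp))  # 723
```

Under these assumptions the computed values match the Gene Length (2172 bp) and Protein Length (723 aa) fields.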


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCTC GATACGTATT GCTTGCATCA TCCGGCGTCC TGGCGCTGGC TCTGGCTTCG 
ACGGCCGTCG CCCAAACCAC GCCGGCGCCC TCTGAGGCCT TCGCGCTGGA TGAGGTGGTG
GTCACCGCCC AGAAGCGTTC GGAGAATCTG CAGGACGTGC CGCTGACCCT GTCGGTGTTC
GGCGCCAAGG AGATCGAGCA GGCCCGCATC GTCCAGGTGC AGGACGTGGC CAACCGCACC
CCGGGGCTAA ACTTCGACAC CTTCCCCAGC AGCCAGCCGC GGCCGACCCT GCGTGGCATC
GGCTCGTCCG ACCGCGGCGC AGCCGGCGAC CCCTCGGTGT CGGTGTTCAT CGACGAGGTC
TATTACGGCC GCCCCTCGGC CGTGGCCTTC GACGCCTTCG ACGTCGAGCG CATCGAAGTG
CTGAAGGGGC CGCAAGGCAC GCTGCTGGGC AAGAACGTGG TTGGCGGCGC GCTGAGCATC
ATCAATCGTA AGCCGCAAGC CGACGGTTTC GACGCCGGGG CGTCGGTTAC CGTTGGGAAC
TACTCGCGGC TCGAAGGCGC TGGATTCGTC AACGCCCCGC TGGCTGACGG AAAGGCCGCC
GTCCGCCTCA GCCTCAGCTC GCGCAATCAC GACGGCTATA CCAGGAACGC CTATGCCGGC
GGACGGGCCG AGGATCAGGC CACCGACAGC GGTCGTCTGC AGTTGCTGCT CCAGCCCAAC
GACGCCTTCT CGGCGATCCT GGCCGTGGAT GGCACGCGTG ACCGCGGCAA CAGCGACGCC
CGCCACGCGG TGCAGGTCGA TCCCAGCGTT CCAGCCAGCG CCCTGTGGCG CATCAAGGGC
CAGCGCGACG AGTACTATGC CGAGAACAAC GGCCGCAACG ATCGCGACAC CTGGGGCGTG
CGGGCCAATC TCAACTGGGA CCTGTCGGCC TTCGCGCTGA CCTATGTCGG CTCGTATCGC
GAGTTGACCT ATCACTACGC CGAGGACTTC GACGGCGGCA ATCCGACCCT CTACGCCTTG
AACTTCCGCG GCGGCCAGGA CGAGAAGAGC AGCTTCTACA GCCACGAACT GCGCGCCTCG
GCGCTGCCCA GCTCCAAGCT GCGCTGGGTG GTCGGCGCCT ACTATTTCGG AGCGGACACC
CAGCGGACAG ACAGCCTGCT GGCCGACATA AAATCGCGCC CGACCCCGCC GGCCACCTAT
GCGGCCAAGG ACCTCTTCCA CCAGGACGCC CAGACGGACA GCTGGGCCCT GTTTGTCGAC
GCCACCTGGC CGGTCACCGA CCAGATCAAC GTCTTCGGCG GCCTGCGCTA TTCCAAGGAC
CAGAAGGACT ACGCGCTCAA CAATCTGGAC TCCACGGCGC TGCTGCGCGC CGCGACTCGC
TACAGCATCG ACGCCAGCCA CGACTGGGAC AAACTGACCT GGAGGCTCGG CGGCGACTTC
TCGCCAAGCG ACAACGTCAT GCTGTACGGT GTGGTCTCGA CGGGCTTCAA GAGCGGCGGC
TACCAGGACA CGCCGTCTAC GGCGACCTCG GCGACCACGC CGTTCAATCC CGAGAACGCC
ACCCTCTATG AGCTGGGCGC CAAGACGACC CTGCTGGATC GCCGGCTGAC CTGGAACACC
TCGATCTACC AGACCGACTA CAAGGACCTG CAGGTGCGCA GCACCCGGGG CTTCGACACC
ATCACCACCA ACGCTGGCTC CGCGCGCATC CGCGGGCTGG AGACCTCGCT CGACGCGCGG
CTGGGCGAGC GCCTGCGGGT GGCGGCCAGT TACGCCTATA CCGACGGCAA GTTCCTCAAC
CTGGTCGACC GCGGGCAGGA TCGTTCGGGC AACCACATGA CCCGCAGCCC CAAGCACAAG
GTCACCCTGT CGCCCAGCTA CGCCCAGCCG CTGAGCTCGG GAGCGCGGCT GACGGGGGCG
GTCGATCTGG CCTACGAAAC CAAGATCTGG GACGATATCG ACAACAACCC GATCACCGTC
CGCAAGCCCA AGCTGCTGGC TGATGGACGT GTCGTCTACG ACAGCGCGGG CGGGCACTGG
AACCTGTCGG TCTGGGGCAA GAACCTGACC GATCGGCTCT ATCGCACCCA CCAGGTGGTG
TTCCAGGGCG CGGACCTGGC CACCTTCGGC GCGCCGCGCA CCTTCGGCGC CACCCTGAAC
TGGAAATACT AG
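
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the sequence are used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGAATGCTC GATACGTATT")
print(len(sample))                   # 20
print(f"{gc_fraction(sample):.0%}")  # 35% for this short sample, not the whole gene
```

Applying the same two functions to the full 2172 bp sequence gives a figure that can be compared with the GC content field of the record.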
 
Protein sequence
MNARYVLLAS SGVLALALAS TAVAQTTPAP SEAFALDEVV VTAQKRSENL QDVPLTLSVF 
GAKEIEQARI VQVQDVANRT PGLNFDTFPS SQPRPTLRGI GSSDRGAAGD PSVSVFIDEV
YYGRPSAVAF DAFDVERIEV LKGPQGTLLG KNVVGGALSI INRKPQADGF DAGASVTVGN
YSRLEGAGFV NAPLADGKAA VRLSLSSRNH DGYTRNAYAG GRAEDQATDS GRLQLLLQPN
DAFSAILAVD GTRDRGNSDA RHAVQVDPSV PASALWRIKG QRDEYYAENN GRNDRDTWGV
RANLNWDLSA FALTYVGSYR ELTYHYAEDF DGGNPTLYAL NFRGGQDEKS SFYSHELRAS
ALPSSKLRWV VGAYYFGADT QRTDSLLADI KSRPTPPATY AAKDLFHQDA QTDSWALFVD
ATWPVTDQIN VFGGLRYSKD QKDYALNNLD STALLRAATR YSIDASHDWD KLTWRLGGDF
SPSDNVMLYG VVSTGFKSGG YQDTPSTATS ATTPFNPENA TLYELGAKTT LLDRRLTWNT
SIYQTDYKDL QVRSTRGFDT ITTNAGSARI RGLETSLDAR LGERLRVAAS YAYTDGKFLN
LVDRGQDRSG NHMTRSPKHK VTLSPSYAQP LSSGARLTGA VDLAYETKIW DDIDNNPITV
RKPKLLADGR VVYDSAGGHW NLSVWGKNLT DRLYRTHQVV FQGADLATFG APRTFGATLN
WKY
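
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough check, the sketch below translates the first and last codons of the gene with a hand-built codon table. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons. Alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAATGCTC GATACGTATT GCTTGCATCA"))  # MNARYVLLAS
print(translate("TGGAAATACT AG"))                     # WKY
```

Both results agree with the start (MNARYVLLAS) and end (WKY) of the protein sequence listed in the record.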