Gene Caul_5246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5246 
Symbol 
ID5897364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp177923 
End bp180211 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content66% 
IMG OID641555349 
ProductTonB-dependent receptor 
Protein accessionYP_001676680 
Protein GI167621895 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.751713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGGA ACACCAAAAC GCCGCGACGC GCCCTGTTGG CGACTTCGCT TCTCGCGTCG 
GGGCTGGCCA TCGCCGGGGC CGCTCACGCC CAGACGACCG CGGCGCCCTC GGGCCCCGAG
ACCGTGACGG AAATCATCGT CACGGCCAAC CGTCGGTCCG AGTCCGTTCA AGCGATCGGA
CAGAGCGTTT CGGCCCTGAC GGCCGAGAGT CTCGAGCGCG CCGCGGCGAC ATCGTTCTTC
GATTTCGCCA CGGCCATCCC CAACCTGTCG TTCGGCGCAG CGGCGGAAGG CACGACCAAT
TCGCGGTCGA TCGCCATACG CGGCATCGCC GACCGCAACA CGACCGGCTT CTATATCGAC
GAAACCCCCC TGCCCGACTC GCTGGATCCC AAGATCATCG ATGTCGCCCG CATCGAAGTG
CTGCGCGGTC CTCAGGGGAC GCTGTACGGC GCCCGATCGA TGGGCGGCAC GGTGCGATTG
ATTACCGAGC AGCCGAAACT GGACGATCCC AGCGGACGCC TTCATGTCAG CCTCTCCAAC
GTGCGATCCG CGGCCCACGG CAACTTCATG GTCGACGGGG CGCTGAATGT ACCGCTGCTG
AAGGATCGCG CCGCCTTGCG CATCGTGGCC GTGCATGACC AGGACGCTGG ATATTTCACG
CGAGCGATCG GACCCTACGG CGCGGCGCCG CTCAAAAATC GCGACAACAT CGGCCGGAGC
AGGACCGACG GGATCTCCTT GGCCGGCCTT GTCAAACTCA GCGATCAGCT CTCTGTCACG
CCGCGCGTCT TCTACCAGCG CACGCGCACC GACGGCTTTC CCTTCGCCGA TGTGCCCACG
GCCGCCGGCG GCGCCCAGAC CCGCCTGCAG CCCAGCAGCC TGATCCAGCG ACGCGGGTTC
GACGTGAACG AATATTCCGA CGACCATTGG GTGCTGGGCA CCCTGGACGT CCGCTACAAG
ACGCCGATCG GCGATATCGT CTCGGCCAGT TCGTACTTCC GCCGATACAC CACCAATGTG
GAGGACCAAT CCGACTTCAT CGCCTTCGCG TTTGGCACGC CGCTGCTGCC GACCCAGACG
CAACAGGACA ACCGTATCGA GGACTATACG CAGGAGCTGC GGTTCTCCTC GGACTTCTCC
GGTCCCCTCC AGCTGGTGAC CGGGCTCTAT TACGACCACA AGAACACGCT GCGTTACTAT
CCGCCAAGCT ACGCGCACGG CCTGAACGCC GCGTCTGGCG GGGCGATCGG CTCCGACTTC
ATCTACACCA GCAGCACGCC GACTTTGCAG ACGGAGTACG CCGCCTACGC CGAGGCGACG
TGGTCGGTGA CCGAGCGTCT GAAGCTGGTG GGCGGTTTGC GCGCCTTTGA CGTGGAAACC
GCCGCGTCCA GCCGGGCCGA CGGCCTGGTC ACCGGCGGAC CAACCCGCGT GCCGGCGACC
TCCCAGTCGG AAAACGGCGT CATTCCCAAG CTCTCGGCGC AATTTCGCTT CACGCCCGAC
AACCAGATTT ACGTGACCGC GGCCAAGGGG TTCAGGCCTG GCGGCGTCAA CGGAGTCGTA
CCGACCGCGC TGGGATGCGC GGCCGACCTG GCCGCCCTGG GACGTACGCC GCAAAGCGCC
GCCTTCTACC AGTCCGACTC GGTCTGGAGC TACGAGGTTG GGTCCAAGAA CAGCTTCCTG
GATCGTCGCC TCACCGCCAA CGTCAGCGTC TTCCGCATCG ACTGGAGCGA CATCCAGCAG
CAGATCGTGC TGCCGTGCGG CTTTGGCTTC CGCGGCAACG CCGGCTCGGC CCGCAGTCAG
GGCGCCGAGC TGGAGACCAC CTGGCGGCCG GTTCGCGATC TGACGATCTC GGCCGGGGTC
GGCTATACCG ATGCGGTGTT CACCAGCACG GCGGCCGGGA CGCGCTTTCG AGACGGCGAT
CGCGTGCCGC AGGTGCCGCG GTACACGGCC AACCTTTCGG GAGACTACCG CTTCGCGCTA
CCGCGCGGCC TGACCGGCTT CCTCTACGCC GACACCAAGT ACGTCAGCAG CAGCACCACG
GCCTTGAACG CCGGCGCCAA CGCGCAGGGG GCGCTGGTTC AACGCATCCG CCCCGCCTAC
ACTATCGTGG ACCTTCGCGG CGGCGTCGAG CTTGAGCGCT ACGAGTTGGC GCTGTTCGTC
AAGAACGCCA CGGACGAGCG GGCGAGCCTT GGCGACTCCC TGTCGATCGC CGCTGAGATG
CCTGGGCGCG CTCGCGTCCT GATGAGCCAA CCGCGCGAAA TCGGCGTTGA GGTGCGTGCG
CGCTTCTAA
 
Protein sequence
MIRNTKTPRR ALLATSLLAS GLAIAGAAHA QTTAAPSGPE TVTEIIVTAN RRSESVQAIG 
QSVSALTAES LERAAATSFF DFATAIPNLS FGAAAEGTTN SRSIAIRGIA DRNTTGFYID
ETPLPDSLDP KIIDVARIEV LRGPQGTLYG ARSMGGTVRL ITEQPKLDDP SGRLHVSLSN
VRSAAHGNFM VDGALNVPLL KDRAALRIVA VHDQDAGYFT RAIGPYGAAP LKNRDNIGRS
RTDGISLAGL VKLSDQLSVT PRVFYQRTRT DGFPFADVPT AAGGAQTRLQ PSSLIQRRGF
DVNEYSDDHW VLGTLDVRYK TPIGDIVSAS SYFRRYTTNV EDQSDFIAFA FGTPLLPTQT
QQDNRIEDYT QELRFSSDFS GPLQLVTGLY YDHKNTLRYY PPSYAHGLNA ASGGAIGSDF
IYTSSTPTLQ TEYAAYAEAT WSVTERLKLV GGLRAFDVET AASSRADGLV TGGPTRVPAT
SQSENGVIPK LSAQFRFTPD NQIYVTAAKG FRPGGVNGVV PTALGCAADL AALGRTPQSA
AFYQSDSVWS YEVGSKNSFL DRRLTANVSV FRIDWSDIQQ QIVLPCGFGF RGNAGSARSQ
GAELETTWRP VRDLTISAGV GYTDAVFTST AAGTRFRDGD RVPQVPRYTA NLSGDYRFAL
PRGLTGFLYA DTKYVSSSTT ALNAGANAQG ALVQRIRPAY TIVDLRGGVE LERYELALFV
KNATDERASL GDSLSIAAEM PGRARVLMSQ PREIGVEVRA RF