Gene Caul_0308 details


Gene Information

Locus tag: Caul_0308
Symbol:
ID: 5897582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 345170
End bp: 347545
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 67%
IMG OID: 641560792
Product: TonB-dependent receptor
Protein accession: YP_001681943
Protein GI: 167644280
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
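
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single trailing stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 345170
end_bp = 347545
gene_length_bp = 2376
protein_length_aa = 791

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

length_matches = span_bp == gene_length_bp
in_frame = span_bp % 3 == 0
protein_matches = expected_protein_aa == protein_length_aa

print(length_matches, in_frame, protein_matches)
```

Under those assumptions the three checks agree with the record: the span equals the stated gene length, it is a whole number of codons, and the codon count minus one equals the stated protein length.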


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.264582
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGT CCATCTGGCT GGTTTCGAGC GCGCTGTGTT CGATCCTGGC CGCAGGCCCG 
GTCCTGGCCC AGGACAAGGC GACCGCCGCG ACGGGCGGCC AGACCCTCGA CGAGGTGGTG
GTCACCGCCG AGCGCTTCGG CTCGGGCCTG GCCCGCGCCA CCTTCACCCT CGGCGCTGAG
GACATCCAGG AGCGTCCGCT GGGCGCCGAG ATCACCCAGG CCCTGGTCAA GGTGCCGGGC
GTGCAGGTCT CGACCGGCGA CGCGCGGGGC GGCAGCTTCT CGTTCGAGAT CTACATGCGC
GGTCTCAGCG ACGAGCAGAT CGGCCTGACC CTGGACGGCG TCCCGACCGG CGACTCGCGC
TTCAACGGCG GCTCGCCGCC CGCGCGGTTC ATCGAGTCCA GCAATATCGG CAAGATCACC
GTTTCGCAGA GCGCCGGCGA CATCGGCGCG CCGTCGCGCT TCGCCCTGGG CGGCTTCATC
GACTTCGCCA CCGACGCGCC GCGCCACGAC CTGGGCGCCA CGGTCGAGGC TGGGGTCGGC
TCGTTCGACT TCCGCCGGAT CTATGGCCGC GTCGACAGCG GCGAGATCGC GCCGGGCCTT
TCGGGCTACC TGACCTATTC GCACCAGGAG AACGACATCT GGGCTGGTCG CGAGAGCCGC
GGCTCCGAGC GCGGCCACTA CGAGCTGAAG CTGGTCAAGG ACTTCGACAA CGGCTCGTTC
CTGAAGGCCC GGGTCTCGTA CAACGACCAG ACCGACAATG ACTTCAACAT CGTCACCAAG
GGCGAGTTCA AGGCCGCGCC GCGCAGCGAC CGGGCTCTGG ACGCCATCAC CGGCCTACCG
GCCAAGGACA TCGACTTTGG CGGCGCCCTG GGCGGCTGGC GCAAGGACTG GCTGACCTAT
CTGAACGGCC ACTTCAAGCT GAACGACGCG CTCAGCCTCG ACGTCAATCC GTACTACCAG
ACCCTGAATG GGGAATCCTT CCGCTACCAG GACCGCCAGC GGATCCTGAC CGGCGGCGAT
CCGCGCGCCG TGACCGGCTA CAACGCCAAC GGCGGCGCCA TCCGCCCGGC CCTGTCCACC
CTGCGCAACA GCAATGTCGT GGGCGGCCCG GCCGACATGC GGGTCACCCC GCGCGAGCGC
GACCGTTACG GCGTGACCGG CGAGATCAAG GCGTCGAACG TCTTCGGCTC GGGCCACAGC
CTGCGGGTCG GCGGCTGGTG GGAAGGCGGC GAGTCCACCG AGAAGCGCAA CTTCTTCCCG
ATCATCGACT CGTCCAGGAG CATCGCCTAC GACCGCTCCA AGCTGAACTA TGTCGAGTAC
GAGCGCACGG CCTCGGTCGA GACGACCATG CTGTACGCCC AGGACGAGTT TCGGGCCTTG
GACGACAAGC TCAAGGTCGA CCTGGGCCTG ACCTGGTACG ACGTCAAGTA CGACGCCAAG
TCGCCGCTGG AGTACAAGGC CAACGTCAAG TTCTCGCAGC ATTCGGAGGT CAATCCGAAG
CTCGGCGCGA CCTATCAGCT GGCGCCGGCC TGGGAACTGT TCGGCGGCTA CGCCAAGAAC
TTCGCCGGCA TCCCGGAAGA CGCCTTCCTC GGCTCGACGG CAGTGATCAG CCCAAAGGAC
CTGGACCCGG TCGAGACCGA GAACCTGGAC CTGGGGCTGC GCTATGTGAA GCCGAACATG
GCCTTCTCGA TCCAGGCCTA TGACGTGGAC CTGAAGAACA ATGTCGGCAT CGTGCCGCGC
GATCCGACCG CGGCCCTTGA CCCCGACGAA GTGGTCCGGG GGAATGTCGC GACCAAGGCG
GTCAATATCG CCGGCATCAA GACCAAGGGC GTGGAGCTGA CCGGCTATTA CGACTTCGGC
GCCTTTGACC TCTACGGCGC CTATTCGCGC CAGGACGCCA AGCACGACAA CCCGGCCGTC
GGCAGCGCCG CGCGCAAGGC CCTGGCGGCG GTGGCGGTGA TCGGCGGGGC GGGCGTGCGA
GACATCCCCA AGAACAGCTT CTATGGCCAG GTCGGCTGGA AGCCGCTGGA GGGGCTGAAG
CTGGACGCCA ATGTCCGCTA TGTCGGCGAC CGTGTCGGCG GCCACATCGT CGCCCCGACC
ACCTTCCAGG AGATCGGCGT CGAGATGATC GACGGCTACG CCCTGGTCGG GCTGACGGCG
ACCTACGATC TCAAGCGGGC CGGCGTTCCC GACCTGCGGT TCCAGCTCAA CGTCGATAAC
CTATTCGACG AGGAATACAT CGGCGCGGTC AGCGGCTCGA CCGCCACCCA ACCGGAGTTC
GGCTACACGG TCGCGACGCC GAACGCCCGC ACCCTGGATC GCTACTTCAT CGGCGCGCCG
CGCACCTACA CCCTTTCGGT GCGGACCCGC TTCTGA
 
Protein sequence
MKTSIWLVSS ALCSILAAGP VLAQDKATAA TGGQTLDEVV VTAERFGSGL ARATFTLGAE 
DIQERPLGAE ITQALVKVPG VQVSTGDARG GSFSFEIYMR GLSDEQIGLT LDGVPTGDSR
FNGGSPPARF IESSNIGKIT VSQSAGDIGA PSRFALGGFI DFATDAPRHD LGATVEAGVG
SFDFRRIYGR VDSGEIAPGL SGYLTYSHQE NDIWAGRESR GSERGHYELK LVKDFDNGSF
LKARVSYNDQ TDNDFNIVTK GEFKAAPRSD RALDAITGLP AKDIDFGGAL GGWRKDWLTY
LNGHFKLNDA LSLDVNPYYQ TLNGESFRYQ DRQRILTGGD PRAVTGYNAN GGAIRPALST
LRNSNVVGGP ADMRVTPRER DRYGVTGEIK ASNVFGSGHS LRVGGWWEGG ESTEKRNFFP
IIDSSRSIAY DRSKLNYVEY ERTASVETTM LYAQDEFRAL DDKLKVDLGL TWYDVKYDAK
SPLEYKANVK FSQHSEVNPK LGATYQLAPA WELFGGYAKN FAGIPEDAFL GSTAVISPKD
LDPVETENLD LGLRYVKPNM AFSIQAYDVD LKNNVGIVPR DPTAALDPDE VVRGNVATKA
VNIAGIKTKG VELTGYYDFG AFDLYGAYSR QDAKHDNPAV GSAARKALAA VAVIGGAGVR
DIPKNSFYGQ VGWKPLEGLK LDANVRYVGD RVGGHIVAPT TFQEIGVEMI DGYALVGLTA
TYDLKRAGVP DLRFQLNVDN LFDEEYIGAV SGSTATQPEF GYTVATPNAR TLDRYFIGAP
RTYTLSVRTR F
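
Both sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The sketch below illustrates this on the first line of the gene sequence and checks that its first ten codons translate to the first block of the protein sequence. The helper name `unblock` and the partial codon table are illustrative assumptions; the table holds only the ten codons needed here and is not a full implementation of translation table 11.

```python
# First line of the gene sequence and first block of the protein sequence,
# copied from the record above.
gene_line = "ATGAAGACGT CCATCTGGCT GGTTTCGAGC GCGCTGTGTT CGATCCTGGC CGCAGGCCCG"
protein_block = "MKTSIWLVSS"

# Partial codon table (assumption: only the codons used by the first ten residues).
CODONS = {
    "ATG": "M", "AAG": "K", "ACG": "T", "TCC": "S", "ATC": "I",
    "TGG": "W", "CTG": "L", "GTT": "V", "TCG": "S", "AGC": "S",
}


def unblock(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split())


dna = unblock(gene_line)

# Translate the first 30 bases (ten codons) in reading frame 1.
first_codons = [dna[i:i + 3] for i in range(0, 30, 3)]
translated = "".join(CODONS[codon] for codon in first_codons)

print(translated == protein_block)
```

For the full sequence, a complete codon table for translation table 11 would be needed; the partial table here is deliberately limited to what the first block requires.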