Gene Caul_2407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2407 
Symbol 
ID5899862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2624525 
End bp2627479 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content63% 
IMG OID641562898 
ProductTonB-dependent receptor 
Protein accessionYP_001684032 
Protein GI167646369 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000117567 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.325532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAG TCCAGCTTAT TTATCAGAGT AATGCGCCCA CGATCCAGCG CGCCTCGAAG 
GCTGCTCTCG TCGGTGCGTC CAGCTTTCTG GCGCTCGCGC TTTGCGGCGC GGCCAACGCC
GCCGACCGGC CGGCCGAGGA GCCAATAACG ACGGCCGCGG CCGTCGCCGC GCCAACCGGC
GAGCAGGCTT CCGAGCCGAA ATCCAGCGGC GGCACGGTCG TCGAAGAAGT GGTTGTCACA
GGCCTACGCG GTTCGCTGCA ACGCAACCTC GACATCAAGC GGACGTCACC GGGCGTCGTC
GACGCCATTT CGGCCGAGGA CATCGGCAAA TTCCCCGATT CCAACGTCGC CGCATCATTG
CAACGCCTGC CGGGCGTCTC CATTCAGCGC GCGGGCGCGC GTGGCGAACC GCAGGGTATC
ACCGTTCGCG GCTTTGGCGG CGACTTCAAC GAGACCCTCT ACGACGGTCG TCGGATCTCC
ACGGCCACGG GCGGCCGCTC GGTGGACTTC AGCACCGTGG GCGCCGACTT CGTCGGCGGC
CTGTCGGTGC TCAAGACACC CGACGTCACA CTCTCGAGTA GTTCGATCGG CGCGACCGTC
AACGTCGCGT TTCCAAAGCC GTTCGATCAT CCCGGCCGGC GCATGGCGTT CACCGCCTCG
GGCTCGCTAC AGGACGACGC GGGCAAGGTG GCGCCCACCG TCGGCGCCCT GTTCAGCGAC
ACCTTCGCCG ATAACCGGTT CGGGATCCTG GTCGATGCGA TGTACACGCG CCACGACACC
CAGACCAACC GGGTCTATGT CAGCGGCTGG CCGGGCGGAC GCTACGCGCC GTGCCAATTG
ACGCCGACCT GCACGCCAAC CCGGTTGGCC GACAAGTCCA TCGTCGGATG GTTCGAGCAA
CAGTATGGCG CAAGCCAGAT CTATACCAAG GACGAGCGTG TCGATGGCCG CATCGCCCTG
CAATGGAGTC CGTCCGAAGA CCTGACGGTC ACCCTGGACG ACAACTACTC GCGTCAGAAC
ATCCGCGCCG ACAATTTCGG CTATGGCATC TGGTTTAATC AGGACGGCCT GAGAAACGTC
AAGCTGGACA AGAACGGGAC GACCGTCGAT TTCACCCAAG CCGGATCTCA GACCGATTTC
GTGGCTGGAA CCGATCGCTC GATCCTCCAG ACCAACCAGA CGGGTCTGAA CCTCAAGTGG
GACGTGTCCC AGAACCTGAA CTTCGAGGCG GACGCCAGCT ACGCCAAGAG CTGGCTCAAC
CCCGGCGGCG AGATCAGCAG CGACAACGCC GACGTGGGCT ACGGCTTCGC CATCGGCCCT
GCTCTGGGCA TCAGCATCGG CGGCGACAGC AAGAACACCC TGCCGGTGTT GCATGGCTAC
GGCCCAAACG GCGACGCCGC GCGCTGGGCC GACACTTCGG TCCTGGGCTC TCACGTCACC
GTGCGCCAAG CTCAGGAAAA CACCGACGTC GTCAAGCAAC TGCGGTTCGC CGGCTCGTGG
GAACAGGAAG GCTTCCGCAT CAAGGCCGGC GGCAGCTATC TGGAAGACCA CTACCAGTTC
CAGCAGAGCA ACACCTTCGT CAACAACTAC TGGCAAGCCT ATCCGGGCTA CGGCCCGCCA
TCGGGTCCCA ACGGCGGTGT CCTGGCGCCG TCCAGCCTGT TCACCGACAA GGTCAGCACC
AACCACTTCA TCCCCGGCTT CTCCGGCGCC CTGCCGCCGA CGCTGTTGAA GTTCGACGCC
CACGCCTATC AGCAGTTCCT CACGGCTCTT GGAAATCCCC AAACCCAGAC TATTCCGGGC
TTCAACTATA GCGGCGGCAA CGTCGGGACC ACGTTCACCG GCGCGTTCAA TCTGGGGCTC
GACAACGGCA GTATTCGCGA CATCACCGAG AAGACCTGGG CGCTGTTCCT GCGGGCCAAC
TTCGACGTCG ACGTGGCCGG CATGCCGTTC CACTTCAACG CCGGCGTACG CGAGGAGAAC
ACTCACGTCA CATCCAACGG CTTTGGTCAG GTGCCCACCG CGATCACCGG CAGCGCCGGC
GATCCGACGC TGCTGACCGT GACCTTGAGC GCGCCTCAGG CCGTATCGAC CAAGAGCAAC
TATTCCTACC TGCTCCCAAG CATCGACCTG AAGCTGGAAC TGACCGAGAG CATCCATCTG
CGCCTCGATG CGTCTCGAAC GTTGACCCGT CCGAGTCTGA ACCTGCTCAC GCCGGTGGCC
AGCGTCGGCA CCGGCCAGCG GGTCGGCGCC CTCACGGCCA GCGGTGGCAG CCCCTCGCTC
AAGCCTTATC TGGCCGACAA TTTCGATGCG GCGGTCGAAT GGTATTACCG GCCCAACTCG
TATGCGTCGG TCAATTTCTT CATCAAGGAC GTCAGCAACT TCATCATCGG CGGCACCCAG
CGACAGACGA TCAACGGCGT CATCGATCCC ACGACCGGCC AGCCGGCGAT CTTCAGCGTC
ACGCAGCAGG TCAACGGTCC GGAAGCGACC GTGCGTGGGG TTGAATTGGC CTGGCAACAC
GTGTTCGGCG ACAGCGGTTT CGGCTTCAAC GCCAACGCCA CCCTGGTCGA CACGAACAAG
CCCTACGATC GCACCGACAT CTCACAAAGC GGCTTTGCGA TCACCGGCCT GGCCGACTCC
GCCAACCTCG TGGCCTTCTA CGACAAGAAC GGTCTCGAAG CCCGGGTCGC GGTCAACTAT
CGCAAGGAGT ACCTGCGAGG CTTTGGTCAG AACCAGAACA CCGGCGCCTT CGGTTCTGAA
CCGACGTTCG AAAATCCGAA CCTGCAGATC GACTTCAGCA CCAGCTACGC CCTGACCAAG
CAGATCAACC TGTTCTTCGA AGCCCAGAAC CTCACCAACG AGACGCAGAG CACGCACGGA
CGGTTCGACA ACCAACTGCT CGACGTATTC GCCTATGGCC GGCGCTACAC CGCCGGCGCA
CGTTTCCGCT TCTAG
 
Protein sequence
MKTVQLIYQS NAPTIQRASK AALVGASSFL ALALCGAANA ADRPAEEPIT TAAAVAAPTG 
EQASEPKSSG GTVVEEVVVT GLRGSLQRNL DIKRTSPGVV DAISAEDIGK FPDSNVAASL
QRLPGVSIQR AGARGEPQGI TVRGFGGDFN ETLYDGRRIS TATGGRSVDF STVGADFVGG
LSVLKTPDVT LSSSSIGATV NVAFPKPFDH PGRRMAFTAS GSLQDDAGKV APTVGALFSD
TFADNRFGIL VDAMYTRHDT QTNRVYVSGW PGGRYAPCQL TPTCTPTRLA DKSIVGWFEQ
QYGASQIYTK DERVDGRIAL QWSPSEDLTV TLDDNYSRQN IRADNFGYGI WFNQDGLRNV
KLDKNGTTVD FTQAGSQTDF VAGTDRSILQ TNQTGLNLKW DVSQNLNFEA DASYAKSWLN
PGGEISSDNA DVGYGFAIGP ALGISIGGDS KNTLPVLHGY GPNGDAARWA DTSVLGSHVT
VRQAQENTDV VKQLRFAGSW EQEGFRIKAG GSYLEDHYQF QQSNTFVNNY WQAYPGYGPP
SGPNGGVLAP SSLFTDKVST NHFIPGFSGA LPPTLLKFDA HAYQQFLTAL GNPQTQTIPG
FNYSGGNVGT TFTGAFNLGL DNGSIRDITE KTWALFLRAN FDVDVAGMPF HFNAGVREEN
THVTSNGFGQ VPTAITGSAG DPTLLTVTLS APQAVSTKSN YSYLLPSIDL KLELTESIHL
RLDASRTLTR PSLNLLTPVA SVGTGQRVGA LTASGGSPSL KPYLADNFDA AVEWYYRPNS
YASVNFFIKD VSNFIIGGTQ RQTINGVIDP TTGQPAIFSV TQQVNGPEAT VRGVELAWQH
VFGDSGFGFN ANATLVDTNK PYDRTDISQS GFAITGLADS ANLVAFYDKN GLEARVAVNY
RKEYLRGFGQ NQNTGAFGSE PTFENPNLQI DFSTSYALTK QINLFFEAQN LTNETQSTHG
RFDNQLLDVF AYGRRYTAGA RFRF