Gene Caul_3243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3243 
Symbol 
ID5900698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3503570 
End bp3505927 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content66% 
IMG OID641563748 
ProductTonB-dependent receptor 
Protein accessionYP_001684868 
Protein GI167647205 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.738407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCG CGTCAACGAA GCTGCGCAGC CTGCTGATGG GTGCTGGATC GGCGCTATCG 
ATCATGGCTG CGCAGGCCCA CGCCCAGACC GCGGGCGCCG CCTTGCCAGT CGCGCCCCAG
GCCCAGCCCG ACGAACTGAC CGAGATCGTC GTCACCGCCC AGAAGCGCGA ACAGACCCTG
AGCGACGTGC CCATGTCGGT GACGGCCTTC AGCGGCGACC AGCTGACCAG GCGCGGCATC
ACCGACGTCC AAGGTCTGGT GAAGATCACG CCCGGTTTAA GCTATGTCGA GAGCGGAAAC
GGGGTGCCGG TCTATTCCCT GCGCGGCGTC GGCTTCTTCG AGACCTCGCT GGGCGCGCGG
CCCAGCGTCG CCATCTATGC CGACGAGGCG CCGCTGCCGT TCGCCAGCAT GGCCAGCGGG
GCCGGCCTCG ACCTCGAGCG AGTCGAAGTG CTGAAGGGGC CGCAAGGCAC GCTGTTTGGC
CAGAACGCCA CCGGCGGCGC GATCAACTAC ATCGCCGCCA AGCCCACGGC CGAGCGCGCG
GGCGGCGCCG CCGTCAGTTG GTCGCGCTTC AACACGGTCG ACACCAGCGC CTATGTGGGC
GGGCCGATCA GCGACACCGT GGGCTTTCGC ATCGCTGGCC GCGCCCAGGT TGGCGACGAC
TGGCAACGCA GCATCACCCG CGACGCGACC CTGGGCGCCA GGCGCTTCTA CCAGGGGCGG
GCGCTGCTCG ATTGGCGCCC CAGCGACAAG CTGAAATTGC TGCTCAACGC CAACGGCTTC
CAGGACAAGT CCGAGACCCA GGCGGCTCAG CGCGTCGGGC TAATCGTGGC CGCCAGCCAG
ACCTCTTTCC TGTCGCGCAT CCCGCTGATG GTGAACTATC CGCCGGCTCC GCGCGATAAC
CGGGCCGCGG ACTGGGACGC CGGCCAGCCC CTGCGCAAGG ACAACGCCTT CTACCAGGTC
TCGCTACGTG GCGACTATGC GCTGGCCGAG ACGCTGACCC TGACGTCGCT GACGGCCTAT
TCGCACATGA AGATCAATCA GTTGATGGAC CAGGACGGCA CATCGCTGAC GGCCTCCCTG
ACCAAGGTGA CGGGGACCTT GTCGTCGTTC TCGCAAGAGG TGCGCGTGGC GGGCGGCGCG
GGTCCGGCGC AGTTCGTCGT CGGCGCCAGC TATGCGCGCG ACAAGTCCGA CGAGGCCGAC
TATTTCCGCT ATCCCTACAC CATCAGCAAC TTCTCCGGCT CGACCGGCCT GACCGGCGCC
ACGGACCTGA CGGGAAGGCA GGACTTCGAC ACCAAGGCCG TCTTTGGCAA CCTCGACTGG
GATATCACGA GCCAGATCGT CGCCCATGGG GGCCTGCGCT ACACCAAGAC CGATCTTGGC
TACGTGTCGT GCGCCCGGGC CGGCGACGCC CAGACGGCGA ATTCGTTCTC GATCCTGGTC
AACGTGCTGC GGGCTCGCGC CGGCCTGGCG TCCATCGCCC CCCTGGCGGT CGGCGGCTGC
GTCTCGCTCG ACGAGGGGCT CAATCCGGTC GGCCTCAACA GCCACCTGAA GGAGGACAAC
CTGTCCTGGC GCGCCGGCCT GGACTGGAAG CCCGCGCCCC GGACGCTCGT CTACGCCAAT
GTGAGCCGGG GCTACAAGTC GGGCAGTTCG CCCACCCTGC CGGCCATCGC CGCCAACGCC
CTTCGGCCGG TGACCCAGGA GTCAGTCCTG GCCTACGAGG CCGGCTTCAA GGCGCCGCTC
ATCCGCCGGG TCGTCGATGT GACCGGGGCG GTGTTCTACT ATGACTATGA CGACAAGCAG
TTGCTGGGTC GCAGCAACGC CCAGCCCGCG GTGCTGGGGG TCCTGCCGTC GCTGGTCAAT
GTTCCCAAGT CCCGGATCAA GGGCGCGGAG TTCCAGATCA ACGCCTTCCC GGCGACCGGC
TTTCGCCTGT CGCTGGCGGG CACCTATCTG GATTCCAAGG TCACCAAGGA TTTCAACAAC
TTCACGATCC TGGGGCTGCC GGCCAATTTC AAGGGCGACG CCTTCCCCTA TACGCCCAAG
TACCAGATGG TCGCCGACAT GCAGGTCGAT CGTCCGATCT CCGACGCGCT CAACGGCGTG
GTGGGCATGA ACATCAACTA CCGTTCCAAG ACCAATGCCG GGTTCGGACA CGACGCGCGC
CTGGACATCG ACTCCTATAC ACTGATTGAC GTTCGCGCCG GAGTAAGGTC GCCCGACAAG
GGCTGGGAAG CCAATGTCTT CGTGCGCAAC CTGACCGACA AGTACTACTG GACCAATGTC
GCGCGCCTGT CGGACGTGAT CCGCCGCTAC GCCGGCGAGC CGCGCACCTA CGGCGTTCAG
GTCAGCAGCA AGTTCTAG
 
Protein sequence
MAIASTKLRS LLMGAGSALS IMAAQAHAQT AGAALPVAPQ AQPDELTEIV VTAQKREQTL 
SDVPMSVTAF SGDQLTRRGI TDVQGLVKIT PGLSYVESGN GVPVYSLRGV GFFETSLGAR
PSVAIYADEA PLPFASMASG AGLDLERVEV LKGPQGTLFG QNATGGAINY IAAKPTAERA
GGAAVSWSRF NTVDTSAYVG GPISDTVGFR IAGRAQVGDD WQRSITRDAT LGARRFYQGR
ALLDWRPSDK LKLLLNANGF QDKSETQAAQ RVGLIVAASQ TSFLSRIPLM VNYPPAPRDN
RAADWDAGQP LRKDNAFYQV SLRGDYALAE TLTLTSLTAY SHMKINQLMD QDGTSLTASL
TKVTGTLSSF SQEVRVAGGA GPAQFVVGAS YARDKSDEAD YFRYPYTISN FSGSTGLTGA
TDLTGRQDFD TKAVFGNLDW DITSQIVAHG GLRYTKTDLG YVSCARAGDA QTANSFSILV
NVLRARAGLA SIAPLAVGGC VSLDEGLNPV GLNSHLKEDN LSWRAGLDWK PAPRTLVYAN
VSRGYKSGSS PTLPAIAANA LRPVTQESVL AYEAGFKAPL IRRVVDVTGA VFYYDYDDKQ
LLGRSNAQPA VLGVLPSLVN VPKSRIKGAE FQINAFPATG FRLSLAGTYL DSKVTKDFNN
FTILGLPANF KGDAFPYTPK YQMVADMQVD RPISDALNGV VGMNINYRSK TNAGFGHDAR
LDIDSYTLID VRAGVRSPDK GWEANVFVRN LTDKYYWTNV ARLSDVIRRY AGEPRTYGVQ
VSSKF