Gene Caul_2383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2383 
Symbol 
ID5899838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2590565 
End bp2593345 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content69% 
IMG OID641562874 
ProductTonB-dependent receptor 
Protein accessionYP_001684008 
Protein GI167646345 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTACA AGGACATCAG CTACGCGGTC CTGCTGGCCG CGACGACCGC CGCCGGCGTC 
CACGCGCCGG CCTTCGCCCA GGCGCCGACG GTGACGTTCG ACATTCCGGC GGGCGACCTG
ACCGCGTCGC TGAACACCTT CGCGCGCCAG GCCGGCGTGC AGATCTTCTT CCCGTCGGCC
GAGCTGGCGG GCCGCAAGGC GCCGGCCATC AAGGGCGCGA TGCCGGCCCA GGCCGCGCTG
TCGAGGCTGC TGGCCGCCGG CGAGCTGGAG ATCGCCGCCG ACGATGGCCA CACCATCTCG
CTGCGCCCGG CCCGGCGGGA CGCCGCCGTG GCCCTGGACG AGATCCTGGT CACGGCCCAG
AAGCGCGAGC AGAAGACCAT CGACGTGCCG TTCGCCCTGA CCGCCTACAG CGGCAAGGCG
CTGGAGCGGC TGGGCGTCAC CAACTTCCGC GAACTGTCGA CCCACGTGCC AGGCCTGATG
GTCGAGGACC AGTCGCCCAA CAATCCGATC TTCGTGATGC GCGGCATCAC CTCGTCGGGC
GGGGACTCGT TCACCGAGCC GCGCGTGTCG GTGTTCCAGG ACGGCGTGTC GATCTCCAAG
TCGCGCGGCT CCTACATCGA GCTCTACGAC AACGCCCGGG TCGAGGTGGC CAAGGGGCCG
CAGTCGACGC TGTTCGGGCG CGGCGCCTTG ATCGGCGCGC TGAACGTGAT CCAGAACAAG
GCCGGTCCGG CCCCCGATTG GTCGGTGGCG GCCGAGGGCG GCAACCTGGG CTACCACCAG
CTGGACGCCA TGCTGAACCT GCCGGTGTCC GACACGGTGT CCCTGCGCGT GGCGGGGCGT
CGCAAGAGCC GTGACGGCTA TGAGAAGAAC CTCGACCCCG CCGCCGACGG CGACCTGAAC
GCCATCGACA CCAACGCCTA CCGGGTCGCG CTGAGCTTCA AGCCGAACGA CCGCTTCAGC
GCCGACCTGA TCTACAACCA CCAGGACGAC GAGACGAACG GCACCGGCTT CAAGTCGATG
TATGTCAGCC CGACCGACCC GGCGACGGGC AAGGTGCTGG CCGGCACGCG GGTGGACGAT
CCGGTCTGGC TGTCCAAGCC CGCCGACTTC GCGCTCGGCC ACCATCTGGG CGTCGACCAG
TACCAGAACG GCGTCATGGC GCTGGTGAAG TACAAGGTCT CCGACGCCCT GACCCTGAAC
GCGACGACCA GCTATCGCGA CATCGACGCG GTCGAGGTCT ACGACGCCGA CGGCACGTCG
CTGCCGCTGT TCACCAACAT GGAGGACGTC GGCGGCACGC AGGTCAGCCA GGAGCTGCGC
CTGAACTACG ACAAGGGTGG TCGCTTCTCT TGGTTCGCCG GCGCCAACTA TTACCGCGAG
CGCTCCAAGG CCCGGGTCGA TGTCCGCTTC GACGAGCGGA TGCTGCTGGC CCAGGCGGCG
GGCATGCTGA GCGGCGGACC GTTCACCGGC CTGCCCAAGA CCACGCCAGC GCCGGCCTCG
CTGTTCGAAA GCACGGCCTT CACCGGCGCG CTGCTGCAGG GTCTGGTCAC CCAGTCCAGC
AAGGGCAATC TGGTGCTGAC CAGCGCCGAG GCCGGCGCTC TCGCTGCGCG GCTGGATCCA
CACCACGTCG AGACCTCGCG CAACGAGTCC GACCTCGACG CCTATGATCT GTTCGGCGAC
ATGACCTTCC ACCTCACCGA CCGCTTCGAG CTGTCGGGCG GCCTGCGCTA CAGCCGCGAC
GAGAAGACCA CGATCTGGGG CAGCTCGGTG CAGGGCCGCA GCATCCTGGG CGGCGCCATC
GGCGCGGCCG GGATCGCCGC CACCGGCGCG CCGGCCGGCG TCGCCACGGC CCGCGCCCTG
ATCCAGGGCA TGACCTTCTA TGGCCCGACC CTGAACGGTC CGGTCCCGCT GTTCGGGGTG
TCGGCTCAGC CCACGGCCCA TAATGGCGAC TTCGCCAGCC GCGACCTGAC CGATGATGGC
GTCACCTGGC GCCTGACCGG CCGCTATGCG CTCAGCCCTA CCGCCAACCT CTATGCCAGC
TATTCGCGTG GTCGCCGGCC GGGCGTACTG TCGGCCGGGG CGCCGGGCGC GCCAGACGGG
ACACCCACCT TCGCCATCGC GCCGGCCGAG ACCGCGCAAG CCTACGAGAC CGGGATCAAG
GCCGACCTGC TGGACCGCCG CCTGCGGATC GACGGCTCGC TCTACTACTA TGACTACGAC
AACTTCCAGA CCCGCGAGCA GCGTGGCTCG ACCTTCGTGA CCACCAACGC GGGCACGGCG
CGGGCCTACG GGTTCGAGAG TCAGGCCGAT TTCGCCGCGA CGCCGAACCT CGACCTGTTC
GGCACCTATG CCTACAACCA CGCCCGCTTC ACACGCGGCG CCTATGAGGG CAACCACTTC
GCCCGCTCGC CGGACCACAT GGTGTCGCTG GGCGCCTCAA TGCGCTGGAC GGGCCTGGGG
GGCAGGTTCG ACTTTCGGCC GACCTACACC TGGCGTTCGA AGATCTTCTT CGCCGACGAC
AACGACCGGC CCGAGCTGCA GGCGGGCCTG CTGGTCCCCG ACGCCGGCCA GGACGAGTTC
CAGAACGGCT TTGGCCTGCT CAACGCCCGG ATCAGCTACG CGCCGGAACG GGGTAGCTGG
GAAGTGGAGG CCTTCGGCAG CAACCTGACC GACGAGATCT ATCGCAAGGG CGCGGGGAGC
GCCGGCAAGT CGATCGGCTT GCCGACCAAT GTGCTGGGCG AGCCGCGCGT CTACGGCCTG
CGCCTGACCA TCCACCGCTA G
 
Protein sequence
MTYKDISYAV LLAATTAAGV HAPAFAQAPT VTFDIPAGDL TASLNTFARQ AGVQIFFPSA 
ELAGRKAPAI KGAMPAQAAL SRLLAAGELE IAADDGHTIS LRPARRDAAV ALDEILVTAQ
KREQKTIDVP FALTAYSGKA LERLGVTNFR ELSTHVPGLM VEDQSPNNPI FVMRGITSSG
GDSFTEPRVS VFQDGVSISK SRGSYIELYD NARVEVAKGP QSTLFGRGAL IGALNVIQNK
AGPAPDWSVA AEGGNLGYHQ LDAMLNLPVS DTVSLRVAGR RKSRDGYEKN LDPAADGDLN
AIDTNAYRVA LSFKPNDRFS ADLIYNHQDD ETNGTGFKSM YVSPTDPATG KVLAGTRVDD
PVWLSKPADF ALGHHLGVDQ YQNGVMALVK YKVSDALTLN ATTSYRDIDA VEVYDADGTS
LPLFTNMEDV GGTQVSQELR LNYDKGGRFS WFAGANYYRE RSKARVDVRF DERMLLAQAA
GMLSGGPFTG LPKTTPAPAS LFESTAFTGA LLQGLVTQSS KGNLVLTSAE AGALAARLDP
HHVETSRNES DLDAYDLFGD MTFHLTDRFE LSGGLRYSRD EKTTIWGSSV QGRSILGGAI
GAAGIAATGA PAGVATARAL IQGMTFYGPT LNGPVPLFGV SAQPTAHNGD FASRDLTDDG
VTWRLTGRYA LSPTANLYAS YSRGRRPGVL SAGAPGAPDG TPTFAIAPAE TAQAYETGIK
ADLLDRRLRI DGSLYYYDYD NFQTREQRGS TFVTTNAGTA RAYGFESQAD FAATPNLDLF
GTYAYNHARF TRGAYEGNHF ARSPDHMVSL GASMRWTGLG GRFDFRPTYT WRSKIFFADD
NDRPELQAGL LVPDAGQDEF QNGFGLLNAR ISYAPERGSW EVEAFGSNLT DEIYRKGAGS
AGKSIGLPTN VLGEPRVYGL RLTIHR