Gene Caul_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1859 
Symbol 
ID5899314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1991688 
End bp1994354 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content66% 
IMG OID641562349 
ProductTonB-dependent receptor 
Protein accessionYP_001683486 
Protein GI167645823 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCT CCAAGAATTC GTCGGCGCGC TGCGCCGGCG CCGTCGCCAT TCTCGCCTTC 
GCCGCCCTGG CCTCCAGCCA GGCCGCCCAA GCCGCCGAAC CCGCCGCCGC GCCGGTCGCC
GAGCCCGCCG AAACGGCGGA CACCGTCGAC ACCATCGTCG TCACCTCCTA CGGCAAGAGC
CTGCAGGCCG CCGCCGCCCT GAAGCGCTCG GCCGCCTACG GCCTGGACGC CATCAACGCC
GAGGACATCG GCAAATTCCC GACCCGCAAC GCCGCCGAGG CCCTGCAACT GGTCACCGGC
GTGACCATCG ACCGCCAGCG CGGCGCGGGC CTCTATGTCA GCGTCCGGGG CTTGGGTCCG
CAGTTTCAGT ATGTCGAGCT GAACGGCCGC TCGATCGCGG TCAACGAGCT GATCGAGAAC
GGCGGCGCCA AGGGCCGCCA GTTCCGGTTC GAGGTGCTGC CGGCCGAACT GATCTCGCAG
ATCGAGGTGG TCAAGACCCC GACCGCCGAC ATGGACGAAG GCGCCCTGGG CGGCAACATC
AACATCAAGA CCTTCAAGCC GCTGGACGTG GGCGACAAGG CAGCCTTCTC GATCCGCGAG
ACCTATGGCG ACGTGCGTAA GAAGGCTGAT CCCTCGGCCT CGGGCCTGCT CAGCTGGACC
AATGCCGACA ACACCTTGGG CCTGCTGGCC TCGGGCCTCT ACGACGAACG CCACGTGCGC
AACGACCGCC TGTACCAGGT CGGCTGGAAC CTCAACAAGT TCACCTCGAC CCTGGGCGCG
GGCCGCTACA CCCCCTCGCG GACCCGCCCG ACCATCGAAA CCGAACACCG CAAGCTCTAT
TCGGGCGCGG TGACCGGCCA GTGGCGGCCC AATAGCGACT TCCAGACCGA TGTGGACGTG
CTCGTGACCC GCCTGGACGT CGACTATGAC GAGTTCGGCC TCGACATCTA CCCCGACGAC
ACCACCTTCC AGAAGCCGAT CTTCGTGGCC GGCAGCGAGA AGCTCATCGG CGACACCGTG
GTCGGCGGCA CGATCAACAA CGTGCGCTGG ATGGCCTCGC GCGAAACCAG CCTCAACCGC
CATGACCTGA CCGCCGTGGG CCTCAAGCAG GCCTGGACGC CGGGGGCGTG GACTTTCAAC
GGCGAATACG CCTATTCCCG AGCGCGCAGC TATCACCCCG ACGGCGAAGC CACCAAGCGC
AACCGCCTGG CGTTCTTTGG CCCGCTGACC TACGATTTCT CGCGCGGCTA CAAGTCGATC
CCGCAACTGA CCACCACGGT GGACTACACC AACCCGGCCA ACTTCGTTGG TCAGGCCTTC
GACTATACTT GGAAGGACTC GCGCGACACC GACGAGACCA TCCGTGCCGA CGGCGGGCGG
GTGTTCGACG GCTGGTTCAG CAAGATCGCC TTCGGGGCCG AGCGCCACAA CCTGAAGCGC
GACTACCGTC GCCGCGACTG GGTGCTCAAC AACGACCTGA ACGTGCCGAC CAGCACGCTC
GGCTCTGCCT ACTACGAGCC CCTGCCCTAT TCGGGCTTCC TTAAGGACTT CGACGGCAAT
ACCCCGCGCA ACTGGGTTAC GCCGACCCGC GACGCCTTCT ACAATGCGCT GTTCACGCCG
ACCGTCGCCG GCCAGCCGAT CTCGGCGGCG GATGCGCGCA ATTCGTTCGT GGTCGACCAG
CACATCACCA GCGCCTATGT GCGCGGCGAT TTCGCGTTCG CCACCGCCAT GCCGATCAGC
GGCAATGTCG GCGTTCGCTA CGCTCGCACC GAACAGGTGG CCAGCGGCAC GTTGACCAGC
ACCGACGCCA GCAACAATCC CGTGCTGACC CCGGTGTCCT ATAAGCAGGA CTATGGCAAC
TGGCTGCCCA GCCTGAACGT GAAGATCGAA CTGCGGGACG ACCTGATCGG CCGCTTCGCC
GCTTCGCGCG TGGTCAACCG GCCCAACGTC ACCGACAGCG CCCCGCGCAT CAGCGTGGCG
CGGGACACGC CCAGCGCCTC GGGCGGCAAC CCGGACCTGA AGCCGTTCCT CGCCGACCAG
TTGGACGCCT CGCTGGAATG GTACCCGGCC CCGACCACGG CCCTGACCGG GGCGGTGTTC
TACAAGAAGA TGGACAACTA CATCACCCAG CAGAACACCA CGATCCAAGT CCCGGGTCGC
GGCGACGTGC TGCTGTCGAC CAGCGTCAAC GGCGGCGACG CCAAGCTGAC CGGCGTCGAG
GTGGCGTACA ACCAGAGCCT GGCCTTCCTG CCGGGTCCGT TCGACGGTCT GGGCGTCCAG
GCCTCGGTCA CGCTCGTGGA CAGCAAGGCC AGCTACTTCG CCGGCAATCG CCAGATCAAG
GACGATCTGA TCGGCCTGTC CAAGACCAGC TACAACCTGG TCGGCTACTA CGAGAAGGGT
CCGCTGGCCG CGCGCCTGGG CTGGTTCTGG CGCTCGCGCT ACCTGTCGGG CACGGGCAGC
ACCACCACAG CCGAGTCCTA CATCGACGCC TACGGCTCGC TGGACGGCTC GATCTCCTAC
GACCTGACCA AGAACTACGC CCTGACGCTG GAAGGCTCGA ACCTCACGGA CGAGATCCGC
TACGTCTACG GCAAGACCAA GGACCAGCCG ATGGAGACCT ATCACTGGGG CCGCACGGTC
TCGCTGACCC TGCGCGGCAA GTTCTAA
 
Protein sequence
MIRSKNSSAR CAGAVAILAF AALASSQAAQ AAEPAAAPVA EPAETADTVD TIVVTSYGKS 
LQAAAALKRS AAYGLDAINA EDIGKFPTRN AAEALQLVTG VTIDRQRGAG LYVSVRGLGP
QFQYVELNGR SIAVNELIEN GGAKGRQFRF EVLPAELISQ IEVVKTPTAD MDEGALGGNI
NIKTFKPLDV GDKAAFSIRE TYGDVRKKAD PSASGLLSWT NADNTLGLLA SGLYDERHVR
NDRLYQVGWN LNKFTSTLGA GRYTPSRTRP TIETEHRKLY SGAVTGQWRP NSDFQTDVDV
LVTRLDVDYD EFGLDIYPDD TTFQKPIFVA GSEKLIGDTV VGGTINNVRW MASRETSLNR
HDLTAVGLKQ AWTPGAWTFN GEYAYSRARS YHPDGEATKR NRLAFFGPLT YDFSRGYKSI
PQLTTTVDYT NPANFVGQAF DYTWKDSRDT DETIRADGGR VFDGWFSKIA FGAERHNLKR
DYRRRDWVLN NDLNVPTSTL GSAYYEPLPY SGFLKDFDGN TPRNWVTPTR DAFYNALFTP
TVAGQPISAA DARNSFVVDQ HITSAYVRGD FAFATAMPIS GNVGVRYART EQVASGTLTS
TDASNNPVLT PVSYKQDYGN WLPSLNVKIE LRDDLIGRFA ASRVVNRPNV TDSAPRISVA
RDTPSASGGN PDLKPFLADQ LDASLEWYPA PTTALTGAVF YKKMDNYITQ QNTTIQVPGR
GDVLLSTSVN GGDAKLTGVE VAYNQSLAFL PGPFDGLGVQ ASVTLVDSKA SYFAGNRQIK
DDLIGLSKTS YNLVGYYEKG PLAARLGWFW RSRYLSGTGS TTTAESYIDA YGSLDGSISY
DLTKNYALTL EGSNLTDEIR YVYGKTKDQP METYHWGRTV SLTLRGKF