Gene Caul_4100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4100 
Symbol 
ID5901562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4452896 
End bp4455853 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content64% 
IMG OID641564620 
ProductTonB-dependent receptor 
Protein accessionYP_001685722 
Protein GI167648059 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAC GTACTCGTGC GCGCGTCTAT TGGGTCGGCG GCGCGTCAGC CGCCGCCGTC 
ATATTGTCAG CCGGCCTAGC GAACGCTCAG GCAAAACCGC CCACGGGCAA CATGGTCGAG
GAAGTCGTCG TCACCGCCCA GCACAAGCAG GAGGCGGCCC AGACCGTGCC GATCGCCATC
TCGGCCTTCT CGCAGAAGAC CCTGGACGCC GCCAAGATCG AAGGCGGCCC CGACCTGTTG
AAGGCCATCC CCAATGTCAG CTTCACCAAG ACCAATTTCA GCGGCGTGAA CCTGACGATC
CGCGGCATCG GCACCCAGGC GGTGTCGGTC TCCACCGATC CCGGTGTCTC GATCAATTTC
AACGGCACGG CCTTCATCCG CAACCGCTTC TTCGAGCAGG AGTTCTTCGA CCTGGAGCGG
GTCGAGGTGC TGCGCGGGCC GCAGGGCACG CTTTACGGCC GCAACGCCAC GGCCGGCGCG
GTCAACATCA TCTCCGCCCG CCCCACCGAC CTCTTCGAGG GCGAGATCAA GGGCGAGGTC
GGCGACTACG CCAGCCGTCG GCTGAGCGGC TTCGTGAACA TCCCGCTGAT CGGCGACAAG
CTGGATCTGC GCCTGGCGGG AGCCTCGACC AACCGTGACG GTTTCGCCAA CAACACCACC
ACGGGCAACA AGATCGACGG CCGCGACCTC TATTCCACCC GGGTGAGCCT GGGCTTCAAG
CCAGCCGAAT CCGTGCGCGC CAATCTGGTC TGGGAGCATT TCAGCGAGAA CGACAACCGC
GCCCGCAGCA CCAAGCAGCT CTGCACGCGC GACCCCGGCA TCCCGAGCCT GGGCGGCATG
ACGCTCAGCC CGCAAACCAG CGCCTTCTTC AGCCAGGGCT GCCTGAACGC CTCGCTCTAT
TCGGACGCCG CTTACGGCAC GCCCAACGGC CTGTCGATCC CGTTCGTCCT GGCCTCGGTG
ACGAAAACCG CCGTCCTGGG CCTGAATCCC GACGGGTTTT CCGGCTCGGT GGATCTGCTG
AAGGCCGTCG ATCCCTATGG CGGCGCGATG CAGTCTCGGG ACCTGCGGCA GGTGTCGTCG
ATGTTCGACC CGAAGTACCG AGCCAAGGCC GACACCTTGG AGCTGAACGT CGATTTCGAC
CTGTCCCCGA CCCTGACTCT GACCTCGCAG ACCGCCTATA ATCGCGATCA CTATTTCTCC
AGCCAGGACT ACAACCGCTT CAACACCCAG CCTGGAGTCT TCAACGACTC CACGGGCCTG
GACAACTTGG TCGTTGCCGG AACGCCCGCG CCGAACCTCA CTCCGGGCGG CGTCTATTGC
GATCCCCAGC TGGGCTGTTC CAGCACCATC GCCGGCCTCG ATCAGTCCAA GGCCAAAAGC
CGGCAATTCT CGCAGGAGCT GCGCCTGCAA TCGGCCTTCG ATGGCCCGCT CAATTTCAGC
CTGGGCGCCA ACTACACCAA CTACCGAACG ATCGAGGACT ACTACGTCTT CTTCAACATC
ATCTCGGCCA TCGCCCAGAG CAGCGGCTAT GGCAATAACA GCCTCGACCC TACCAAGTGC
GGCAAGAATT TCGACGGTTC GACACCGACG ATCACCCCCG GCGGCACGAC GGGCTGCATC
TATATCGACC CCAACCCGCT GGACCAGGTG AATGGCCTGG GCCACAACTA TTTCCGCAGC
CAGAACGTGT ACAAGGTCAG CTCAAGCGCA GCGTTCGGCG AGCTGTACTA CAATATCACG
CCGGACCTGA AGCTGACGGG TGGTCTGCGC TACACACGCG ATCGCAAGAC CTTCACCCCC
GTGCCGACCC AGGTGCTGCT GTCGTCCGGG AACGGCGGAA CGGTCGATGG CGGCTATCCA
GTCGGCGAGG ACATCAAGCA GCACTGGGGC GTGATCACCG GACGCCTGGG CCTGGACTGG
TCGCCCAAGC TGTCCTTCAC CGACCAGACC CTGCTCTATG CGTTCTACAA TCGCGGCTAC
AAGGGCGGCG GCGCCAACCC GCCCGCCATC GGCTACAGCG ACGAGCCGCT GGCCCCTGGT
CTGCCGCCCC TGGTGCAACT GACGTCCTAT CCCGACACGT TCAAGCCCGA ATACGTCAAC
GCCTTCGAGG TCGGGACCAA GAACACGCTC CTGGGCGGCG CCCTGACGCT CAACACCTCG
GCCTTCTACT ATGACTACAA GGGCTACCAA GTTTCCCAGA TCCAGGATCG GACCGCGATC
AACGAGAACT TCGACTCCAA GGTCTGGGGT CTGGAGTTCG AGTCGAACTG GCGCCCCACC
CAGGCCCTGC GCCTGACCCT CAACCTGGGC TATCAGGACA GCAAGCTCGC CGACGGCTCC
AAATCGATCG ACGTCGAGAA CCGCACCCAG GGCAACCCCG ATTTCACGGT GATGAAAGCC
AACCCCTTCC TGCCGTCCAA CTGCGTGGTC TCGAAGGCCT ATGTGACCAG CTTGATCGCC
GCCAACGTCG CCACCAACCA GCCCGATTAC GCCTATTTGG GCGGCGTCTG TCCGGGTTCG
ATCGCGAGCC TGTTCCTGGT CCCCGCCCCG ACCGAGGCCG ACCTGCCCAA TGGCGGGCGC
GGGTTCTATG CCGACCTGTC GGGCCACGGC CTGCCCAATC AGCCCAAGTG GACCCAGTCG
CTCAGCGCCG AATACAGTCT GCCCCTGAGC GACGGCTGGG TCGGCGCGAT CCGCGGCGAC
GTCTACCACC AGTCCCAGTC CTGGGCGCGG GTGTACGAGG ACGCCATCGA CAAGCTGCAC
GGCTGGTACA ACGTCAACCT GCGGCTGACC GTGGCCAAGC CGGACTCCGG CCTGGAGTTC
GAGCTCTACG CCAAGAACCT GCTGGATGAC TCGCCCATCA CCGGCGCCTT CATCAACAGC
GACGACACCG CCCTCACCAC CAACGTCTTC ACGCTCGATC CGCGCGTGAT CGGAGCCAGC
GTCCGCAAGA GATTCTGA
 
Protein sequence
MSQRTRARVY WVGGASAAAV ILSAGLANAQ AKPPTGNMVE EVVVTAQHKQ EAAQTVPIAI 
SAFSQKTLDA AKIEGGPDLL KAIPNVSFTK TNFSGVNLTI RGIGTQAVSV STDPGVSINF
NGTAFIRNRF FEQEFFDLER VEVLRGPQGT LYGRNATAGA VNIISARPTD LFEGEIKGEV
GDYASRRLSG FVNIPLIGDK LDLRLAGAST NRDGFANNTT TGNKIDGRDL YSTRVSLGFK
PAESVRANLV WEHFSENDNR ARSTKQLCTR DPGIPSLGGM TLSPQTSAFF SQGCLNASLY
SDAAYGTPNG LSIPFVLASV TKTAVLGLNP DGFSGSVDLL KAVDPYGGAM QSRDLRQVSS
MFDPKYRAKA DTLELNVDFD LSPTLTLTSQ TAYNRDHYFS SQDYNRFNTQ PGVFNDSTGL
DNLVVAGTPA PNLTPGGVYC DPQLGCSSTI AGLDQSKAKS RQFSQELRLQ SAFDGPLNFS
LGANYTNYRT IEDYYVFFNI ISAIAQSSGY GNNSLDPTKC GKNFDGSTPT ITPGGTTGCI
YIDPNPLDQV NGLGHNYFRS QNVYKVSSSA AFGELYYNIT PDLKLTGGLR YTRDRKTFTP
VPTQVLLSSG NGGTVDGGYP VGEDIKQHWG VITGRLGLDW SPKLSFTDQT LLYAFYNRGY
KGGGANPPAI GYSDEPLAPG LPPLVQLTSY PDTFKPEYVN AFEVGTKNTL LGGALTLNTS
AFYYDYKGYQ VSQIQDRTAI NENFDSKVWG LEFESNWRPT QALRLTLNLG YQDSKLADGS
KSIDVENRTQ GNPDFTVMKA NPFLPSNCVV SKAYVTSLIA ANVATNQPDY AYLGGVCPGS
IASLFLVPAP TEADLPNGGR GFYADLSGHG LPNQPKWTQS LSAEYSLPLS DGWVGAIRGD
VYHQSQSWAR VYEDAIDKLH GWYNVNLRLT VAKPDSGLEF ELYAKNLLDD SPITGAFINS
DDTALTTNVF TLDPRVIGAS VRKRF