Gene Caul_2646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2646 
Symbol 
ID5900101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2871511 
End bp2873826 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content65% 
IMG OID641563137 
ProductTonB-dependent receptor plug 
Protein accessionYP_001684271 
Protein GI167646608 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.994259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTGC AGTTTCGAGG CCTGAAGGCC CTGGCTCTGG GCGCCACGGC GCTCACGGCG 
CTGAGCGTTC ATCCCGCCTT CGCCCAGGAC GCCGCCTCCG CGCCGGCCGA CAACAGCATC
GAGGAAATCG TCGTCACCGC CCTGAAGCGC TCGACCACGA TCCAGACCAC GCCCATCAGC
ATCAGCGCCG TCACCGAGAA GTCGCTGCAG TCCCTGGGCG CCTCGGGCAT CCAGGATTAT
TTCCGCACGG TTCCGAACCT GCAGGTCGAG GGCAATTCCC CCACCAACCG GCGCCTGACC
CTGCGCGGCG TGCGCAGCGC CGGCGAAGCC ACCGTCGGCC TCTATTACGA TGAAACCCCG
CTGACCGGCC CGGCCGGGAC CACCGCGGAC GCCAGCTCGA CCAGCGCCGA CGTCAACCTC
TTCGACGTCG AGCGGGTCGA GGTCCTGCGC GGCCCTCAAG GCACGCTGTA CGGCTCTGGT
TCGATGGGCG GCACGCTCCG CGTCATCATG AACAAGCCCG ACAGCCACCA GTACACCGGC
GCCGTCGAGG CCCAGGGCAC CGCCACCAAG GACGGCGGCC CGGGCTACTC CGTCAAGGGC
ATGGTCAACG TGCCGCTGAT CCAGGACAAG CTGGCGGCCC GCCTGGTCCT CTACGAAGCC
GAGCAAGGCG GCTATGTCGA CGATCTCCTG CTGAACAAGA AGGACATCAA CGACCAGCAC
TCGACCGGCG GTCGCTTGAT GCTGGGCTTC ACGCCGACCG ACAACCTGAC GCTCACCGCC
ACGGGCGTGT TCCAGAAAAC TACCCTGGAC GGCCAGAACA GCTGGTATCC CGCCCTTGGC
TCGAAGGACT ACTCGACCAA TGCTCGCGTC ATCGCGCCCA CCGACGACAA TCTGCGCATG
TACAACGTCA CGGCCAAGTG GGACCTGGGT TTCGCCACCC TGACGGGCAC CTCGTCCTAC
TACAAATGGA CGCTGCTACG GAACTCCGAC TACAGCCCCA CCCTCTCGGC CAGCCGCGCC
AACGCCACCT CGTGCCGCAA CTACATCGCT GGCGGCGCCC CCGCGAGCGG CACGACGAAC
CCGGCCTGCA CCACCACCCA GATGGCGCAA TACACCGCCT ATGCCGACAG CCGCGTTCCC
GGCGCGCTCT ACCAGCCCAT GGGCCTGACC TCGTGGAACC ACGAACTGCG GGCCAACGGC
GCCTTGTTCG ACAACAAGGT CGATTGGACC GCCGGCGTCT ATTACGAAGA TCGCTCGGAC
TATATCGAAA GCCAGGTCGC CAAGGCCGAT GCGACGACCG GCGTCATCAA TCCCAGCGAC
CTGACCGCCT GGCGCCATGT CGGCACCGAC ACCAAGCAGA CCGCCTTCTT CGGCGAGATC
ACCTACAAGC CGATCGAGAA GCTGAGCCTG ACCTTCGGCG CGCGGCGCTT CGACTATGAC
AAGACCGTCT CGGGCCAGGT GCTGATCAGC AACTTCATCA CCCAGTCCTA TGTCGGCCCG
GCCGCACAGG TCGACGCCAG CGCTAGCGGC TGGGTCAGCA AGTTCAATGT CAGTTACCAG
GTCACGTCAG ACATCATGGT CTATGGCTTG GCGGCCAAGG GCTTCCGCCC CGGCGGCGCC
AACAACATCC CAGGCCTGAC CTCGGCCCTG CTGGCCTACG GGCCCGACAG CCTCTGGAAC
TACGAAGCCG GCGTGAAGAG CCAATGGTTC GACCATCGCC TGACTCTGAA CGCCGCCGCT
TACCAGATCG ACTGGAGCAA CATGCAGATC TCGGCGACCA GCGCCAACGG GGCTTTCAGC
TACCTGACCA ACGCTGGCGC CGCCCGGATC AAGGGAGTTG AACTGGAGGC CGTCGCCCGT
CCGATGCGCG GCCTGACGCT GAACGCCACC GCCGCCTTCG TCGACGCCAA GCTGACGGAG
GACCAGGCCA ATTCGACCAT CCTGATCACC GGATCGACGG GCCTGCGCGG CGACGAATTC
CCCAACGTCG CCGACTTCAG CGGTTCGGCC GCGGTCGAAT ACAGCTGGCC GCTGACCGAC
GCCTTGAACG GCCTGCTCCG GGCCGACTAC GCCTATGTCG GCGAGTCCGC GTCGCAGTTC
CGCCCGACCT ATACCTACTA CGAAAAGCAG GGCGACTACG GTTACGCCAA CCTGCGGGGC
GGCGTCGAAG GAGCCGACTG GGGGGCCTAT CTGTTCGTCA ATAATGTTGG CAACGAAGTG
GGCCTGATGA GCGTCACCTC GGCCCTCAAC AACAAGCAGC AGGCCGTGAG CATCAATCCA
CGCACGGTCG GCGTCTCCGT CCGTAAGCGT TTCTAG
 
Protein sequence
MRLQFRGLKA LALGATALTA LSVHPAFAQD AASAPADNSI EEIVVTALKR STTIQTTPIS 
ISAVTEKSLQ SLGASGIQDY FRTVPNLQVE GNSPTNRRLT LRGVRSAGEA TVGLYYDETP
LTGPAGTTAD ASSTSADVNL FDVERVEVLR GPQGTLYGSG SMGGTLRVIM NKPDSHQYTG
AVEAQGTATK DGGPGYSVKG MVNVPLIQDK LAARLVLYEA EQGGYVDDLL LNKKDINDQH
STGGRLMLGF TPTDNLTLTA TGVFQKTTLD GQNSWYPALG SKDYSTNARV IAPTDDNLRM
YNVTAKWDLG FATLTGTSSY YKWTLLRNSD YSPTLSASRA NATSCRNYIA GGAPASGTTN
PACTTTQMAQ YTAYADSRVP GALYQPMGLT SWNHELRANG ALFDNKVDWT AGVYYEDRSD
YIESQVAKAD ATTGVINPSD LTAWRHVGTD TKQTAFFGEI TYKPIEKLSL TFGARRFDYD
KTVSGQVLIS NFITQSYVGP AAQVDASASG WVSKFNVSYQ VTSDIMVYGL AAKGFRPGGA
NNIPGLTSAL LAYGPDSLWN YEAGVKSQWF DHRLTLNAAA YQIDWSNMQI SATSANGAFS
YLTNAGAARI KGVELEAVAR PMRGLTLNAT AAFVDAKLTE DQANSTILIT GSTGLRGDEF
PNVADFSGSA AVEYSWPLTD ALNGLLRADY AYVGESASQF RPTYTYYEKQ GDYGYANLRG
GVEGADWGAY LFVNNVGNEV GLMSVTSALN NKQQAVSINP RTVGVSVRKR F