Gene Caul_2543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2543 
Symbol 
ID5899998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2757870 
End bp2760260 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content65% 
IMG OID641563034 
ProductTonB-dependent receptor 
Protein accessionYP_001684168 
Protein GI167646505 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.546908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACA CGCGTACCTC GCCGGCCTCA CGCCATTTCC GGGCCTTGCT GCTCGCCGCC 
ACCGTTCTGG GCGGCGCGAC GCCCGTCCTG GCCCAGGAGG CCGATAAGAC TTCAACCGTC
GAGGAGGTGG TCGTCACCGG CAGCCGGGTG TCCGAAGCCA GCGTCGCCAT CGGCACCGAC
CACGCCACCG CCACGGTCTC GATCACCCGC GAGGCCCTGC TGTCGGCGCC CGCCGGCGTG
ACCGGCCTGA AAATGCTGGA GTCCCTGCCC GGCTTCAACG TCCAGGCCAA CGACGCCCTG
GGCATGTACG AGTTCGGCAA TTCGGTCTCG GTGCGGGCCT TCAACTTCCA GCAGATCGGC
TTCCTGCTCG ACAACATTCC AATGGGCCGC AGCGACCAGT TCGGCGGCAG CCCGATCTAT
CGTTATGTCG ACAATGAGAA CCTCAACCGC GTGACAGCCT CGGCCGGGGC CGGCGACGTC
TCCCTGCCCA GCTACGCCTC GCTGGGTCCC ATCGTCGACT ACTTCACCCA GAAGCCTTCG
GACGAGGCCG GCGGCGCGGC CAGCGCCACC CTGGGCAGCG ACGCCCTCAA GCGCGGCTTC
CTGCGCCTGG AGACCGGCAA GATCGGACCC GTCTCGGCCT ATGTCAGCGG CTCGTGGATC
AAGGGCGACC TATGGCGCGG CCCGGGCACG ATCGACCGCA AGCACTATGA AGGCAAGCTG
AACTACGAAC TGCCCAACGG CGGCGACATC AGCTTCCAGA CCGTCCACAA TGACTATTAC
GATTATGACA GCCCCTCGAT CACCAAGGCC CAATACGCGG GCACGGCGGG TGACGTCTTC
GGGCGCTCGG GTCGCAGCTT CGCCTATCTC GGCGAGGTGC CGCTGTCGGT CCCGCTGGGC
ACGCCGGTGC TCATTCCGAC CGCCAGCCTA CCGCAGACCG TGGCGGGGAT CGTCTATTCC
AACCCCAACT ACGCCAACTA CTACAAGTTC GCGGTCAACA AGAGGAAGGA CCACCTCTAC
GGCCTGACCC TGACCACGCC GATCACCGAC ACGATCGACC TGACAACGAC GGCCTACTAC
GAAGACAAGG GCGGCTATGG CGTCTCGCCC GAGGACTATG CGACGTCCAA GGGCAACTAC
GACGCCGAGA TTCTGGCCGG TCTCACGGGA CTGACCGCGC CCAAGGGCTT GCAATACGGC
CTGTCGGCGA TCGACGGCAC GCGCAAGGGC GTCACGGCCA AGGGCAGCTG GAAGGTCGGC
TTCAACACGT TCGAGGCCGG GGTCTGGCTC GAGAAGGACG ACTATCACCG CACCCAGGCG
CGCTACAACA CCGTCAACGG CGACCCCGAC GGCGCGCCGT TGCTGAACGA ACCAGTGCAC
CTGCAACGAG ACTATGTCTC GACCCGCGAC ACCACCCAGT TCTTCCTGAA GGACACCCTG
AGCCTGCTGG ACGACGCGCT GAAACTGGAA CTCGGCTTCA AGACGCTCGA CGTCGACTAC
AACATCCACG GCAAGCGCCA GATCGCCGAC TACCGGACGG GCCGTACGCC GTCGATCGAC
GCCAAGTGGA AGGACAACTT CCTGCCCCAG GTCGGCCTAG TCTACAGCGT CAGCAGCCGC
GACCAGGTGT TCGCCTCTTA TTCGGAGAAC ATGGCCCTGC CGCGCGGCGC CGACGATGTG
TTCTCGGCCG CCAGCCCGTC CGCGCCCGGT CCCAAGCCGG AAACCTCGAC CAACGTGGAA
CTGGGCTATC GGGCCAATCG CGCGACCTTC AACGCCTCGT TCGTGGTCTA CAAGACCGAG
TTCAAGAACC GGCTGCAGCA GTTCAACGCG GTGGTGCCCG GCAGCACCAC GCTGGAAAGT
TTCTATCAGA ACGTCGGGGC GGTGAAGGCG TCCGGCGCCG AGTTCAGCGG CCAGTGGAAG
CCGGAACTGC TGGGCGGCAA GATCTATTTC AACGCCAACG CCTCGTACAA TAAGTCGGAG
TTCCAGGACG ACGTCCTGAA CTACCGGTCC AGCGCCACGG CCACGCCGGT GACCCTGTCG
ACCAAGGGCA AGGCGGTGCC CGACTTCCCC GAGTGGCTGT TCCAGGGCGG GGTGACGGTC
GAGCCGACGG ACGGCGTGGT GTTCAACCTC TCGGCCCGCC ACATCGACGA CCGCTTCACC
AACTTCATCA ACAGCGAGAG CACCAAGGCC TATACCCTGT GGAACGCCTA TCTGGACCTG
GGCGACGGCT TCGCGGCCGG GCCATTCAAG CAGGTCAAGA CCCGGGTCAA TATCGATAAC
ATCTTCGACA AGGACTATCT GGGCACGATC AACACCACGG TGAACACCGC CGCCAGCTTC
CGGCCCGGCT CGCACCGCAC GATCCAGTTC ACCGTCTCCG CCGACTTCTA G
 
Protein sequence
MTNTRTSPAS RHFRALLLAA TVLGGATPVL AQEADKTSTV EEVVVTGSRV SEASVAIGTD 
HATATVSITR EALLSAPAGV TGLKMLESLP GFNVQANDAL GMYEFGNSVS VRAFNFQQIG
FLLDNIPMGR SDQFGGSPIY RYVDNENLNR VTASAGAGDV SLPSYASLGP IVDYFTQKPS
DEAGGAASAT LGSDALKRGF LRLETGKIGP VSAYVSGSWI KGDLWRGPGT IDRKHYEGKL
NYELPNGGDI SFQTVHNDYY DYDSPSITKA QYAGTAGDVF GRSGRSFAYL GEVPLSVPLG
TPVLIPTASL PQTVAGIVYS NPNYANYYKF AVNKRKDHLY GLTLTTPITD TIDLTTTAYY
EDKGGYGVSP EDYATSKGNY DAEILAGLTG LTAPKGLQYG LSAIDGTRKG VTAKGSWKVG
FNTFEAGVWL EKDDYHRTQA RYNTVNGDPD GAPLLNEPVH LQRDYVSTRD TTQFFLKDTL
SLLDDALKLE LGFKTLDVDY NIHGKRQIAD YRTGRTPSID AKWKDNFLPQ VGLVYSVSSR
DQVFASYSEN MALPRGADDV FSAASPSAPG PKPETSTNVE LGYRANRATF NASFVVYKTE
FKNRLQQFNA VVPGSTTLES FYQNVGAVKA SGAEFSGQWK PELLGGKIYF NANASYNKSE
FQDDVLNYRS SATATPVTLS TKGKAVPDFP EWLFQGGVTV EPTDGVVFNL SARHIDDRFT
NFINSESTKA YTLWNAYLDL GDGFAAGPFK QVKTRVNIDN IFDKDYLGTI NTTVNTAASF
RPGSHRTIQF TVSADF