Gene Caul_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1111 
Symbol 
ID5898566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1178007 
End bp1180271 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content69% 
IMG OID641561593 
ProductTonB-dependent hemoglobin/transferrin/lactoferrin family receptor 
Protein accessionYP_001682739 
Protein GI167645076 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.192516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGCT CCAAGCTCGC CAAGCTCACC TGCTTCGCGG CGGCTTCCGC CACCGCCCTG 
CTCTGCGCCC AGGCCGCGTT CGCCGCCGAC GCCACGGCCG ACGCCGCCGT CGAGCTCGAC
AGGGTGACCG TCACCGCCAC CCGCTCGGAG AAGAAGCTGC AGGACGCCCC GGTGACCGCC
AGCGTCATCT CCGACCAGGA GATCGAGGAC GGGCTGGTCA AGGACATCAA GGACCTGGTC
CGCTTCGAGC CCGGCGTCTC GGTGCGCCGC GCGCCGTCCC GCTTCACCGC CGCCGGCGCC
TCGACCGGCC GGGACGGCAA TTCGGGCTTC AACATCCGCG GGCTGGAGGG CAACCGCGTG
CTGATCCTGG TGGACGGCGT GCGCGTGCCG GACGGCTACG CCTTCGGGGC CCAGAACATG
GGCCGCGGCG ACTATGTCGA CCTCGACGTC CTGAAGTCGG TCGAGATCGT GCGCGGCCCG
GCCTCGGCCC TCTACGGCAG CGACGGCCTG GCTGGCTCGG TCAACTTCTT CACCAAGGAC
CCGGCCGATC TGCTGACCTC AAGCAAGGCC TTCACCCTGC GCGGCAAGGT CGGCTACGCC
TCGGCCGACG AGAGCTGGAC CGAGAGTCTG CTGGCGGCAG GCCAGACGGG CGACTGGGAA
GGCCTGGTCG CCTATACTCG CCGCGATGGC GAGGGCCAGA AGACGGCCGG GACCAACGAC
TCCGCCAACA CCGACCGCAC CACCGCCAAC CCCGAGGACG ACCAGTCCAA TGCCCTGCTG
GCCCGGGTGG TCTATGCGCC CAGCGACCAC AGCCGCTTCC GACTGACCTA CGACCATCTC
GACCGCGACG TCGACTGGAC GGTGCTGAGC GCCATCGCCA AGCCGCCGCT GGCCTCGACC
GGCGTCCTGG GCCTGACCGC CTTCGACAGG ATGAAGCGCG ATCGCGTCAG CTTCGACCAC
CGCTACACCG GCGGCGAGGG CGTGATCCAG TCGGCTACGA CCACGCTCTA TTACCAGAAC
AGCACGACCC GGCAGTTCTC GGCCGAGGAC CGCAACACCG CCGCCGACCG CACCCGCGAC
GCCACCTTCG ACAACGAGGT CTGGGGCGCT GCCCTCGAAC TTCATAGCCA GGCCGACCTT
GGCGGCGTGA CCCACAGGTT CGTCTGGGGC GGCGACGCCG CGATCACCCG CCAGGAGGGC
GTGCGCGACG GCACGGTCCC GCCGGCCGGC GAGACCTTCC CCACCCGCGC CTTTCCGACC
ACCGACTACA CCCTGGCCGG CGCCTATGCG CAGGACGAGA TCGCGGTGGG GCCGGTGACC
TTCTATCCGG CCGTGCGCTT CGACTACTAC AAGCTGGAGC CGAAGAAGGA CGCCCTGTTC
ACCGCCAACG TTCCGGCCAG TCAGAGCGAT TCCCGCGTCT CGCCGAAGCT GGGCGCGGTC
TGGAAGGCCA GCGACCTGGT GACGGTGTTC GCCAACGCCG CCGCCGGCTT CAAGGCTCCC
TCGCCGTCGC AGGTCAACAA CGGGTTCACC AACCCGACCC AGTTCTACAT GTCGATCTCC
AACCCCAACC TGAAGCCGGA GACCAGCGAG ACCTTCGAGT TGGGCTTCCG CCTGAACCGC
GCCCGCTGGA ACGCCAGCGT CACCGGCTTT ACCGGCAAAT ACGACGACTT CATCGACCAG
GTGCAGGTGC GCGGTTCGTT CAGCCCCACC GACCCGGCGG TCTACCAGTA CGTCAACCAG
TCCAAGGCCG AGATCACCGG CGCCGAGGCC CGGTTCGCCG CCGAACTGGG CCGGGGGTTC
ACCCTGCAGG GCGCGGCGTC CTATGCCCGC GGCAACGCGG AAAACGCCGG CAAGAAGACG
CCCCTGACCT CGATCGATCC GGTCAAGCTG GTGGCGGGTC TGTCATACCG CGACCCGGCC
GGCCGGTTCG GCGGCGCGCT CAACGCCGTG CACGCCGCCA AGGAATCGGC CGGTCGCTCG
GGCGTCAGCT GCAGCATGAC GGTGGCCCGT CCCGCGCCCC TGCCGCCGCT GACCCAGACC
GGCCCGGACT ACTGCTGGAT GCCCAAGGCG TTCACGGTGT TCGACCTGAC CGCCTATTGG
AACCTGACCG GCAACGTCAC CCTGCGCGGC GGCGTCTTCA ACATCACCAG CCAAACCTAT
GCCTGGTGGA GCGACGTGCG CGGCGTCGCC GACACCTCGC TCGTCAAGGA CGCCTACACC
CAGCCCGATC GCAACTACAG CGTCTCGCTG GCGGTGAAGT TCTAG
 
Protein sequence
MSRSKLAKLT CFAAASATAL LCAQAAFAAD ATADAAVELD RVTVTATRSE KKLQDAPVTA 
SVISDQEIED GLVKDIKDLV RFEPGVSVRR APSRFTAAGA STGRDGNSGF NIRGLEGNRV
LILVDGVRVP DGYAFGAQNM GRGDYVDLDV LKSVEIVRGP ASALYGSDGL AGSVNFFTKD
PADLLTSSKA FTLRGKVGYA SADESWTESL LAAGQTGDWE GLVAYTRRDG EGQKTAGTND
SANTDRTTAN PEDDQSNALL ARVVYAPSDH SRFRLTYDHL DRDVDWTVLS AIAKPPLAST
GVLGLTAFDR MKRDRVSFDH RYTGGEGVIQ SATTTLYYQN STTRQFSAED RNTAADRTRD
ATFDNEVWGA ALELHSQADL GGVTHRFVWG GDAAITRQEG VRDGTVPPAG ETFPTRAFPT
TDYTLAGAYA QDEIAVGPVT FYPAVRFDYY KLEPKKDALF TANVPASQSD SRVSPKLGAV
WKASDLVTVF ANAAAGFKAP SPSQVNNGFT NPTQFYMSIS NPNLKPETSE TFELGFRLNR
ARWNASVTGF TGKYDDFIDQ VQVRGSFSPT DPAVYQYVNQ SKAEITGAEA RFAAELGRGF
TLQGAASYAR GNAENAGKKT PLTSIDPVKL VAGLSYRDPA GRFGGALNAV HAAKESAGRS
GVSCSMTVAR PAPLPPLTQT GPDYCWMPKA FTVFDLTAYW NLTGNVTLRG GVFNITSQTY
AWWSDVRGVA DTSLVKDAYT QPDRNYSVSL AVKF