Gene Caul_1911 details

Gene Information

Locus tag: Caul_1911
Symbol:
ID: 5899366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2051569
End bp: 2054022
Gene Length: 2454 bp
Protein Length: 817 aa
Translation table: 11
GC content: 65%
IMG OID: 641562401
Product: TonB-dependent receptor
Protein accession: YP_001683538
Protein GI: 167645875
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
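
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (an assumption about the database convention, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2051569
END_BP = 2054022
GENE_LENGTH_BP = 2454
PROTEIN_LENGTH_AA = 817


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))    # 2454
    print(protein_length(GENE_LENGTH_BP))   # 817
```

Both results match the Gene Length and Protein Length fields of the record.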


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCCA TGACCAGTCG ACGTGTGTCT GAGTTCCTGG GGCGTGGAAG CGCTCTGGGC 
CTGCTGGCTT GCACGGCCTT GACCGCGCCC GCGTTCGCCC AGAGCGCCGC GCCTCGACCC
CAGTCGGACC AGCTCGAAGA GGTCGTCGTC ACGGCCCGCC GGACGGAAGA AAATCTTCAG
AGCACCCCCG TTGCGGTCAC GGCCCTCGGC GCCGCGGCGC TGGAACGGGC GCAGGTGGCC
GATGTCACCG ATCTCCAACG CACAACGCCC AGCCTCTCGA TCGCCACCGG AGCGCCGTCG
GCGTCGGGCT TCGCCTTCGT GGCCATGCGC GGTCAGGGAA ACCTTCAGGC GCTGGTCTCC
AATGACCCCG CCGTCGCCAT CTATGTCGAC GGCGTCTACA TTCCGCGCCC GTCGCAAGGG
CTGACCGATC TGCTGGACCT GCAGCGCGTC GAAGTCCTGC GCGGCCCGCA AGGAACGCTG
TTTGGCCGCA ACACCATCGG CGGCGCCGTC AACATCCTGA CGGCCGACCC GAACGATGAG
TTCGGCGGCA TGGTGAAGGT CGAGGCTGGA AACTATGACC AGCGCGGCGC CAGCCTCGTC
GTCAACGGCG CCTTGACGGA TAAGCTCTCG GGCCGCCTCG TGGCCGCCAC CAAGAGCCGC
GACGGCTACG GACACGACGT TCCGCTAAAC CGCGATGTCT GGTCCCAGGA CAGCGATTTC
GTCCGCGGCA AGCTCAAGTA CGAGAGCGGG CCCGTTCAAC TGGTCCTGTC GGGCGACTAC
AACAAGATCA GCGACAACGG CCAGTTCACG GCCCTGCACG CCTATGCTCC GGAGCTGTTT
GGGCCCACCG GGTCGTTCGG TCCGTTCGGC CTGGCGCCGA CCTTGAATGC GTCGCTGCAC
AACCAAGGCT TCTACGACAG CTATGCGACG GGCTTCTTCA TCCCCTCTAG CAACCCGAAC
TACGCCACCC TGCCGGCCAA CATCAAAGCG ATGTACGCCA TGCCCCTGGG CAACCGGTTG
GAAGCCTATG GCTTTGGCCT CAACGGCTCG GTGGACCTGG GCGGCGCGAC CCTCAAATCG
ATCACCGCCT ATCGTTTCAG CGACACCGAC GGCGTCGTGG ACACCGACGG CACGCCCGTC
CCGATCCTGA CGACCTGGGC TGGCTACAGC TCGAAGCAAT GGTCGCAAGA GCTTCAGATC
AACGGCGGCT TCGGCGACAA GTTCACCTAT ATCGCCGGTG GCTATTATTC GGACGAGAAG
GGGCAGGAGT TCAGCCGCTC GCAGAACTTC GGCTTCCTGC CCTATTCGGC GACGACGCGT
CCTTATGCCG GCCTCGCCGG ACAGAATCTG GCGGACGTGC ACAACACCTC GCTCGGCGTG
TTTGGCCAGG GCTACTACCA GCTGACCGAC AGCCTGCGGC TCGCCGGCGG GCTGCGCTGG
ACCTGGGATG ACCGCGACAC CAAGCTGTAC AACTACTCTA CCTGGGGCGT GGCCTCGACC
TGCAATCTTC CCAGCCCCGA TGTCGCCGGC GTCTGCGCCC AGACGCAGAA GACCAGCTTC
GATTATCCGG CCTGGACCTT CGGTCTCGAC TGGCAGATCG ACGAAGACGT GTTCGCCTAC
ATCCAGACGC GCCGAGCCTC CAAGTCCGGC GGCTGGAACA CGCGCGCCGG CGGTCTTCCG
GCCTTCCGGC CCGAGACGAC GCGCGACGTC GAAGCGGGCT TTAAGGCCAC CTGGCTGGAA
AATCGCCTGC GGACAAACAT CGCCGTCTTC CACTCCTGGC AGTCGGACGT GCAACGTAAC
GCGGCGGCTC TGACGCCGGC GGGCGCCAGC ACCCAGTTCA TCGTCAACGC CGGCGACGCC
CGGGTCTACG GGGTCGAGCT GGAAGGCGCC TACCGTCCTT GGACGGGCAT GGAGTTGACG
GCCAACCTCA GCCTGATGAA CGGCGAGTAC AAAAGCGGGT CGTTCATCGA ACAGCAGTCG
GTGCCTGGCG TGGCGCTCGC GGGATGTACG CCGGCGGGAA CGTCGTCGAT CTGTCCTGTG
AACCGCAGCG CCGAGGAGCT ACCGCAGCTT CCCAAGCGCC AGTTCAACCT GGGCGCCACC
CAGTCCCTGC CGACGTCCTT TGGCCGGGTG ACGCTGCACG CCGACTACGC CTATCTGGGC
AAGCAGTACT ATAATCCCGT CACGCCGGCC GCCCAGCAGT CGGCCACCAC CAAGGCCATC
TACGCCGCCT CGAACGCCAT CACCGAGACC CCGGGCTATG GCTTGCTGAA CGCGCGCTTC
ACCGTGGAAT TCGACGCCCA CGACATCGAG CTGGCCATCT ACGGCAAGAA CCTGACCAGC
AAGCACTACA ATGTTCGTCA GTTCGCCGAC GTCTATGCCG CTGGCCTGGG CTTCGCCACC
GACTTCATCG GCGAACCGCG CACCTACGGC GCCTCCCTGA CCAAGCGTTT CTAA
 
Protein sequence
MLSMTSRRVS EFLGRGSALG LLACTALTAP AFAQSAAPRP QSDQLEEVVV TARRTEENLQ 
STPVAVTALG AAALERAQVA DVTDLQRTTP SLSIATGAPS ASGFAFVAMR GQGNLQALVS
NDPAVAIYVD GVYIPRPSQG LTDLLDLQRV EVLRGPQGTL FGRNTIGGAV NILTADPNDE
FGGMVKVEAG NYDQRGASLV VNGALTDKLS GRLVAATKSR DGYGHDVPLN RDVWSQDSDF
VRGKLKYESG PVQLVLSGDY NKISDNGQFT ALHAYAPELF GPTGSFGPFG LAPTLNASLH
NQGFYDSYAT GFFIPSSNPN YATLPANIKA MYAMPLGNRL EAYGFGLNGS VDLGGATLKS
ITAYRFSDTD GVVDTDGTPV PILTTWAGYS SKQWSQELQI NGGFGDKFTY IAGGYYSDEK
GQEFSRSQNF GFLPYSATTR PYAGLAGQNL ADVHNTSLGV FGQGYYQLTD SLRLAGGLRW
TWDDRDTKLY NYSTWGVAST CNLPSPDVAG VCAQTQKTSF DYPAWTFGLD WQIDEDVFAY
IQTRRASKSG GWNTRAGGLP AFRPETTRDV EAGFKATWLE NRLRTNIAVF HSWQSDVQRN
AAALTPAGAS TQFIVNAGDA RVYGVELEGA YRPWTGMELT ANLSLMNGEY KSGSFIEQQS
VPGVALAGCT PAGTSSICPV NRSAEELPQL PKRQFNLGAT QSLPTSFGRV TLHADYAYLG
KQYYNPVTPA AQQSATTKAI YAASNAITET PGYGLLNARF TVEFDAHDIE LAIYGKNLTS
KHYNVRQFAD VYAAGLGFAT DFIGEPRTYG ASLTKRF
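
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, assuming the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in the start codons they permit, and start codons are not handled here). The helper names clean, translate and gc_content are illustrative and do not belong to any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 and last 24 nucleotides of the gene sequence above.
    print(translate("ATGCTGTCCA TGACCAGTCG ACGTGTGTCT"))  # MLSMTSRRVS
    print(translate("GCCTCCCTGA CCAAGCGTTT CTAA"))        # ASLTKRF
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to gc_content gives a fraction that can be compared with the GC content field.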