Gene Caul_0563 details

Gene Information

Locus tag: Caul_0563
Symbol:
ID: 5898018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 611271
End bp: 613484
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 60%
IMG OID: 641561045
Product: TonB-dependent receptor
Protein accession: YP_001682194
Protein GI: 167644531
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
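
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record for Caul_0563.
START_BP = 611271
END_BP = 613484
GENE_LENGTH_BP = 2214
PROTEIN_LENGTH_AA = 737


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for the values in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported gene length (2214 bp) and protein length (737 aa) agree with the start and end coordinates.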


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.354996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAATC GGTCGAATAC TCGTAAATTC TTGCTTTCCA ATGCGGCTCT GGCCGCCATG 
CTCTACGCGA CGTCTGCAGG AGCGCAGACG TCGCCCGAGG CAGGCGGCGC TTTCGTCGAG
GAGGTGATCG TCACGGCGGC CAAGCGCTCG GAAAACCTCC AGTCGGTCCC GATCTCTGTC
TCCGCGATCA CTGGCGATTC ACTGGCCAAG TCCCGCATCG ACAGCGTCGA TGCGCTGGTC
AGCAGGATTC CCAACCTCCA GCTGAGCTCG ACTGTGGGCG CCAACACCCC GATCTTCGCG
GTTCGCGGCG TCTCGATGTC GGACTACTCG CTCAGCCAGG CGAGCCCGGT CGCGACCTAT
TATGACGAGG TCTACAAGGG CAACTTCGCC TTGCTCGGCG TGGCGATGTA CGACCTCGAG
CGTGTTGAAG TGCTGCGGGG CCCCCAGGGG ACGCTCTATG GCAAGAACAC GACTGGCGGC
GCAGTCAATC TCATCAGCCG CGACGCCAAG CTGGGCGAGA CCAGCGGCTA CGTTAGTGCA
GGGTATGGCA ACTACAACCG CGTCGATCTG AACGGCGCCG TCAACCTGCC GCTAGGGGAG
GACGCCGCTC TGCGGTTAGC CGGCACCTAT AGCCGCGCGG ACGGCTGGTT CGAGAGCGTG
ACCCCTGGCA TCCGGGACCT CAACGAAGTT CGCGAATACG GCGTGCGCGC GACCCTCAAC
TACGAGCCGG CCGACGGCGT CCACCTGAAG CTTGTTGGGT CGACAAGCCT CCAAAACCCG
CAGAACTACG GGATCTACGC GACCCCGGAA GCGACCAATC GGCCCGGGCT CGACACTTGG
GAAGTCGCCT CGAACATTCC CGATCGCCGC CGGGCGCGGA CTTGGTCCCT TGCGCTCCAC
GCCGACGTCG ACGTTTCCGA CACCCTGACT TTGACGAGCA TCACCTCCTG GGACAAAGGA
ACGCTTCACT TCGTAGAGGA CACCGACGGG AAAGCGACCA GTGAGCTGGA AATCACCTAC
GGCGATCGGG CCAGTCAGTT CGCTCAGGAT CTGCGCCTGA CCAGCGACAC CGGCGGTCCG
TTCAACTTTA TCCTCGGGGC CTATTTCAAT CGTGAGAAAG TGTTCAACAA CAACGTCCAG
AAAATGATGA ACGAGCTCGA CGTCAACGGT GACGGCGTCA TCGATTACAA CGACTGCATC
GACAGTGGTT TTGTCGCAGC TTGCCAGCTC ACGAACCAGT TCTATCAAAC CAAGAAAAGC
TACGCGCTTT ACACCGACGT GAAGTATGAG CTCGATGAGA ATCTGACGCT TCGCGGCGGA
CTGCGCTACA CCCGCGACGA AGGCGAGCAG GCCGACTACC AGTCCGACGC TTATGGCGCC
GACGGCGTCC TGATCACGAA CCTCATCCCG CCGACGAACC TTGAGTACTC GGCCAAAAAC
CTGTCGGGCA AGATCGGGGT CGACTATAAG CTGTCGTCGA ACAAGATGGT CTACGCGACC
ATCAGCACCG GCTATCGTGC GCCGAGCTTC AACGCACAGG CTTTCTTCGA CCCCAGCGAA
ATCACTGTAG CCAAGGCCGA GAAGGTCACT TCGTACGAGG TGGGTGCGAA GACGCAGTTC
GCCGATCGCC GAGTGACGCT GAACATGGCC GCCTTCTACT ACGACTATCG CAATCAGCAG
TTTCTGAACA TCGACGCGGC CACCGCGCGC CAACAGCTAC TCAACATCCC GAAGTCTCGC
ATCTATGGCG GCGAGGCGGA ACTGTCGGCA TACGTGAACG AGAGCTTCAG CCTGCGCGCC
GGCCTGGGCC TGCTATCGAC CGAAATCCGA GAAGGTACGG TCAGCGGCGT CGATGTCAGC
GGCAACAAGC TGTCCAATGC GCCAGAAGTC TCGGCCAATC TGGGGCTCGA CCTCACCATC
TTCGAAAACG AGAATGGCAA GCTCTCGCTA CACCCGGAAG TGGCCTATCA ATCGAGCCAG
TACTTCGAAG TGATCAACAT TCCGAGGCTC AGGCAGGACG GCTACGCGCT ACTGTCGGGA
CACATCAGCT ATGAGACAAC GGATGGTCGC TGGAACGCCT CGGCGTGGAT CAAGAACGCC
GCCAATGAGA AATACTTCAC GTCGCGCATC GATCTGCTCA ACAACTGGGG CTTCGATTAC
AACCAGCTCG GCACACCGAG AACGTATGGG ATCTCTATAG GAGCCAAATT CTAG
 
Protein sequence
MYNRSNTRKF LLSNAALAAM LYATSAGAQT SPEAGGAFVE EVIVTAAKRS ENLQSVPISV 
SAITGDSLAK SRIDSVDALV SRIPNLQLSS TVGANTPIFA VRGVSMSDYS LSQASPVATY
YDEVYKGNFA LLGVAMYDLE RVEVLRGPQG TLYGKNTTGG AVNLISRDAK LGETSGYVSA
GYGNYNRVDL NGAVNLPLGE DAALRLAGTY SRADGWFESV TPGIRDLNEV REYGVRATLN
YEPADGVHLK LVGSTSLQNP QNYGIYATPE ATNRPGLDTW EVASNIPDRR RARTWSLALH
ADVDVSDTLT LTSITSWDKG TLHFVEDTDG KATSELEITY GDRASQFAQD LRLTSDTGGP
FNFILGAYFN REKVFNNNVQ KMMNELDVNG DGVIDYNDCI DSGFVAACQL TNQFYQTKKS
YALYTDVKYE LDENLTLRGG LRYTRDEGEQ ADYQSDAYGA DGVLITNLIP PTNLEYSAKN
LSGKIGVDYK LSSNKMVYAT ISTGYRAPSF NAQAFFDPSE ITVAKAEKVT SYEVGAKTQF
ADRRVTLNMA AFYYDYRNQQ FLNIDAATAR QQLLNIPKSR IYGGEAELSA YVNESFSLRA
GLGLLSTEIR EGTVSGVDVS GNKLSNAPEV SANLGLDLTI FENENGKLSL HPEVAYQSSQ
YFEVINIPRL RQDGYALLSG HISYETTDGR WNASAWIKNA ANEKYFTSRI DLLNNWGFDY
NQLGTPRTYG ISIGAKF