Gene Caul_3760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3760 
Symbol 
ID5901222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4075691 
End bp4078099 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content67% 
IMG OID641564283 
ProductTonB-dependent receptor 
Protein accessionYP_001685385 
Protein GI167647722 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCC GCGCCTTTCT GCTCGCCTCC GCCATGACCT TCGGCGTCGT CGCGTCGAGT 
CAGGCCCTGG CCCAGGACGC GGCCCAGCCG AACGCCGTCG ACGGAGTGAT CGTCCAGGCG
CGCGACAAGG CCGGCCTGCT GGAGAAGCAG CCCAGCACCA CGGTGTTCGG CCTGGAGAAG
TCACTGCTGG AGACCCCGCG CTCGGCCAGC TTCGTCAGCG ACACCACCCT TCAGCGGTTT
GGCATCGAGA CGATCGACGG CCTGACGGCG GTGTCGCCGG GAACCTATAC CGCCAGCTTC
TACGGCGTTC CGGGCGTGCT CAACATTCGC GGCACCCTGG CCGAGAACTA TTTCCGCGGC
TTCAAGCGCG TCGAGAACCG CGGAACCTAT TCCACGCCGA TCGGCGGCGC GGCCCAGATC
GAGATCGTGC GCGGCCCGCC CACCCCGATC TATGGAGCCG GCAAGGTCGG CGGCATGTTG
AACTTCATCC CCAAGTCGGC GCGCGACGAG GGCCGGTTCC TGACCGACCC CAAGGGCGAA
CTGACCGCCA CGGTCGGGGC CTATGACAAG AAGAACCTGA CCGGACAGGT CGCCCTGCCG
GTCAAGCTGG GTTCGGCCGA CGGCGGGGTC TACGCCTATG GCGAGCTGGA CGACTCCAAG
AGCTTCTACC GGGGCCTGCA CCCCAAGCGG CAACTGGCTG AACTCTCCGC CGACTTCGAC
CTCAACAACG GCTGGAGCAC CGCCTTCGGC GGCATGGTCT ACCATTCGAC GGGCGACGTG
CAGACGCCCG GCTGGAACCG CCTGACCCAG GACCTGATCG ATCACGGGAC CTACATCACC
GGCCGCGACA CCAGTCTGGT CGACGCCGAC GGCAACGGCA GGATCACGCC GGGCGAGGCG
GGCTTCTACC CGTTCGGCAG CGCGCTCTAT ATTCCCTACG GCGGCGCGCC CGCGACCGAC
GCCAACCACA CGCTGGACAC GGGTGTCGGC ACGACCAAGC TCGACCCGCG CACCGTCTAT
ATCAGCGCCG CCGACTTCTC GAAGACCTGG ACCAACACCC TCTATCTCGA CCTGGCCAAG
CGGTTCGACG ACGACAGCGT GCTGAAGCTG CAGCTGTTCT ACGACGACCA GGAGAACAAG
CGCTTCGTCT CATACGGCTA TCCGGCCTGG TTCGACAGTT CGGTCTGGGA GGCGCGGGCT
AGCTACGCCT TCGCCCGCGA CTTCGGGGCG GTCAGCACCC AGACCATCGT GGGCGCCAGC
TATCGCGAGT TCCAGGGCCG CCGTCGCGAG AGCTTCAACA GCGGCCTGAT CGCCCTGGAT
CGACGCGACA TCCGCTATGG CGCCACGCCC AACGACATCA TCGACAGCCC GTTCAGCACC
GAGCCCGCCG GTGTCCAGGG CCTGGAATGG GAGAACGACA ACAGGAGCAC TTGGAGCCAG
ACGGGCCTGT TCTTCACCAG CGATGTCAAG TTCGGCCAGC GCCTCACCCT GACCCTGGGC
GGCCGCTATG ACGGGTACGA CGTCGCGGCG CACGACACCG GCTACCTGCC CTTCACCGTC
CCCGGACGTC AGACCGACAG CCGGGGGAAG GGGACCTACA GCGCCAGCCT GACCTACCAG
ACCCCGGTCG GCCTGATGCC TTACATCAGC TACGCCAAGG CCTCGGCCCT GGAGGTCAGT
CAGGCCGGCG ACATCGCCCC GGGCCTGGTC GCCGACGGCT CCTGGCTGTC GGACAGCGAC
CTGGCGGAGG CCGGCGTCAA GTTCCAGCTG CTGCGCGGGA CCCTGGTCGG CTCGCTGGCT
GGCTATCGCC AGAACCGCAC CCAGTTGTCG GGCCTGACCC CGGTGGTGCA GGGCACGCGG
GCCAAGGGCG TGGAACTGGA GATCCGCTGG CTGGCGTCCG AGCATGTCAG CTTCACCGCG
ACCGGCAACA CGCAGCATAC GACGGTGAAG GGGCCGGACC TGTCGTTCCA GTACATCCCG
GCCTACACGG CCGGCGTAAG CGGCGCCCAG GCCTATGGCG GGTCCTACGT GGTCTGGACG
TTCGGCGGGC CGACGGGCCT GCCGGGCCGC CTGGGCGACT ATGATTACAC CCTGATCCCC
AAGTCGGTGG TCAGCCTGTA CGGCGCCTAT ACCAGCGACA CGCATGACTG GGGTTCGGCC
GGGGCGACCC TGGGCGTCAC TCACGTGACC AAGACCTCGG GGACGGTGCA GGACGCGGTG
ACCTATCCCG CCTATGCCGT GGTCAACGCC TCGGCCTATT TTGCCCGCGG GCCGTACACC
GCCGAGCTCA ATGTCGATAA CCTGTTCGAC AAGCTCTATT TCACCCCGGA CGCCGACACC
TATGCCAACC TCGGAGCTCT GCCCGGCAAG GGGCGGGAAT GGCGCGTGAC CCTGAAGCGG
ACGTTCTGA
 
Protein sequence
MSIRAFLLAS AMTFGVVASS QALAQDAAQP NAVDGVIVQA RDKAGLLEKQ PSTTVFGLEK 
SLLETPRSAS FVSDTTLQRF GIETIDGLTA VSPGTYTASF YGVPGVLNIR GTLAENYFRG
FKRVENRGTY STPIGGAAQI EIVRGPPTPI YGAGKVGGML NFIPKSARDE GRFLTDPKGE
LTATVGAYDK KNLTGQVALP VKLGSADGGV YAYGELDDSK SFYRGLHPKR QLAELSADFD
LNNGWSTAFG GMVYHSTGDV QTPGWNRLTQ DLIDHGTYIT GRDTSLVDAD GNGRITPGEA
GFYPFGSALY IPYGGAPATD ANHTLDTGVG TTKLDPRTVY ISAADFSKTW TNTLYLDLAK
RFDDDSVLKL QLFYDDQENK RFVSYGYPAW FDSSVWEARA SYAFARDFGA VSTQTIVGAS
YREFQGRRRE SFNSGLIALD RRDIRYGATP NDIIDSPFST EPAGVQGLEW ENDNRSTWSQ
TGLFFTSDVK FGQRLTLTLG GRYDGYDVAA HDTGYLPFTV PGRQTDSRGK GTYSASLTYQ
TPVGLMPYIS YAKASALEVS QAGDIAPGLV ADGSWLSDSD LAEAGVKFQL LRGTLVGSLA
GYRQNRTQLS GLTPVVQGTR AKGVELEIRW LASEHVSFTA TGNTQHTTVK GPDLSFQYIP
AYTAGVSGAQ AYGGSYVVWT FGGPTGLPGR LGDYDYTLIP KSVVSLYGAY TSDTHDWGSA
GATLGVTHVT KTSGTVQDAV TYPAYAVVNA SAYFARGPYT AELNVDNLFD KLYFTPDADT
YANLGALPGK GREWRVTLKR TF