Gene Caul_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0572 
Symbol 
ID5898027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp621332 
End bp623608 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content60% 
IMG OID641561054 
ProductTonB-dependent receptor 
Protein accessionYP_001682203 
Protein GI167644540 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACC GACAACTTGG CCTCACCGTC TCACGTCTGG CGATTTGCGC AGGCCTGATG 
GGATCATTCT TTGCAGCGGA GGCCCGCGCC CAGACAGAAC CTGGCGCCGT CGCCCACGGC
GCCGACCAGC AACCCGCCGC CATCGGACTT GAAGAGATCG TTGTCACCGC GACGCGCCGC
GAGACTTCAC TCCAAAAAAC AGGACAAGCG ATCTCGGTCG TCTCGGGGGC GGAGCAACTG
CTGACGGGCC GTCAGGTCCT GGATGACCTG AAAACCTCAA TGCCCAACGT CAACTTCGCG
TCGACCAGCA ATACGACGCA GATGTTCGTT CGCGGCATCG GCAACACGTT CCTGACGGCC
GGCGGCGATC CGGGCGTCGC CCTCTATCAG GACACCGCCT ATATCTCGGA CCAAACCACC
TCCAGCGTCA GTTTCTTCGA TGTGGAGCGT GTGGAAGTCC TGCGAGGACC GCAGGGCGGG
CTCTATGGTC GTAACGCCAC GGGCGGCGCC ATCAATATCA TTTCGGCCAA GCCGACGTCC
GAGATGACCG GTCGGCTTTC CGTGCTCGCC GGCGACTATG GTCGGCTGGA GAGCGAGGGC
TTCGTCTCGG GGCCGCTGGG CTTTGCCAAC ACCGACTTTC GTCTTTCATA CCAGCTTCAT
CGATTGGATG GCTTTGTCCG CAACATCTAT GAGCCGACCG TCGGCGCTGC GGGTTTCGCC
GCTGCGCCGG ACCGGCTCGA TGACATGAAT TCCGATGCGG TGCGCCTTCA GACGGCGACG
CAATTGGGTT CGGGCGGCGT CTTGCGTGTC ATCGTCACCC ACTATCGTGA AACGGACAAT
GGCGCCGCCC TCGCGGTCGT GCCCGACTCT GGCTTCATCT ATCCGGCCCA GCATCTCTAC
GGTCTTGTTC CAACCAGCGA TCCGCGCAAC ATCACCGTCG ATGAGGGCTA CAACAAGATC
CGCCTGACCA ATTACAACGC CAACCTCGAT CAACCGATCG CGGACGGTAT GCTGACGGTC
ACCGGCAACT ATCGCCGCTC CCACCGGGAC TTTCTCAACG ACTGCGACGG GACGTCCGCC
AATAGCTGTT CGTTCCGGGT CGAAACCTCG AGCGACGACT ATTTCGGCGA TATTCACTAC
GCCTCGTCAA ACGAAGGCCC GTTCCGTTAT ATTGTCGGCG CGACCTACAC GCACTTCATC
CAAGATCAAA TCGCCACCAT ACCGTTTGAA TTTCCGACGT TTTATCTGAC GGGCAACCCG
GCCGATCGAG CAGCGTTTGA TTTTCCGACG GCGTCGGGCG GAAGGCTCCG GACGAACGCG
TGGGCGGCGT ATGCGGACGC GCGATACGCC TTGTCGGACG TCTGGTCGCT GATCGGACAA
GTCCGTTATA GCCGCGCGAG GAAAAAGGCG CTCGAAACCC TGAAGTTGCC CGCCTTCGGT
CTTGTCGTCG TCGACTCGCC CAATCGCGCC AGTGACTCCG GCGTACCCTT CAAGGTCGGT
GTGGAAGGAC AACTGACGAA CGATGTCCTG GTCTATGGCA ACTTCTCCAC CGGGTTGAAG
GACGCGGCAA TAAACCTTGG CACCCTGCAA ACGGCGCCGG TGGCCAAGGA AACGGTTCAA
AGCGTCGAGG TTGGCTTCAA ATCAAGCTTG TTTGACCGCC GGCTGCGGAT CAACGGCGCG
GCCTTCAACA GCGACTATAA GAACTTGCAG ATCTCCCAGC TCAAGGGGAC CTTGGCCACG
CTCGCCAACG CCCCGAAAGC TCAGATCAGC GGCGCCGAAC TGGAGGTTAC CGCAGAGCCG
GTGTCGGGGC TGCGATTCAA TGGCAGCGTC GGCTATCTCG ATCCCAAGCT CAAGCGGTTC
GAAAACACGC CGAACCTGCC GGCCGATTTC GTTCCCCAGC CTGTCCTCGT GTCGTTGGAT
GGAAATCAGC TGCCCTATGT CGCGAAGTGG AACGTCACCC TGGGCGCAAC CTACAGGTTC
GAACCTACCG CGGGCATCGC CGTGGAGCTG GGCGGAAACT ACTACTACCA GAGCCGGATC
TACTTCAACG AATTCAATGC ATTGAGAAAT TCTCAAAAGC CGGTCGGGCG GGTCGATCTG
TCGGCTTCGA TCGGGCCGTC CAATGACCAA TGGAAAGTCT ATGGCTACAT CCGCAACCTG
ACCGATGAAA CGGTGCTGGC CGGAACAACG ATCTATGCGG GACTGCTGGG GGCTGAAAAG
GGTGTGTCCT ATGCGCCTCC CCGCAACTTC GGCATCGGAT TCTCCTACAA TTTCTGA
 
Protein sequence
MSHRQLGLTV SRLAICAGLM GSFFAAEARA QTEPGAVAHG ADQQPAAIGL EEIVVTATRR 
ETSLQKTGQA ISVVSGAEQL LTGRQVLDDL KTSMPNVNFA STSNTTQMFV RGIGNTFLTA
GGDPGVALYQ DTAYISDQTT SSVSFFDVER VEVLRGPQGG LYGRNATGGA INIISAKPTS
EMTGRLSVLA GDYGRLESEG FVSGPLGFAN TDFRLSYQLH RLDGFVRNIY EPTVGAAGFA
AAPDRLDDMN SDAVRLQTAT QLGSGGVLRV IVTHYRETDN GAALAVVPDS GFIYPAQHLY
GLVPTSDPRN ITVDEGYNKI RLTNYNANLD QPIADGMLTV TGNYRRSHRD FLNDCDGTSA
NSCSFRVETS SDDYFGDIHY ASSNEGPFRY IVGATYTHFI QDQIATIPFE FPTFYLTGNP
ADRAAFDFPT ASGGRLRTNA WAAYADARYA LSDVWSLIGQ VRYSRARKKA LETLKLPAFG
LVVVDSPNRA SDSGVPFKVG VEGQLTNDVL VYGNFSTGLK DAAINLGTLQ TAPVAKETVQ
SVEVGFKSSL FDRRLRINGA AFNSDYKNLQ ISQLKGTLAT LANAPKAQIS GAELEVTAEP
VSGLRFNGSV GYLDPKLKRF ENTPNLPADF VPQPVLVSLD GNQLPYVAKW NVTLGATYRF
EPTAGIAVEL GGNYYYQSRI YFNEFNALRN SQKPVGRVDL SASIGPSNDQ WKVYGYIRNL
TDETVLAGTT IYAGLLGAEK GVSYAPPRNF GIGFSYNF