Gene Caul_1319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1319 
Symbol 
ID5898774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1394550 
End bp1397750 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content64% 
IMG OID641561804 
ProductTonB-dependent receptor plug 
Protein accessionYP_001682947 
Protein GI167645284 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0274612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGA CTTCCACCAA GGGCGGCGCT CGCGCGCGTC TTTTGACGTC GACGCTGCTG 
GCTGGCCTGG CCACCGTCGC CGCGCCGCTG GCGATCACGG CGATCGCCAC GGCGATCCCG
ACCCTGGCCT CGGCGCAGGA CTACACGAGC GGCACCCTCG TCGGCACCGT CCGTGACGCC
AGCGGCGCTC CGGTCAGCGG CGCCGCCGTC ACCGTCAAGT CGCTGGGTCA AGGCTTCACC
CGTCAACTGG TCACCGGCAG CGACGGTCAG TTCCGCGTGC CGCTGGTGCC GCAAGGCGGC
TATTCGGTCG CGATCTCCAA GGAGGGCTTC CAGCCCACGA GCGACGGCGC CGTCGCCGTG
CGTTCGGGCG GCGACAGCGC CTACAGCTTC ACCCTCTCGT CGGCTGACGC GTCGGTTTCG
GAAGTCGTGG TCACCGCCAC CGCCAATCCG CAACTCGACT TCGGCGGCAC CACCACCGGC
CTGTCGGTCG ACCTGGAAAC CCTGACCAAG CAGGTTCCCG TCAACCGCAC CATCACCAGC
GTCGTTCTGC TGGCGCCCGG CGCGGTTCAG GGCAGCAACA CCAACTTCCG TGGCCAGCCT
TCCATCGGCG GTTCGTCGGT CGCTGAAAAC GCGTTCTACG TGAACGGCCT GAACATCACG
AACTTCGACA ACTACCTGGG CGGCTCGACC GTCCCGTTCG ACTTCTACAA GTCGGTGGAC
GTGAAGACCG GCGGCTATCA AGCCGAATTC GGCCGTTCGA CCGGCGGCAT CGTCAACGCC
GTCACCAAGG CCGGCACCAA CGAGTTCAAG TTCGCCGTCC GCGGCCAATG GGAACCCGAC
AGCCTGCAAG AAGACCAGAA GGACACCTTC CTGCGTCGCG GCAAGCTGGC CAAGACCGAC
AACAAGTCGC TGACCCTGGA AGCCGGCGGT CCGATCATCC CCGATCGCCT GTTCTTCTTC
GCCATGACGC AGATGCGCGA CAACCAAACG ACGTTTGGCA GCATCACGGG CGGCAGCTAC
AATAAAGAAA CCCAGCGCGA CCCCTTCTAT GGCCTGAAGC TGGACGGCTA CATCACCGAC
CGCCAGCACC TCGAATTCAC CTATTTCGAC ACCAAGGGTT CGGCCAAGCG CAGCACTCGG
CAATACGAGT TCGACGACAC CACCGGCACC GACACCTTCG GCGACAAGCT GGGCGGCACC
CTGTTCTCGC TCGGCGGCGC AAACTATGTC GGCAAGTACA CCGGCACGTT CACCGACTGG
TTCACCCTGT CGGCGGCCTA CGGCGTCACC AAGGACAGCT ATCGCGTCAC CCCGCAGGAT
CTGTCGGGCA ACTACGTAAC GAACACGGCT GATCCCGCCC ACCCTGGCGA GACTTCGGTC
ATCAGCCGTC AAAAGACGTC GTCCTATGAC TCGAGCTACG AAACCAAGCG CGAGTTCTAC
CGCATCGACG CCGACTTCTA CTTCGACCTG CTGGGCAAGC ACCACATTCG CGCTGGCTAC
GACCAAGAAG ACCTGACCCT GGATCACGTG AACCAGTATC CGGGCGCCGG CACCGATTGG
GACTTCCTCC TGGCGGGTGC GACTGACGCG CGTGGCGTGG CTGCGGGGCA GACCTACGTC
AAGGGCCGCA CGTTCAAGAC CGGCGGCATC TTTGAAGGCA CCAACAAGGC CTATTACATC
CAAGACTCGT GGGACATTCT GTCCAACCTG ACCCTGAACC TGGGCATCCG CAAGGACCAG
TTCCAGAACA GCGGCGCTCG CACCGCCAAG GGCAGCGAAA CCTTCGTCGA GTTCGACAAC
GAAATCGGTC CGCGGATCGG CTTCACCTTC GACCCGTTCA GCACCGGCAA CGACAAGATC
TTCGGTAACT TCGGTCGCTA CTACCTGCCG GTCGCGTCGA ACACCGCGTT CCGTCAAGCC
ACGGCCAGCT ACGACATCGA CACCTTCTTC ACCGCGCCCC AGGGCGTGGC GCTGGGCGCC
GATGGCACGC CGATCCGTGG CACGCAGATC ACTCAGACGA CCAACCCGGG CTTCGCCTCG
GCGGCCGCCT GTCCGGCTCC GGTGGCTGGC GTGACCCCGC CCGGCGCCAC CGACGCGGTC
GGCTGCGCCG TCCGTGGCGA CGGTTCGCTG CAACCCTTCG CCGCGAACAC CAGCAAGAAC
CTGAAGTCCA CCCAGGAAGA CGAATACATC CTGGGTTACG AGCACCAGTT CAACTCGCTG
TGGAAGGCCA GCGCCACGCT GACCTATCGC AACCTGAACC GGGTTTCGGA AGACGTCGCC
ATCGACGCCG CGGTGCGCAA CTACTGTGTC AAGAACGGCA TCGCCGGCTG CGGCTCGACC
TACAATGTCG CCGGTCCGAC CCCGGGCTGC ACCACCTTCT CGGCCGGTCC GCGCGCCGGG
CAGACGCGTT GCGCGGGCTT CTCGGGCTTC CGTCAGTACA CGATCGTCAA CCCGGGCGAG
GCCTCGACCA TCACCCTGCG GCAGCCGCTG CCGGGTGAAG CCACGGCTCG CACCATCAGC
TTCTCGAAGG CTGACCTGGG CTATCCGACC GTGAAGCGCG AATATGTCGG TCTGGAAATG
AAGGTCGAAC GCGCCTTCGA CGGCAAGTGG GGCTTCCAGG GCTCGTACGT CCTGGCGGAA
TCGAAGGGCA ACTACGAAGG CTTCGTGAAG TCGGACGCCG GCAACGGTCA AACCGACTCG
GGCATCACCC AGGACTTCGA CCAGGTGAGC CTGACCGACG GCGCCTACGG CCTGCTGCCG
AACCACCACG CCCACCAGTT CAAGCTGTTC GGTTCCTATG CGATCACCGA CAACCTGCTG
GTCGGCGGCA ACGCCCTGGT CCTGTCGCCC AAGCATTACG GCTGTATCGG TCTTCACCCG
ACCGACGACA TCGTGAACTC GGGCTACGGC GTGGCGTCCT TCGCCTGCGG CGGCAAGATC
GTTCCGCGCG GTTCGGCCTT CGAAACGCCC TGGACGGCTC GTCTGGATAT CGCGGTGCGT
TATCTGGTGC CGACCACTAA GTTCATCCCG GGTGGCCTGA CCCTGCGCGC CGACATCAGC
AACATCCTGA ATTCTCGGAC CGAGACCGAA GCTTGGGAAT TTGGCGACAG CGACGCGGGT
GGTGCGGACG AGCACTACAA GGACCCGATC CAATACCAAG CGCCCCGTTC GGTGCGTCTG
GGCTTCGACT GGGAGTTCTA G
 
Protein sequence
MKMTSTKGGA RARLLTSTLL AGLATVAAPL AITAIATAIP TLASAQDYTS GTLVGTVRDA 
SGAPVSGAAV TVKSLGQGFT RQLVTGSDGQ FRVPLVPQGG YSVAISKEGF QPTSDGAVAV
RSGGDSAYSF TLSSADASVS EVVVTATANP QLDFGGTTTG LSVDLETLTK QVPVNRTITS
VVLLAPGAVQ GSNTNFRGQP SIGGSSVAEN AFYVNGLNIT NFDNYLGGST VPFDFYKSVD
VKTGGYQAEF GRSTGGIVNA VTKAGTNEFK FAVRGQWEPD SLQEDQKDTF LRRGKLAKTD
NKSLTLEAGG PIIPDRLFFF AMTQMRDNQT TFGSITGGSY NKETQRDPFY GLKLDGYITD
RQHLEFTYFD TKGSAKRSTR QYEFDDTTGT DTFGDKLGGT LFSLGGANYV GKYTGTFTDW
FTLSAAYGVT KDSYRVTPQD LSGNYVTNTA DPAHPGETSV ISRQKTSSYD SSYETKREFY
RIDADFYFDL LGKHHIRAGY DQEDLTLDHV NQYPGAGTDW DFLLAGATDA RGVAAGQTYV
KGRTFKTGGI FEGTNKAYYI QDSWDILSNL TLNLGIRKDQ FQNSGARTAK GSETFVEFDN
EIGPRIGFTF DPFSTGNDKI FGNFGRYYLP VASNTAFRQA TASYDIDTFF TAPQGVALGA
DGTPIRGTQI TQTTNPGFAS AAACPAPVAG VTPPGATDAV GCAVRGDGSL QPFAANTSKN
LKSTQEDEYI LGYEHQFNSL WKASATLTYR NLNRVSEDVA IDAAVRNYCV KNGIAGCGST
YNVAGPTPGC TTFSAGPRAG QTRCAGFSGF RQYTIVNPGE ASTITLRQPL PGEATARTIS
FSKADLGYPT VKREYVGLEM KVERAFDGKW GFQGSYVLAE SKGNYEGFVK SDAGNGQTDS
GITQDFDQVS LTDGAYGLLP NHHAHQFKLF GSYAITDNLL VGGNALVLSP KHYGCIGLHP
TDDIVNSGYG VASFACGGKI VPRGSAFETP WTARLDIAVR YLVPTTKFIP GGLTLRADIS
NILNSRTETE AWEFGDSDAG GADEHYKDPI QYQAPRSVRL GFDWEF