Gene Caul_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0970 
Symbol 
ID5898425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1024674 
End bp1027649 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content67% 
IMG OID641561452 
ProductTonB-dependent receptor 
Protein accessionYP_001682598 
Protein GI167644935 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGC GGGCGCTTGC CCTGATCAGC ACGAGCATTC CCGCGGCCGC GCTGATGGCG 
ACGCCCGCGG TTGCCGAGGA CTATACGAAC GTGCAGGCCT CGGGCCGCGT GGAGTCGACC
GACGGCAAGC CGGTCGCCGG CGCGGTGGTC CGGATCACGT CGGAAACCCT GGGCGTGAAG
CGCTCGGCGG TCACCAGCGC CAGCGGCGCT TACGTCATTC CGCAGCTCGC GCCGGGCGTC
TACGCGGTCT CGGTGACGGC GGCGGGCTTC GACACCTATG CCGAGAAGGG CGTGCTGATC
AGCCGTTCGG GCGCATCGAA CCTCTTCACC CTGGCCCCGG TCGGCTCGGT CGAGGCCATC
GAGGTCAAGG CTGGCCGCAC CCGCACGCCG GACTTCGAAA TCACCACCAC GGGCGCGGCC
ATCGAGATCG GCGAACTGGC CGAACGCGTG CCGGTTGGCC GCTCGCTGCG TGACATCGCC
CTGATGTCGC CGGGCGTGGT CCAGGGGTCG TCGTCGGCCA ACGGGACCTT CGCCAACCAG
ATCTCGATCT CCGGCGCCAG CTTCATCGAG AACGCCTTCT TCGTGAACGG CCTGAACATC
ACCAACTTCC GCATGGGCCT GCTGCCCGTC GAGGTGCCGT TCGACTTCTA CAAGTCGGTG
GAAGTGAAGA CCGGCGGCTA TCCGGCCGAG TTCGGCCGCT CGACCGGCGG CTTCGTCAAC
GCCATCACCA AGTCGGGCGC CAACGCGTTC CATGGCGGCG TGACGGCGAC GTGGGAGCCG
GGCGACCTGC GCGATTCCGC GCCCAACACG TTCAAGAACA ACTTCAAGGA CGCCACCTCC
AGTCGCGAGG AATGGGTGGC CGAGCTGGGC GGGCCGATCA TCAAGGATCG CCTGTTCTTC
TACGGCCTCT ACAATCAGCG CAAACTCGAG TCCTTCAGCC CGTCGGCCAG CCAGGACAAC
GCCACCCGCA CCCGCGACGA CGCGCCGTTC TGGGGCGGCA AGCTGGATGG CTGGCTGACC
AGCAAACAGC ACCTTGAGTT TACCTATTTC GACAGCACCA AGGACGTCCG CAATCGCAGC
CTGAACTACA ACCGCGCCAC CAAGCAGGTC GGCGCGGAAA CCGGCGGCAC CAACCAGCGC
TCCGGCGGCG AGAACTACAT CGCCCGCTAC ACCGGCACCT TCACGCCCTG GTTCACCCTG
TCGGGCGCCT ATGGCGTCAA CAAGTTCCGC GATGGCCAAC TGCCGCTCGA CACGACCCAC
GAGCGCGTCA TCGACTACCG GACCAATTCG GCCGGTGTCG ATATCGGCGT CAACAAGGTG
ACCGACGCGA TGTCGTTCAA CGACGACGAG CGCACGTTCT ATCGCGCCGA CGCCGACTTC
AACTTCGACC TGTTCGGCGC CCACCACCTG CGGGTCGGCT ACGATCACGA GGAGAACACC
GCCACCCAGG TGTTCGAGAC CATCGGCTCG GGTTTCCTGA AGGTCTTCCG CGCCACCGGC
GCCGACCAGA CCGCCCTGCC GGCGGGCACG GACTATGTGA CCACCCGCGT CTACCGCAAC
AGCGGCTCGT TCAGCACGGT CAACCAGGCT TTCTACGCCC AGGACCAGTG GTCGCTGTTC
AATGATCGAC TGCAGTTCGA GGCGGGCGTC CGCAACGACC GGTTCGACAA CCGCAACGCC
GACGGCTCGA CCTTCTACGA TCCGGGCAAC CAATGGGCGC CGCGCATCTC GGTCTCGGGC
GACCCCTTCG GCGACGGCAA GTCCAAGCTC TACGGCTACT TTGGGCGCTA CTACCTGCCG
CTGCCCAGCG ACCTGTCGCT GAAGTTCGCC GGGTCGCTGG TGACCTATAC CCGCTACAAC
CTGCTGACCG GCGTCGCCGC CGACGGTACG CCGAGCCTGG GCGCGCCGGT CACCAGCGTG
GCGGGCATGG CGCCCTGTCC GGACACCGGG ACGGCCAATT GCCTGGTCTC GGCCTCGGGC
CTGGTGGCCG ACACCGCCGA GTCGATCGCC CACAACCTCA AGCCCCAGTC AGCCGACGAG
TTCGTGCTGG GCTACGAGCG GCGCTTTGGC GACCTGATCA AGGTGGGGGC CTATTTCACC
CACCGCGAGC TGAACAACAT CGTTGAGGAC GTGGCCATCG ACACCGGCGC GCGCGCCTAT
TGCGTGAAGG CCGGCTTCAC CGCGGCCCAG TGCCGGTCCA GCTTCGGCGG CGGTCGCCAA
TGGGTGATCG TCAATCCCGG CGAGGACGTG ACGATCCGCC TCAACAGCCT GCCGGACGGC
TCCAAGCCGG TCGTCACCCT GGCGGCCGCG GACCTGACCT ATCCCAAGCC CAAGCGCGAC
TACAACGCCC TGACCGCCAC CTTCGACCGG ACCTTCGACG GCAAGTGGTC GCTGTCGGGC
TCCTACACCT GGGCCAGCCT GAAAGGGAAC TACGAGGGCG GCGTGCGCTC CGAGAACGGC
CAGCTGGCCA TCAACACCAC GGCCGACTTC GACTCGCCGG GCTTCCTGGA CGGCGCCGAC
GGCTACCTGC CCAATCACCG CCGCCACACC TTCAAGGGCT ATGGCAGCTA CCAGGTCAAC
AAGTGGCTGA CCCTGGGCGG CAACGGCTCG GTGCAGTCGC CGCGAAAGTA CAGCTGCATC
GGGATCGTGC CCAATGCGGT CGATCCGGTC GCCTTCGGCT ACCAGGGCTA TGGTTTCTAC
TGCCAGGGCA AGGTCGTCGA ACGCGGCTCG GCCTTCGAGG GCGACTGGGT GACCCAGTTC
AACACCAGCG CCGTGCTGAC CCTGCCGACG CCGGGCGACC GCTTCGACGC CTCGCTGCGA
CTGGACGTCT TCAACCTGTT CAATTCGCAC GCGGTGACGG CCTATCACGA GTTCGGCGAC
GTGGGCGGCA GCGGCGTCGC GGACGTCAAT TTCCGCAAGC CGGTCGACTA CCAGACACCG
CGCTATGCGC GGATCCAGCT GCGCGTCGCC TTCTAG
 
Protein sequence
MALRALALIS TSIPAAALMA TPAVAEDYTN VQASGRVEST DGKPVAGAVV RITSETLGVK 
RSAVTSASGA YVIPQLAPGV YAVSVTAAGF DTYAEKGVLI SRSGASNLFT LAPVGSVEAI
EVKAGRTRTP DFEITTTGAA IEIGELAERV PVGRSLRDIA LMSPGVVQGS SSANGTFANQ
ISISGASFIE NAFFVNGLNI TNFRMGLLPV EVPFDFYKSV EVKTGGYPAE FGRSTGGFVN
AITKSGANAF HGGVTATWEP GDLRDSAPNT FKNNFKDATS SREEWVAELG GPIIKDRLFF
YGLYNQRKLE SFSPSASQDN ATRTRDDAPF WGGKLDGWLT SKQHLEFTYF DSTKDVRNRS
LNYNRATKQV GAETGGTNQR SGGENYIARY TGTFTPWFTL SGAYGVNKFR DGQLPLDTTH
ERVIDYRTNS AGVDIGVNKV TDAMSFNDDE RTFYRADADF NFDLFGAHHL RVGYDHEENT
ATQVFETIGS GFLKVFRATG ADQTALPAGT DYVTTRVYRN SGSFSTVNQA FYAQDQWSLF
NDRLQFEAGV RNDRFDNRNA DGSTFYDPGN QWAPRISVSG DPFGDGKSKL YGYFGRYYLP
LPSDLSLKFA GSLVTYTRYN LLTGVAADGT PSLGAPVTSV AGMAPCPDTG TANCLVSASG
LVADTAESIA HNLKPQSADE FVLGYERRFG DLIKVGAYFT HRELNNIVED VAIDTGARAY
CVKAGFTAAQ CRSSFGGGRQ WVIVNPGEDV TIRLNSLPDG SKPVVTLAAA DLTYPKPKRD
YNALTATFDR TFDGKWSLSG SYTWASLKGN YEGGVRSENG QLAINTTADF DSPGFLDGAD
GYLPNHRRHT FKGYGSYQVN KWLTLGGNGS VQSPRKYSCI GIVPNAVDPV AFGYQGYGFY
CQGKVVERGS AFEGDWVTQF NTSAVLTLPT PGDRFDASLR LDVFNLFNSH AVTAYHEFGD
VGGSGVADVN FRKPVDYQTP RYARIQLRVA F