Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2995 |
Symbol | |
ID | 5900450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3254322 |
End bp | 3257123 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641563492 |
Product | TonB-dependent receptor |
Protein accession | YP_001684620 |
Protein GI | 167646957 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGC GATATCCCGT TCAGGCCATG CGCCCGAACC ACAAGATCTT TTTGCTGGCC ACCGTATCGG CGCTGGTGGT GGGCGCCAGC TCATCGGCCG CTCTGGCGCA GCAGGCCTCC GGTGAGTCGG CGTCTGGCGA CATGGTCGAC CAGGTGGTCG TCACCGGCTA CCGGAAGTCG CTGAGCGATG CGCGCGCCAT CAAGAGGGAT TCAGTCATCC AGAAGGACGC GATCGTCGCC GAAGACATGG CGAAGTTCCC GGACCTGAAC CTGGCCGAAT CGTTGCAGCG CCTGCCGGGC GTGCAGATCA CCCGCGAGGC GGGCGAGGGC AGGCGCATTT CGCTGCGCGG CCTCGGCCCG GATTTCAGCC GCGTGCAGCT GAACGGCATG GAAGTGCTGG GCAATGTCGA CTCCGCCCAG GACAGCCGTG GCCAGCGCTC GCGCGACCGC GCGTTCGACT TCAACATCTT CGCTTCGGAA CTGTTCTCGA AGGTGGAGGT CGAGAAGACC TTTGAAGCGG CCCAGAACGA GGGCGGCATG GCCGGCACCG TCGGCCTGTT CACCGGCAAG CCGTTCGACT ACGCGGCGGG ATCCAAGGGC GCGGTGTCGC TGAAGCTGGG CACCAACGAG TACACCAAGG ACACCCAGCC GCGCATAGCC GCCTTGTTCA GCCAGAACTG GGACAACAAG TTCGGCGTGG CGCTCTCGGT CGCCTACTCC AAGCGCGAGA CCACCGAGCA GGGCCACAAT ACCTACAATT ACGACCGGTT AAGTTCTGCT GCCTTGCAGA AGCTAGTCAC CAACGGCCTG AATATCTCCC ATCTGAGCGC CGCGCAGCAG GCCAAGTTCC TGTCCGGGGA CCTGTATTTC GCGGACGGTA ACCGCATCTC CTCCTGGAAC GCGAAGCAGG AGCGCCTCGG CCTGACCGGC GCTGTGCAGT GGCGGCCGAT GGACAATCTG CTGTTGACGC TGGATGCGCT GCACGGCGAA TTCACCACCC ACCGCGACGA GTATCACCTG GCCACGCGAC CGCTGGGATC CGGGACGAAG TCCTTTGCGT TCGACACGCC CGCCGGCGGG GTCTGGCCGG CGGCCTTCCA GACGGGGTCG GTCATCAACG ACCTGACGTG GGATAGCAGC AACTACGTCA CCAAGACCGA CGTCACCGGC ACGACCTTCG GCAGCGAGCA TCGCCGGTCG CTGAACGAGA ACCGCTTCAA CCAACTGGCC CTGACCGGCA AGTGGGACGC GACCGATCGC CTGACCATCG ACGGCCATGT CGGCTATGAA AAGTCGACCT ACAAGACCCC CTATGACGAC AAGCTCTACA TGCGCGCCAA GGGCAATATG GTCGCCAACT ACGGTACGGA CGGCCAGTCG GCCACGTTCA GCTATCCGGG CTTTAGCGCC ACCAACCCGG CCAACTACGC GATGGACTCG TTCTACTACC GCAGCTTCAA CAATGAATCG GGGCTGCGCG AGGGCGTGCT GAACCTGCGC TACGAACTGT CCGACGTCTT CACCCTGCGC GCGGGCGTGG CCTACCACCG CTTCTCGCAA GAGGGCATGG ACCTGTTCTA CGACGACAAC GTCAATGGAA CCCGGTCCAA GATGCGCGGC ACGTCCGTCG CCGACGTCAC TTCGGTATTC ACGAACGAAT TCGGATCGTG GCTGGTCGGC GACTACGGCA AGGCCTTCGC GAAGTACAAG GAGTACCACC GGCTCGGGGC CAATACCGAC GGGACGGGCG GGACGCTGCA GGACATCGAG AACGTCTACA AGACGTCCGA AGAGACGGTT TCCGAATATG TGCAGGCCGA CTGGGACAGC GAACTGTTCG GCAAGCGCTT CCGCGGCAAT ATCGGCCTGC GCGGCTACAG CACCGACACC CACAGCACCG GCTGGATCCA GGGCGACAGC TACGCCTATC TCGGCACGAC CGACGTCAAG GGCAGCTATG AAGGCGTCCT GCCGGCCCTC AACACCGTGC TGGACCTGAC GCCGGAAGTG CTGGTGCGCT TCTCCGCCAC CCAGAACCTG AACCGTCCGA GCCTGGGTTC GATGGCGGCC AAGGGCAGCG CGTTCCAGAA TGATAGCGGC GATATCAGCG CCTCTCGCGG CAATCCCGAC CTCAAGCCCT TCAAGGACAC CACGCTGGAC CTGTCGCTGG AGTACTATTT CGGCAAGTCA GGCCTGCTTT CGGCGGGTGT GTTCCGCAAG GACATAACCA ACTTCATCAC ATCGACGACC CTTCACAACA TCCCCTTCAG CCAGACGGGG GTGCCCTACA CCACCATACC GGGCGCGACG GCCAGCACCA TCGTCAAGGA CTTCGATGTT CCGACCAACA GTTCGGACAA GGTGAAGCTG ACCGGCGTTG AACTGGTGGC GCAAGGCCAG TTCTCGTTCC TGCCAGCGCC CTTCGATAAT CTCGGCGGCG TGGCGAACTA TACCTATGTG GACTCAAATT CGGATCTCAC TGGCATTTCC AAGTCCAGCT ACAATCTCAC CCTCTACTAT GAAACCGACC GTTGGGGCGC CCGCGGCTCG GTGAGCCACC GCACCCGCTG GTACACCGGC TATAACAAAG ATGTCATGAG CGCCGACACG CGAGGCTTCG AGGGGTCCAC CTATGTGGAC GCTTCGGCCT TCTTCAATGT CACCGACAAG ATGCAAGTCT CGTTGAACGC GATCAATCTG ACCAACCAGA AGGACACCCA GTTCTGGGGC CAGAACCGCT ATCTCTATAA TCAGAACCAG AGCGGCCGGA CCTACATGAT GGGGCTCAGC TACAAGTTCT AA
|
Protein sequence | MSKRYPVQAM RPNHKIFLLA TVSALVVGAS SSAALAQQAS GESASGDMVD QVVVTGYRKS LSDARAIKRD SVIQKDAIVA EDMAKFPDLN LAESLQRLPG VQITREAGEG RRISLRGLGP DFSRVQLNGM EVLGNVDSAQ DSRGQRSRDR AFDFNIFASE LFSKVEVEKT FEAAQNEGGM AGTVGLFTGK PFDYAAGSKG AVSLKLGTNE YTKDTQPRIA ALFSQNWDNK FGVALSVAYS KRETTEQGHN TYNYDRLSSA ALQKLVTNGL NISHLSAAQQ AKFLSGDLYF ADGNRISSWN AKQERLGLTG AVQWRPMDNL LLTLDALHGE FTTHRDEYHL ATRPLGSGTK SFAFDTPAGG VWPAAFQTGS VINDLTWDSS NYVTKTDVTG TTFGSEHRRS LNENRFNQLA LTGKWDATDR LTIDGHVGYE KSTYKTPYDD KLYMRAKGNM VANYGTDGQS ATFSYPGFSA TNPANYAMDS FYYRSFNNES GLREGVLNLR YELSDVFTLR AGVAYHRFSQ EGMDLFYDDN VNGTRSKMRG TSVADVTSVF TNEFGSWLVG DYGKAFAKYK EYHRLGANTD GTGGTLQDIE NVYKTSEETV SEYVQADWDS ELFGKRFRGN IGLRGYSTDT HSTGWIQGDS YAYLGTTDVK GSYEGVLPAL NTVLDLTPEV LVRFSATQNL NRPSLGSMAA KGSAFQNDSG DISASRGNPD LKPFKDTTLD LSLEYYFGKS GLLSAGVFRK DITNFITSTT LHNIPFSQTG VPYTTIPGAT ASTIVKDFDV PTNSSDKVKL TGVELVAQGQ FSFLPAPFDN LGGVANYTYV DSNSDLTGIS KSSYNLTLYY ETDRWGARGS VSHRTRWYTG YNKDVMSADT RGFEGSTYVD ASAFFNVTDK MQVSLNAINL TNQKDTQFWG QNRYLYNQNQ SGRTYMMGLS YKF
|
| |