Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4552 |
Symbol | |
ID | 5902013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4928331 |
End bp | 4931624 |
Gene Length | 3294 bp |
Protein Length | 1097 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641565071 |
Product | autotransporter beta-domain-containing protein |
Protein accession | YP_001686170 |
Protein GI | 167648507 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.518565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCA AGGTCCTGGT CGCGACAGTC GCAACCGCTC CTCTCCTGGC CATGGCGTTC GGCGCTTATG CTGAGACCGT CGTCAACACC GCCCGTACGA CCCCGATCGC CACGGCGACG GCGGCCGGCG CGGCCCCGGG GACGCCCGAC GACGTGAAGA TCGACGCCGC CGGCTCGATC AAGCTGACCA CCGCGGGCCC GCTGGTCACG CTCAACAGCA ACAACAAGGT GACCAACGCC GGCACCCTGG CGACGGTGGG CGTCGACGAC TCGACCGGCA TCCTGATCCT CGGCGGCAAC ACCGGCTCGG TGACCAACTC GCTGGCGATC AGCCTGACCG AGGATTACAC CCCGACCGAC GACGACAAGA CGGCGGCCGG CGTGGCGGCG CCAGACGGCG ACCTCGACGG TCCGTTCGCC AAAGGAACCC ACCGTTACGG CATCCGCCTG ACGGGTCCCG GCGTCTTCAC GGGCAACATC ACCCAGGACT CCGCCGGCTC GATCGTGATC GAGGGCAACA ATTCGGCCGG CATCTCCCTG GAGACCGGCC TGGTCGGCAA CCTGACCACC AACGGCGCGA TCTCCGTCAC CGGCGACAAC AGTTACGGCG TCCACCTGAC GGGGCCGGTC ACCGGCAAGG TCACCCAGGG CGGCTCGGTT TCGGTCCTGG GCGACAAGGC CGTGGGGGTC GCGCTCGACG CCGACATCAG CGGGGCCTTC GCCCTGCAGG GCTCGGTCGC CGCGACCGGC TACCGCTACA TCACGCGGCC GACCGAAACC GCGGTCGGCA AGCTCGACAC CGACGACCTG CTGCAGGGCG GGCCGGGCGT GCGCGTCGCG GGCGACGTCA CCGGCGGCGT GCTGCTGAAC GGGCCGACCA GCTCCACCGT CATCGACGGT ACGACGACGA CCACCACGAC GGTGTCGTCC TCGACGACGG GGACCGGCTC GATCAGTGTC TACGGCGGCG CCCCCGCCTT GCTGATCGGG TCCGACAGTC GCGCGATCAC GATCGGCGCC GTCGGCGCGG CCGACAACGC CTATGCCCTG ATCGCCAAGG GCTCCATTGC GGCGACGGGC GTGTACGACA AGATCGCCGC CATTGGCGTG CAGATCGGCG GCCAGACCGG CCAGACCACC ACCTTGGCCG GCGGCGCGCG CCTGGACAGC TCCATCACGG TCAGTTCTCG CGAAGCCAAC GCCACCGGCG TCAACTTCAC GGCGGGCGCC GTCGCGCCAT CGCTCTGGAA CCGTGGCTCG ATCAGCGCCG TCAGCGCTTC CTATGCCGGC TCGCTCCTCA CCGGCGACGC ACGCGCCGTG GTGATCAACG CCGGAGCCAA CGTCGTCAAT CTGAACAACA GCGGCGCCAT CGCGGCCACC CGTAGCGGCG AGACCGGCGA CGCCGTCGGC ATTCTCGACA GCTCGGGCAC GCTGTCGACG CTCATCAACA GCGGCTCGAT CACCGCGCAG GTCGTCGCGC CGACGGTCAA GACGGACGGC ACGACGACCA CCACCACCGC CAAGGCGACA GGCAAGGCGA TCGCCGTCGA CCTAAGCGCT TCCACGAGCG GCGTCACATT CACCCAGACG GCGCCCACCT CGACCGCCGC GACGAGCACC TACCCCAGCG CCGACACCGT CACCACGGCC AGCGCCACCT CGACCACCTC GACCCCGACG ATCATCGGCG AGGTTCGCTT CGGCTCGGGC GCGGACACGT TCAATATCCG GGCCGGCAAC GTGTTCGGCG ACGTCTCGTT CGGGGCCGGC GCGGACGCCC TCAACATCAG CGGCGGCGGC GTCCTGCTGG GCGCGCTGAG CGATTCGGAC GGTCTGCTGG CGATCGACAT CGGCAAGGGC TCCCTGGGCC TGACCAACAC CGGGAACATC AACGCCACCA GCCTGAACAT CGGCGCTGAA GGCAAGCTGG TGTTCACCGC CGATCCGACC GCCAACAACG GGGCCGGCGC CAACACCCAC CTGGTGATCT CGGGCGCGGC CAACATCGCT ACCGGCGCCC AGATCGGCCT GCGGTTGACC GGCCTGCTCA CCGCCCCGAC CACCTACACC GTGATCACCG CCGGCAGCCT GACGGCCGGC ACGATCAACC AGGATCTGCT GGGCGGCACG CCTTATCTCT ACGTGGCCAG CAGCCGGACC GACGCCAACA ACGTCTATCT CGACGTCACA CGCCGGACGG CGGCCCAAAT CGGCATGAAC AAGGCCCAGT CCTCGGCCTA CGACGCAACG TTCGCGGCGC TGAGCAAGGA CAAGGACATC GCCACCGCCT TCCTCGGCCA GAGCACCAAG GACGGCTTCC TTGGCCTCTA TGACCAGATG ATGCCCGACC AGGGCGAAGG GTTCTTCGCG GCCCTGCAGA ACGCCAACCA GGCGATCTCG TCGGCCACGG CCTATCGTCC CGATCCGGGC GACCGCTATG GACCCGACAG CCTGTGGATC CAGGAGATCA ACACCCTGGT TCGCCGCGAC ACCGGCGACA CCATGGGCTC CGACACCCAG GCCTTCGGCT TCGTCGGCGG CTATGAGGCC ATGGGCGACG CCGGCGGCGC GCTGGGCCTG ACCCTGGCCT ATGTCAGCGT CGAGGAACAC GACATCGCCG CCAAGGTCGG CGAGCAGACC ACCGGCAACT TCGTCCAGTT GGGCGGCTAT TGGCGCCGGG CGATCGGCGG CTGGCGGCTG AACGCCGGCG GCGGCGGTGG CTTTGGCTGG TACGACGGCG ACCGGACGTT CAATTCCGGC GACATCAACG GCGACGGAAC TGCTGACGTC CAGCGCCACA ACACCGCCAA GTGGAACGGC TACACGTTCA ACGCCTTCGC CGGCACGGCT TACGAGGCCA AGTTCGGCCG CTACTTCGCC CGCCCCGAAG GCCGCCTCGA CTACCTGTAC CTGAGCGAGG GCGAGCGTAA GGAAAAGGGC GGCGGCGACG GCTTCGACCA TATCGTCCAC AAGCGCAAGT CCAGCAGCCT GACCGGCGAC GTGGGCGTCG CCTTCGGCGC GGACTACGGC CGCGACCTCT GGTGGCGTCC CGAAGTGCGG GTCGGCTACC GCCAGACCCT GGCCGGCGAC ATCGGCGACA CCACGTTCGG CTTCACCAAT GGCGGCGCGC CCGTCACCCT GGCGGCCATG AACGACAAGA ACGGCGCGGT GACGTTGGGC TTCGCCCTGC GGGCCGGCAC CCCGATGTCC TATCTGGCGC TTGAAGCCAA CGCCGAGGCC GCCAAGAAGC AGAAGCGCTA TAACCTGAAG CTGACCGGAC GGGCGATGTT CTAG
|
Protein sequence | MQRKVLVATV ATAPLLAMAF GAYAETVVNT ARTTPIATAT AAGAAPGTPD DVKIDAAGSI KLTTAGPLVT LNSNNKVTNA GTLATVGVDD STGILILGGN TGSVTNSLAI SLTEDYTPTD DDKTAAGVAA PDGDLDGPFA KGTHRYGIRL TGPGVFTGNI TQDSAGSIVI EGNNSAGISL ETGLVGNLTT NGAISVTGDN SYGVHLTGPV TGKVTQGGSV SVLGDKAVGV ALDADISGAF ALQGSVAATG YRYITRPTET AVGKLDTDDL LQGGPGVRVA GDVTGGVLLN GPTSSTVIDG TTTTTTTVSS STTGTGSISV YGGAPALLIG SDSRAITIGA VGAADNAYAL IAKGSIAATG VYDKIAAIGV QIGGQTGQTT TLAGGARLDS SITVSSREAN ATGVNFTAGA VAPSLWNRGS ISAVSASYAG SLLTGDARAV VINAGANVVN LNNSGAIAAT RSGETGDAVG ILDSSGTLST LINSGSITAQ VVAPTVKTDG TTTTTTAKAT GKAIAVDLSA STSGVTFTQT APTSTAATST YPSADTVTTA SATSTTSTPT IIGEVRFGSG ADTFNIRAGN VFGDVSFGAG ADALNISGGG VLLGALSDSD GLLAIDIGKG SLGLTNTGNI NATSLNIGAE GKLVFTADPT ANNGAGANTH LVISGAANIA TGAQIGLRLT GLLTAPTTYT VITAGSLTAG TINQDLLGGT PYLYVASSRT DANNVYLDVT RRTAAQIGMN KAQSSAYDAT FAALSKDKDI ATAFLGQSTK DGFLGLYDQM MPDQGEGFFA ALQNANQAIS SATAYRPDPG DRYGPDSLWI QEINTLVRRD TGDTMGSDTQ AFGFVGGYEA MGDAGGALGL TLAYVSVEEH DIAAKVGEQT TGNFVQLGGY WRRAIGGWRL NAGGGGGFGW YDGDRTFNSG DINGDGTADV QRHNTAKWNG YTFNAFAGTA YEAKFGRYFA RPEGRLDYLY LSEGERKEKG GGDGFDHIVH KRKSSSLTGD VGVAFGADYG RDLWWRPEVR VGYRQTLAGD IGDTTFGFTN GGAPVTLAAM NDKNGAVTLG FALRAGTPMS YLALEANAEA AKKQKRYNLK LTGRAMF
|
| |