Gene Information |
Locus tag | Caul_2148 |
Symbol | |
ID | 5899603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2327114 |
End bp | 2328607 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562638 |
Product | TolC family type I secretion outer membrane protein |
Protein accession | YP_001683774 |
Protein GI | 167646111 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
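
The length fields above are consistent with the listed coordinates. Assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the numbers in this record), the gene length is End - Start + 1, and the protein is one residue shorter than the codon count because the final codon (TAG in this gene) is a stop. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2327114, 2328607)
print(length, protein_length(length))  # 1494 497
```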
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.400858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0489946 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAATA ACCGCCGAGC CGGTTTGCTG GCCGCCGCTT GCAGCATGGG CCTGATGGCC GGTCTGGTTT CGGCCGCGAG CGCCGAAACT CTGAACGACG CCCTGGCCCT GGCTTACCAG ACCAATCCCA CCCTTCAGGC TCAGCGCGCC AACCAGCGCG TGACCGACGA GGGCGTGGTC CAGGCCAAGT CGGCCTTCCG ACCCAATCTG AGCGGTTCGG CCGACGTCAC CGGTCAGCGG ACGGACTACG CCAAGCCCAG GCAATCGCTC GTCGGCAACA CGATCGTCAA CAAAGACCAA CAGACCGCCT ACGGCAGCGG GGGCAGCTTT TCCCTGTCGC AGCCGCTCTA TACCGGCGGC CGCGCGAGCT CCAACCTGAC CGCCGCCGAG GCCGACGTGA TGGCCGGCCG CGAGGATCTG CGCAGCGTCG AGCAGTCGGT GCTGGGCAAT GTCATCCAGG CCTATGTCGA TGTCCGCCGC GACCAGGAGC GCCTGCGCAT CGCCCAGGAG AACGTCAGCG TCCTGAACCG TCAGCTCGAA GAGGCCCGCG CCCGTTTCGA GGTCGGCGAG ATCACCCGCA CCGACGTGGC CCAGTCCGAA GCCCGTCTGG CCGGCGGCCA GGCCAGCCAG TCGTCGGCCC AGGCGATCCT GGCCGGCAGC CGCGCGGCCT ACGCCGCCGT GGTCGGCCAG AACCCTACCA ACCTGGCGCC GGAACCGTCG CTGGCCGCCC TGCTGCCCGC CAGCGTCGAG CAGGCCTTCG ACTTGGTCGA CCAGAGCAAT CCGCAGATCC AGGCCGCCCG CTACGCCGAA CGCGCCGCCG CGGCGCGCGT GGCCCTGGCC AAGGCCGCCA TGCGTCCCAC GGTTTCGGCG CGCGCCGGCC TGGGCTGGGA GTCCGAGGGC CGGGTGGACG GCAAGGGCAA CCAGTTCGGC GACTATGATC GCGGGATCAA CGGTTCGATC ACCGCCTCGG TGCCGATCTT CACCGGCGGT CTGACCAGCT CGCAGATCCG CGCCGCCAAG GAGCGCGAGA ACGCCGCTCA CGTCGCCGTC GAGGGCGCCA AGCGCACCGC CCTGCAACAG ATCTCGACCG CCTGGAACAA CCTGCTGGCC GCCCGCGCCA ACCTCGTCTC CAACGAGGAG CAGGTTCGCG CCGCCCGGAT CGCCTTCGAA GGCGTGCGCC AGGAACAGCA GGTCGGTCTG CGCACCACCC TGGACGTGCT CAACGCCCAG CTGGAGCTGT CCAACGCCGA GGTGGCCCTG GTCATCGCCC GCCACGACGA ATATGTCGCC AGCGCCAGCG TCCTGCAGGC CATGGGCGTG CTGAACGTCG CCAACCTGGC GCCGGACGTC GAACGCTACG ATCCGGTGAA GTCCTACAAC AGGGTCAACC ACGCCATCGG CTGGGTGCCG TGGGAGCCGG TGGTGCAGGT GATCGACAAG ATCGGCGCGC CCTCGACGGC GGTCAGCAAC CCCACGCCCG TCGCCGCCAA GTAG
|
Protein sequence | MSNNRRAGLL AAACSMGLMA GLVSAASAET LNDALALAYQ TNPTLQAQRA NQRVTDEGVV QAKSAFRPNL SGSADVTGQR TDYAKPRQSL VGNTIVNKDQ QTAYGSGGSF SLSQPLYTGG RASSNLTAAE ADVMAGREDL RSVEQSVLGN VIQAYVDVRR DQERLRIAQE NVSVLNRQLE EARARFEVGE ITRTDVAQSE ARLAGGQASQ SSAQAILAGS RAAYAAVVGQ NPTNLAPEPS LAALLPASVE QAFDLVDQSN PQIQAARYAE RAAAARVALA KAAMRPTVSA RAGLGWESEG RVDGKGNQFG DYDRGINGSI TASVPIFTGG LTSSQIRAAK ERENAAHVAV EGAKRTALQQ ISTAWNNLLA ARANLVSNEE QVRAARIAFE GVRQEQQVGL RTTLDVLNAQ LELSNAEVAL VIARHDEYVA SASVLQAMGV LNVANLAPDV ERYDPVKSYN RVNHAIGWVP WEPVVQVIDK IGAPSTAVSN PTPVAAK
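
Both sequences are printed in space-separated blocks of 10 characters, so the spaces need to be removed before any length or composition check. The sketch below, with hypothetical helper names, strips the grouping whitespace and computes the G+C fraction. It is demonstrated only on the first two blocks of the gene sequence, so the value it prints applies to that 20-base prefix and not to the whole gene.

```python
def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First two 10-base blocks of the gene sequence listed above.
prefix = "ATGTCGAATA ACCGCCGAGC"
print(len(normalize(prefix)), gc_fraction(prefix))  # 20 0.55
```

Applying the same functions to the full gene sequence gives a way to compare the result against the listed Gene Length and GC content fields.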