Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4106 |
Symbol | |
ID | 5901568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4459880 |
End bp | 4462282 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564626 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001685728 |
Protein GI | 167648065 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0612134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.370277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAC ACCGTCACAC GATCGCCGGC GACAGCACAG CCGCCGGGCG TCGAGGCGCT TTCGCCGGCC GCAGGGCAGC CCTGGCCGCC TCCAGCTGTC TGGCGGGCGT AGCGGCCTGC GCCCTGATCG GCCTGGCCGG CGCGATGACC CTGGGGACGG CGGCGCAGGC CCAGACCCTG CCGACGGGCG GAACGGTCGC GGCGGGCGGC GCGACCATCA CCACCGGCCC CGGCGCGATG ACCATCAACC AGTCGACCCC GAACGCCGCG ATCAACTGGC AGAGCTTCTC CATCGGCCAG GGTGGCAGCG TCGTCTTCGT CCAGCCCGAC AGCCATTCGG TGGCGCTGAA CCGCGTGCTG GGGCCGGACG CGTCAACGAT CTTGGGCGGG CTGACCTCCA ACGGCCAGGT GTTCCTGGTC AATCCCAATG GCGTACTGTT CGGACAAGGG GCTCAGGTCA ATGTCGGCGG GCTGGTCGCA TCCACTCTGG GAATGACTGA CGCTGACTTC ATGGCCGGAA ACTACCGGTT CTCGGGCAGC GGAGGGATCG TCCGCAACCA AGGCGATATC ATCGCGACGG GCGGTTACGT CGCGCTGCTG GGAGGCCAGG TCAGCAATGA CGGACTGATC CAGGCCAATC TGGGCACGAT CGCGCTGGCG TCGGGCGAAG CCATCACCCT CGACGTCGCC GGCGACGGGC TGCTCAATGT CGTCATAGAG AAGGGCGCAG CCAACGCCTT GATCCAGAAC AGCGGCATGC TTCAGGCCAA CGGCGGTCGG GTGGTGATGA CCGCGCAGGG CGCGGGCGAC TTGCTCCGCA CGGTGGTGAA CAACACCGGC GTCATCCAGG CGCGGACGAT CGGTCAGCGT AACGGAACCA TCCAACTGCT CGGCGACATG CCAAGCGGGA CGCTGAACGT GGCCGGTGCC CTTGACGCCA GCGCGCCGGG CGGCGGGAAT GGCGGGTCCA TCCAGACCTC CGCCGGGAGC GTGAACATCG CCTCCACGGC GGGGATCACC GCGGCCGCCC CGACGGGCGT CGCGGGGATC TGGTTGATAG AGCCGGCCGA CTTCACGATC GGCGCTGGCG GCAATATCTC CGGCGCGACC CTGTCGGCCC AGCTGGTGAC CACCAATGTC ACGATCAACA CGCGGACGGC CGCCGGGCTG TCGGGTACAG GGGATATTCT CGTCAATGAC GCGATCGTCT GGACGGCGTC GTCCACCCCC ACCACCCTGA CGCTGAACGC CAACCGCGAC ATCAACATCA ACGCCGCGAT CACCGCCACA AAGGGCAATT TCGTCGCTTG CTGCGGGCGC GATGTGGCCG TCAACGCCCC CATCACCACG GTGAACGGCA GCGTGCTGCT GAACGCCGGC CAGAACGTCA CCGTGTTTCA CGCGATCACC ACCACGGACG GCAACATCGC CCTGTGCGCC GGGCATGACG TCCATATCGA CGCGGCCGTC ACCCTGACCC GCGGCAGCAC CATTCCCGCC CAGAGCCTGG GCCTGCCCGT CGGCCTGACC CTCATCGCCG GCGCGGGCGG GACGGGTCCG GGCGTGGGCG GCGGCACGAT CATCTTCAGC CCGTTGGCGC CCCGCGTCAC GGTCACGGCC ACCCCGGTCA CGATCAATTA CAACCCGGTT TCCTACGCGG CGCCGGCGGA CTTCTCGACC CGGTTCACCC TGACCGAGGG CGCCGCCCTG ACACAGCGGA TGCTGCTGTT CCCGGACGGA AGCCGGGTGT TCGACGGCGG GACGGCCACG ACCCTCTCCG GCTTCAGGAC CACGGCGACC TCGGGGTTGC CCACGGGCGT CACCTTGGTG GCGGGCCCCG GCGCGACCGC GACCTTCGAT TCGGCCGCCC CGGGCGCCGA CGTCGGGATC ACCTACAGCG GCTACACCCT GGCTGGGGCG AACGCCGACC AATACGCCTT GGCGGGCTTC TGTTGTGTAT CGACTCAGAG AACGCAAGGC ACGATCTCGG CGGCGGTGGT CACGCCGCCA GTGACCCCGC CAGTGACCCC GCCAGTGACC CCGCCGGTTA CTCCGCCAGT GGTCCCGCCG GTGGTCCCCC CAGTCACCCC GCCCGTGACG CCGCCCATAA CGCCGCCGGT GGTCCCCCCG GTGACGCCGC CGGTGACGCC GCCCGTGACG CCGTCTCCGG CCTCGCCGGG ACCGACCGCC TTCTACCCGA TCATCACGCC AACCCCGGCC TTGGTCGCCT CGCCCGATCT GGCCTTCAAC GTGGTGGGGG GAGGCGTGCG GATGCCGCCT TACGAATCGG CCCGCATCTC TCCGCCGGTG GAGGAGGTCG TTCGGACGGT GGAGAAGACC GCGCCGGTCG CGCCGCGTCC TGTGCAGGTC CCCGTCTATC CCCGCAAGCA GGATCGCAAC TGA
|
Protein sequence | MTRHRHTIAG DSTAAGRRGA FAGRRAALAA SSCLAGVAAC ALIGLAGAMT LGTAAQAQTL PTGGTVAAGG ATITTGPGAM TINQSTPNAA INWQSFSIGQ GGSVVFVQPD SHSVALNRVL GPDASTILGG LTSNGQVFLV NPNGVLFGQG AQVNVGGLVA STLGMTDADF MAGNYRFSGS GGIVRNQGDI IATGGYVALL GGQVSNDGLI QANLGTIALA SGEAITLDVA GDGLLNVVIE KGAANALIQN SGMLQANGGR VVMTAQGAGD LLRTVVNNTG VIQARTIGQR NGTIQLLGDM PSGTLNVAGA LDASAPGGGN GGSIQTSAGS VNIASTAGIT AAAPTGVAGI WLIEPADFTI GAGGNISGAT LSAQLVTTNV TINTRTAAGL SGTGDILVND AIVWTASSTP TTLTLNANRD ININAAITAT KGNFVACCGR DVAVNAPITT VNGSVLLNAG QNVTVFHAIT TTDGNIALCA GHDVHIDAAV TLTRGSTIPA QSLGLPVGLT LIAGAGGTGP GVGGGTIIFS PLAPRVTVTA TPVTINYNPV SYAAPADFST RFTLTEGAAL TQRMLLFPDG SRVFDGGTAT TLSGFRTTAT SGLPTGVTLV AGPGATATFD SAAPGADVGI TYSGYTLAGA NADQYALAGF CCVSTQRTQG TISAAVVTPP VTPPVTPPVT PPVTPPVVPP VVPPVTPPVT PPITPPVVPP VTPPVTPPVT PSPASPGPTA FYPIITPTPA LVASPDLAFN VVGGGVRMPP YESARISPPV EEVVRTVEKT APVAPRPVQV PVYPRKQDRN
|
| |