Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2178 |
Symbol | |
ID | 5899633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2365570 |
End bp | 2366760 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562669 |
Product | carbohydrate-selective porin OprB |
Protein accession | YP_001683804 |
Protein GI | 167646141 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3659] Carbohydrate-selective porin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.278032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.401422 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAT CTAGACCAAA CTGCCTTATC GCCATGGTCG CGGCCCTGGC CGTCTCCGCC CTGAGCTCCG CCGCCCTGGC GCAGACGGCG GGCGAGGGCG CGTGGTCGCA CGAGGTCGTC TACACCGCGG ACGTCGCCGG CCCGGTGCGC GGCGGGGCGG CCCATGCCGG TCGGGCCTTG GACAACCTGG ACGTCATCAT CGACGGCGAC CTGGACAAGG CCTTCGGCTG GCGCGGCCTG GCCGTGCACG GCTACCTGCT CAACAACAGT GGCGGCGCCC CCAACGATAT CGCCGGCACT CTGCAGGGCG TCGACAACAT CGAGGTCGGG CGTCCCCGGG CGCGGCTCTA CGAGTTGTGG CTCAAGGCCA GTTTCGCCGG CGACAAGGGC TCGGTGCTGG CCGGGCTCTA TGACCTCAAC AGCGAATTCT ACTCGACCCA GGCCTCGGGC CTGCTGCTGG CCCCGCCGTT CGGCATTGGC TCGGAGCTCG CCTCGACCGG CCCCAACGGT CCGTCGATCT TCCCGTCCAC CGCCCTGGCG GTGCGGATGC GGGTCGAGGG CAAGCAGGGA CGCTACGTCC AGGCCGCCGT GCTCAACGCC AAGGCCGGGA CGGTGGGCGA TCCGGATGGG CCGGCGACGG AGTTCGATCA CGGCGCGCTG ATAGTCGCCG AGGCCGGGAT CGGCGCGACA TGGCGGCTGG CGGCCGGGGG CTGGTTCTAC ACCCAGCGCC AGACGGACCT GCGCGACCTC GACGCCAAGG GCGACCCGGC CCGGAGCCAC GCGCGCGGCG CCTACCTTCT GGCGGAGTAT CCCTTCGTCG ATGGCGGGGT GAGCGGACGC TCGGTGCGGG GCTTCGCTCG CCTGGGCCTT TCGGACGGCG ACACCACGGC GTTCCGTTCG GGCTGGCAGG CCGGCGTGCT GGTGGAGAAG GTTTTCGCCT CGCGCCCCGA CAGCGCCTTC TCGGTCGGGG TGGAGCAGGG GATGCTATCG TCCAAGCAGC GCGACAACAC CCGCGACGCC GGTGTCTCCC CGGCCCACGC CGAGTCCAGC ATCGAGATCA CCTATTCAGA CAAGGTCCTG CCGCGACTCA CCCTGCAGCC GGACGTCCAG TTGATCCGCC GGGCCGGCGG TGATCGCGAC GCCCGTGACG TGGTGGTCGT GGCCTTGCGG ATGACGATCA GCCTGTTCTA G
|
Protein sequence | MTSSRPNCLI AMVAALAVSA LSSAALAQTA GEGAWSHEVV YTADVAGPVR GGAAHAGRAL DNLDVIIDGD LDKAFGWRGL AVHGYLLNNS GGAPNDIAGT LQGVDNIEVG RPRARLYELW LKASFAGDKG SVLAGLYDLN SEFYSTQASG LLLAPPFGIG SELASTGPNG PSIFPSTALA VRMRVEGKQG RYVQAAVLNA KAGTVGDPDG PATEFDHGAL IVAEAGIGAT WRLAAGGWFY TQRQTDLRDL DAKGDPARSH ARGAYLLAEY PFVDGGVSGR SVRGFARLGL SDGDTTAFRS GWQAGVLVEK VFASRPDSAF SVGVEQGMLS SKQRDNTRDA GVSPAHAESS IEITYSDKVL PRLTLQPDVQ LIRRAGGDRD ARDVVVVALR MTISLF
|
| |