Gene Information |
Locus tag | Caul_4221 |
Symbol | |
ID | 5901683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4587101 |
End bp | 4588534 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564743 |
Product | PepSY-associated TM helix domain-containing protein |
Protein accession | YP_001685843 |
Protein GI | 167648180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
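The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon while Protein Length does not.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check coordinate span, gene length, and protein length.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based and inclusive;
    - the gene length counts the terminal stop codon;
    - the protein length excludes the stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    codons = gene_length_bp // 3          # whole codons, stop included
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for Caul_4221.
result = check_cds_lengths(4587101, 4588534, 1434, 477)
print(result)
```

Under these assumptions all three checks hold for this record: the span is 1434 bp, which is 478 codons, or 477 residues plus one stop codon.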
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGG TCCTGCGTTG GCTTTCCTGG TTCCACCGCT GGACCGGCGT GGGGCTTTGC CTGCTCTTTT TCCTCTGGTT CGCCAGCGGG GCGGTCCTGC TCTTCGTGCC GTTCCCGTCG CTGAGCGACC AGCAGCGGCT CGCGGCCAGC GATCGGTTCG ATTCGACGGC GATCACGGTG GCGCCCAGCG CGGCGCTCGC CGCCGTCGGA GGCGGGGATG GCCTGCGCCT CGTCGCCATC GGCGAAGAGC CGCGTTACGT CGTCCAGGAC GCCAAGGGCG AACTGCGGAC GATCGACGCC CGCAGCGGCC AGATCGCGCC GGCTGTCACG GCCGATCAGG CGCGAGTGAT CGCCCAGCGC TTCGCCGATG TTCCGGTCAA GCACGTCAGC GCGCCGCTGC GCTACGACCA GTGGGTCGTC CACAATCGCT TCGATCCCTG GCGTCCGTTC TTCCGGGTGT CGATCGACGA TCCGGCGGGC ACGCAACTCT ACGTCTCGGC ACGGACCGGC GAGGTGATGC AGCGCACGCG GGCCGGCGAG CGCGCCTGGA ACTGGGGCGG GGCGGTCATC CACTGGCTCT ATTTCACCGC GCTTCGCCAA AGCTTCGTGG CATGGGACCA GACGGTCTGG TGGGTTTCGC TGCTGGGCGT GGCGACCGCC GCCGTGGGGA CGTTTCTGGG CGTCTATCGC TCCCAGAAGC GCCTGCGCGG CCGCAGGTCC GACTGGTCGT CCTTCCGGGG GTGGCTCCGC TGGCACCATG GCCTGGGCCT TGGCGCGGCG ATCTTCGTCC TGACCTGGAT CGTCAGCGGG TGGCTGTCGA TGGACCATGG GCGATTGTTT TCGCGCGGCG TCGCCAGCCA GCAGGCCGCG GCGCGCTATC ATGGGCAAAG CCTGGAGCAG GCGTTCGCGC GAGCGCCGGC CTCTCGCCTG GCGGTCCTCG GTTCGACGCC GGCGATCACG TTCGACGTGG TGGGCGGACG ACCTGTAGCC GCGTCGGACG ACGGCTCGGT GTCCCGGGTG CGACTGGTCG ATAGCGACGC CGAGCCGGTG ACGCCAAACT TGCCGAACAC GCTCCTGATC GCCGCGGCGC GGCGCGCCTG GCCGGTGCGG GCCGAGGTCA CGGAAGGGGA CGATGGCCTC TATCGAGCCG CGGAGGGCAT GGCCGCCCGC ATCGTGCGCC TGCCTCTGGA TCGGCCGGCC GGGACGTCGA TCTATCTCGA CCCGGTGACC GGGCGGATCG TCAGCGTCAT GGATCCCAGC CGGAAGGCTT ATGCGTGGAT CTACTACGCT CTACACACCT TCAACTTTCC CGGCCTCATC GATCGGCCGG TGTTGCGCAA GATCCTTGTT CTGATCCCTT TGCTCCTTGG ACTGATCTTC AGCTGGACGA GCTTGGTGAT CGCCCTCAAG CGTCTGCGGC TCATCGCCGC CTGA
|
Protein sequence | MGKVLRWLSW FHRWTGVGLC LLFFLWFASG AVLLFVPFPS LSDQQRLAAS DRFDSTAITV APSAALAAVG GGDGLRLVAI GEEPRYVVQD AKGELRTIDA RSGQIAPAVT ADQARVIAQR FADVPVKHVS APLRYDQWVV HNRFDPWRPF FRVSIDDPAG TQLYVSARTG EVMQRTRAGE RAWNWGGAVI HWLYFTALRQ SFVAWDQTVW WVSLLGVATA AVGTFLGVYR SQKRLRGRRS DWSSFRGWLR WHHGLGLGAA IFVLTWIVSG WLSMDHGRLF SRGVASQQAA ARYHGQSLEQ AFARAPASRL AVLGSTPAIT FDVVGGRPVA ASDDGSVSRV RLVDSDAEPV TPNLPNTLLI AAARRAWPVR AEVTEGDDGL YRAAEGMAAR IVRLPLDRPA GTSIYLDPVT GRIVSVMDPS RKAYAWIYYA LHTFNFPGLI DRPVLRKILV LIPLLLGLIF SWTSLVIALK RLRLIAA
|
| |
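The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be joined, scored for GC content, and translated follows. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for elongation; alternative start codons permitted by table 11 are not handled here, which is a simplifying assumption.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for elongation (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence shown above (seven codons).
sample = clean_sequence("ATGGGCAAGG TCCTGCGTTG G")
print(translate(sample))  # MGKVLRW, the first seven residues of the protein
```

To score the whole gene, paste the full Gene sequence field into `clean_sequence` and pass the result to `gc_fraction`. The record reports GC content as a rounded percentage, so small differences from the computed value are expected.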