Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3727 |
Symbol | |
ID | 5901189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4038844 |
End bp | 4039854 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564250 |
Product | HfaB protein |
Protein accession | YP_001685352 |
Protein GI | 167647689 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1462] Uncharacterized protein involved in formation of curli polymers |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGTA AGCGCCTCCA CCTCCTGCTG GCGGGCGCCT GCGCGGCGCT GAGCGCCTGC GGCGGCGTGC CCACGCCCAT GGGCAACGGC AACTACGCCA CGCCGATCGG CACGGCGCCG GTCACCGCCA ACCCCACGCC CTATACGGCC GGCCTCGTTT GCCTGGCCCA GTACGCCCGC GCCAACCACG TGGTCGCCCC GCGCGTCGCC ATCGGTCGCA TCGCCGACTA CACCGGCAAG GAGGAGTCCG ACGGTTCGGG CCGCAAGGTC ACCCAGGGCG CCTCGCTGCT GGCCATGACC GCCTTCGCCA AGGCCGGCAT GCCGATGGTC GAGCGCTTCG ACACCTCGGT CTCCGAGTAC GAGCTGAAAT ACGCCAACAA CAAGCTGATC TCCGACAACC CCAAGCCGGG CGCCGACGTG CCGGCCGAGT ATCGCAAGAT CCTCGCGGGG CAAGTCCCGG GGTCGGACTT CTACGTGGCC GGCGGGATCA CCGAGCTGAA CTACAACATC CGCTCGGTCG GGGCCGACGC CTATGTCGGC GACAAGGACA CCGACGGGCT GAAGGGCAAC TTCCGCCGCC GGGTGTTCGT GATGAACATC GCCATCGACC TGCGACTGAT CAACACCCGC ACCCTGGAGG TGGTCGACGT GATCTCCTAC CAGAAGCAGG TGGTCGGCCG CGAGATCAGC GCCGGCGTGT TCGACTTCCT CAACGGCAAC ATCTTCGACA TCTCCGCGGG CCGCGGGGCG CTGGAGCCCA TGCAACTGGC CGTGCGCTCG CTGATCGAGC GGGCCGCCAT CGAGATGAGC GCCAACCTCT ATGGCATGCC GGGTCCGCAA AGCTGCATGA GCTACGATCC CTATGCCAGC AACACGGTCG GGGCCACTGG GGCGTTCGTC CCCGCCTACA ACAACCTGGG AACCAACAAT GCGCAAACCC GCGAAGACCC GTCTCGCTGG AATGATCGCA GCGATCCCAA TGTGCGCGAT GCTGGCCGGG GTCGCTACTA G
|
Protein sequence | MASKRLHLLL AGACAALSAC GGVPTPMGNG NYATPIGTAP VTANPTPYTA GLVCLAQYAR ANHVVAPRVA IGRIADYTGK EESDGSGRKV TQGASLLAMT AFAKAGMPMV ERFDTSVSEY ELKYANNKLI SDNPKPGADV PAEYRKILAG QVPGSDFYVA GGITELNYNI RSVGADAYVG DKDTDGLKGN FRRRVFVMNI AIDLRLINTR TLEVVDVISY QKQVVGREIS AGVFDFLNGN IFDISAGRGA LEPMQLAVRS LIERAAIEMS ANLYGMPGPQ SCMSYDPYAS NTVGATGAFV PAYNNLGTNN AQTREDPSRW NDRSDPNVRD AGRGRY
|
| |