Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0025 |
Symbol | |
ID | 5897737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 30671 |
End bp | 32257 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560508 |
Product | apolipoprotein N-acyltransferase |
Protein accession | YP_001681661 |
Protein GI | 167643998 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0815] Apolipoprotein N-acyltransferase |
TIGRFAM ID | [TIGR00546] apolipoprotein N-acyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.129672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.499637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCTGT CGGCCTGGCC GAAACTGGCG CCGTGGCGGA CCCGAGGCCT GGCGCTGGCC GCAGGCCTCG CCGCCGCCCT GGCCCATCCG CCGTTCGGGC TCTTGCCGGG TCTGCTGGGC TACGCCGTGT TGCTGTGGCT GCTGGACGCG ATCGACGGGC CCCGGCCGCT GCGTTCGGCC TTCCTGCGCG GCTGGCTGAT GGGGCTCTCC TATTTCGCGC TCTCCACCTG GTGGATCGCC GAGGCCTTCA TGGTCGACGC CGCCAACCAG GGCTGGATGG CTCCGTTCGC GGTGGCGGCC ATCGCGGCCG GCCTGGCGCT GTTCTGGGGC TTGGCGGCGG TGCTCTATCG CCTGGTGAAG CCGGCCGGCG CGCGGCGGGT GCTGGTGTTC GCCGGCGCCT TCGCGGCCCT GGAATGGACG CGCGGCCACA TCCTGACCGG CTTTCCCTGG AACCTGCCCG GCGAGACCTG GCGGGCGGGC TCGGCGGTGT CGCAGTTCGC CAGCGTGGTC GGCGCCTATG GCCTGACCTG GATCACCCTG GCCATCGCCG CCGCTCCGGC CGTGTGGCGG GAGGGGAGGC GCGGGCGGAT CGTCGTGGGG GCGGCCGCCG TCGTCCTGGC GGCGATCTGG CTGCGCGGGA TGCTGGGCTT TCCCGTCGCG GTCACGCCCA CCGCCCATCC GCCGCCGCCC ACGACCGTCC GCATCGTCCA GGCCGATATC CCGCAGGAAT CCAAGTGGGA CGCCGGGCGC TTCGCCCAGA TCGTCCAGGC CTATGTCTCG CTGACCGCCA AGCCCTATGC CGGCAAGCCC GCCGACATCG TCGTCTGGCC CGAGGGCGCC CTGCCGCTGG CGATCAACGA CTACATGGTC CCCGGCAGCT GGGTGCGGCA GGCGATCATC GACGCGCTGC ATCCCGGCCA GTTGCTGCTG ATCGGCGGCT ATCGCTACGA GGGGACGCCC GACAAGCCGG TCTATTACAA CAGCCTGGTC GCCCTGCGCC GAGAGGCCGC CGACGTCGTG GTGGTCGGGG TCTATGACAA GCACCGGCTG GTGCCCTTCG GCGAATATCT GCCGGCCGAC GCCCTGATGA CCAGGCTGGG CGTCAAGAGC ATGGCCCACC TGGGCGAGGG CTTCGCCACC GGCCCGCGTC CGGCGCCGCT GCGGGTCGCG CCGGACCTGC TGGTCCAGCC GCTCATCTGT TACGAGAGCT TGTTTCCGCG CCTAGCGGAA CCGACCCCTG GGGTGCGCGC TATCGTCAAT GTCTCGAACG ACGCCTGGTT TGGCGTCACT TCCGGACCGC TGCAGCACCT GAACCTGGCC AGCTACCGCG CGATCGAGAC CGGCCTGCCG ATCATTCGGT CCACCCCGAC GGGTGTCTCC GCGCTGATCG ATGCGCGGGG GCGCATCGCC GGCCGCCGCT TGGGTCTGGG AGAAAGCGGC GTGATCGACG GCGTTCTGCC GCAAGTCGTG GCGCCGACCC TGTTCGCCAA ACTTGGCCAT TGGCCCTTCG CAATGCTACT TTTGATTTCA ATTGGGGCCG GTATTCCCCA ACGAGCAGGT CGGGGCTTGG AAAAAGCCGC GAACTGA
|
Protein sequence | MNLSAWPKLA PWRTRGLALA AGLAAALAHP PFGLLPGLLG YAVLLWLLDA IDGPRPLRSA FLRGWLMGLS YFALSTWWIA EAFMVDAANQ GWMAPFAVAA IAAGLALFWG LAAVLYRLVK PAGARRVLVF AGAFAALEWT RGHILTGFPW NLPGETWRAG SAVSQFASVV GAYGLTWITL AIAAAPAVWR EGRRGRIVVG AAAVVLAAIW LRGMLGFPVA VTPTAHPPPP TTVRIVQADI PQESKWDAGR FAQIVQAYVS LTAKPYAGKP ADIVVWPEGA LPLAINDYMV PGSWVRQAII DALHPGQLLL IGGYRYEGTP DKPVYYNSLV ALRREAADVV VVGVYDKHRL VPFGEYLPAD ALMTRLGVKS MAHLGEGFAT GPRPAPLRVA PDLLVQPLIC YESLFPRLAE PTPGVRAIVN VSNDAWFGVT SGPLQHLNLA SYRAIETGLP IIRSTPTGVS ALIDARGRIA GRRLGLGESG VIDGVLPQVV APTLFAKLGH WPFAMLLLIS IGAGIPQRAG RGLEKAAN
|
| |