Gene Caul_0025 details


Gene Information

Locus tag: Caul_0025
Symbol:
ID: 5897737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 30671
End bp: 32257
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 70%
IMG OID: 641560508
Product: apolipoprotein N-acyltransferase
Protein accession: YP_001681661
Protein GI: 167643998
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0815] Apolipoprotein N-acyltransferase
TIGRFAM ID: [TIGR00546] apolipoprotein N-acyltransferase
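
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(30671, 32257)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```

Under these assumptions, 32257 - 30671 + 1 = 1587 bp, and 1587 / 3 - 1 = 528 aa, matching the values in the record.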


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.129672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.499637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATCTGT CGGCCTGGCC GAAACTGGCG CCGTGGCGGA CCCGAGGCCT GGCGCTGGCC 
GCAGGCCTCG CCGCCGCCCT GGCCCATCCG CCGTTCGGGC TCTTGCCGGG TCTGCTGGGC
TACGCCGTGT TGCTGTGGCT GCTGGACGCG ATCGACGGGC CCCGGCCGCT GCGTTCGGCC
TTCCTGCGCG GCTGGCTGAT GGGGCTCTCC TATTTCGCGC TCTCCACCTG GTGGATCGCC
GAGGCCTTCA TGGTCGACGC CGCCAACCAG GGCTGGATGG CTCCGTTCGC GGTGGCGGCC
ATCGCGGCCG GCCTGGCGCT GTTCTGGGGC TTGGCGGCGG TGCTCTATCG CCTGGTGAAG
CCGGCCGGCG CGCGGCGGGT GCTGGTGTTC GCCGGCGCCT TCGCGGCCCT GGAATGGACG
CGCGGCCACA TCCTGACCGG CTTTCCCTGG AACCTGCCCG GCGAGACCTG GCGGGCGGGC
TCGGCGGTGT CGCAGTTCGC CAGCGTGGTC GGCGCCTATG GCCTGACCTG GATCACCCTG
GCCATCGCCG CCGCTCCGGC CGTGTGGCGG GAGGGGAGGC GCGGGCGGAT CGTCGTGGGG
GCGGCCGCCG TCGTCCTGGC GGCGATCTGG CTGCGCGGGA TGCTGGGCTT TCCCGTCGCG
GTCACGCCCA CCGCCCATCC GCCGCCGCCC ACGACCGTCC GCATCGTCCA GGCCGATATC
CCGCAGGAAT CCAAGTGGGA CGCCGGGCGC TTCGCCCAGA TCGTCCAGGC CTATGTCTCG
CTGACCGCCA AGCCCTATGC CGGCAAGCCC GCCGACATCG TCGTCTGGCC CGAGGGCGCC
CTGCCGCTGG CGATCAACGA CTACATGGTC CCCGGCAGCT GGGTGCGGCA GGCGATCATC
GACGCGCTGC ATCCCGGCCA GTTGCTGCTG ATCGGCGGCT ATCGCTACGA GGGGACGCCC
GACAAGCCGG TCTATTACAA CAGCCTGGTC GCCCTGCGCC GAGAGGCCGC CGACGTCGTG
GTGGTCGGGG TCTATGACAA GCACCGGCTG GTGCCCTTCG GCGAATATCT GCCGGCCGAC
GCCCTGATGA CCAGGCTGGG CGTCAAGAGC ATGGCCCACC TGGGCGAGGG CTTCGCCACC
GGCCCGCGTC CGGCGCCGCT GCGGGTCGCG CCGGACCTGC TGGTCCAGCC GCTCATCTGT
TACGAGAGCT TGTTTCCGCG CCTAGCGGAA CCGACCCCTG GGGTGCGCGC TATCGTCAAT
GTCTCGAACG ACGCCTGGTT TGGCGTCACT TCCGGACCGC TGCAGCACCT GAACCTGGCC
AGCTACCGCG CGATCGAGAC CGGCCTGCCG ATCATTCGGT CCACCCCGAC GGGTGTCTCC
GCGCTGATCG ATGCGCGGGG GCGCATCGCC GGCCGCCGCT TGGGTCTGGG AGAAAGCGGC
GTGATCGACG GCGTTCTGCC GCAAGTCGTG GCGCCGACCC TGTTCGCCAA ACTTGGCCAT
TGGCCCTTCG CAATGCTACT TTTGATTTCA ATTGGGGCCG GTATTCCCCA ACGAGCAGGT
CGGGGCTTGG AAAAAGCCGC GAACTGA
 
Protein sequence
MNLSAWPKLA PWRTRGLALA AGLAAALAHP PFGLLPGLLG YAVLLWLLDA IDGPRPLRSA 
FLRGWLMGLS YFALSTWWIA EAFMVDAANQ GWMAPFAVAA IAAGLALFWG LAAVLYRLVK
PAGARRVLVF AGAFAALEWT RGHILTGFPW NLPGETWRAG SAVSQFASVV GAYGLTWITL
AIAAAPAVWR EGRRGRIVVG AAAVVLAAIW LRGMLGFPVA VTPTAHPPPP TTVRIVQADI
PQESKWDAGR FAQIVQAYVS LTAKPYAGKP ADIVVWPEGA LPLAINDYMV PGSWVRQAII
DALHPGQLLL IGGYRYEGTP DKPVYYNSLV ALRREAADVV VVGVYDKHRL VPFGEYLPAD
ALMTRLGVKS MAHLGEGFAT GPRPAPLRVA PDLLVQPLIC YESLFPRLAE PTPGVRAIVN
VSNDAWFGVT SGPLQHLNLA SYRAIETGLP IIRSTPTGVS ALIDARGRIA GRRLGLGESG
VIDGVLPQVV APTLFAKLGH WPFAMLLLIS IGAGIPQRAG RGLEKAAN
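
The gene and protein sequences above are related through translation table 11, and the GC content is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the method used by the database. It assumes the sequences are pasted in with their space-separated blocks of ten, that the first codon is the initiator and is therefore read as Met (here GTG), and that translation stops at the first in-frame stop codon. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; tables 1 and 11 share this
# mapping and differ in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumed initiator)."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # The initiator codon encodes Met regardless of its usual meaning.
        aa = "M" if i == 0 else CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGAATCTGT CGGCCTGGCC GAAACTGGCG"))  # MNLSAWPKLA
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` gives values that can be compared with the GC content and protein sequence in the record.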