Gene Acid345_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2341 
Symbol 
ID4069153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2772737 
End bp2774317 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content60% 
IMG OID637984357 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_591416 
Protein GI94969368 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.254934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0837539 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTTC TCTCCGGCGT TCTTCAGGTG CTGATATTTC CGCGGCCTGC TTTGTACTTC 
CTGTGTTGGA TCGCCGTCGC CCCTCTCATC ATCGCCATTC TGCGCGCTCG CGATCCCGAT
GCCGTGCAGC TTCTCGCCGA AGGTGGCTCG AGCTTCCTCG CGCCTGCCAG CCTGAAGCAA
GGCTTCTTCC TCGGCTACGC AAGCGGCATC GTGTGGTACC TGGGAAGCTG CTACTGGGTC
TTCCACGTCA TGCACCTCTA CGGAGGTCTG AACGTCTTCG TCTCCGCGAT CCTGCTCATC
ATGTTCGCAA TGTACCTTGG GCTTTATCAC GGGCTGTTCG GAATCTTGCT CGCACTCAGT
GCGCGTAAGC GCCAGGGATA CAGCCTGCGC GCTCTCGTGC TCGCGCCGTT TCTCTGGGTC
TCGGTGGAAC TGGCGCGAAC CTACGTCACC GGCTTCCCAT GGAACCTCCT CGGCACCGCG
CAGGTGGACA ACATTCCCCT GGCTCGCATT GCGACCTTCA CCGGCGTCTA CGGCTTGTCA
TTCGAAATCG CCCTTGTGAA CGCCGCCTTT GCCGCGGCTG CTCTCGTTGC TCGCCGCCAG
CGCGCCACCA TGCTGGTTGC CGCCATCTGC GCGACGATTG GCCTTCAAGT CTGGCGGCTC
GTCCCGATGC AAGCGCCGCC GCCCAACAGG TACGCGGTGC TGGTGCAGGA AAATATCCCG
GTCAAAGACA TCGAGTGGAC CGAGCCTGCG CTCAACTCCA CGCTCGCCGA CCTTGCGAAG
TTCAGCGTGG CGCCGAACAA TGCGACTGCC GACGATCCCG GCCTCATCAT CTGGCCTGAA
TCACCCGCGC CGTTCTTCGT CACCGACTAC CACTTCCGCG GCGCGATGAT GAACATCGCC
CGAGAGACGC ACTCGTATGT AATTGCCGGC AGCATCGGTA TCGAGAACAC GGGCAGGGAA
GACGTACAAC CCAACGTCTA TAACTCTGCC GTGCTCATCA CGCCCGACGC CAAGTGGTCT
GCGCGCTACG ACAAGAACCA CCTCGTGCCG TTCGGCGAAT ACGTTCCGTT CGCATCTCTT
CTGAGTTTTG CCAAATCGCT TACCCATGAA GTCGGGACCT TCAAGCCCGG CCACGAACGC
AAGCTCCTAG ATCTGGGCAA GCAGCAAGTT GGCGTCTTCA TCTGTTACGA GAGCATTTTC
CCGAATGAAG TGCGCCAGTT CGCCGACAAC GGCGCCGACC TCTTCATTAA CATTTCCAAC
GACGGTTGGT TCGGCGACAC CGGCGCTCCT GGTCAACACT TGAACATGGC GCGAATGCGT
GCGATCGAGA ACGAGCGCTG GCTGCTGCGC TCCACGAACA CTGGAATTAC TGCTTCGATC
GACCCTTACG GACGCATTGC GGCCGTCCAG CCGCGCAACG TTCGCGCCTA CATGCAAGCG
CCCTACGCGT ATCTAAGCGG GAAGACCTTC TATACTGAGC ACGGCGATTG GTTCCCCATC
TGCTGTGTCA TAATTTCGTT AGCGGCGCTC GTCATCCGGC ATCGCCCGGA GGCAGAAATG
CCCCAGCCGC AACCGGTCTA A
 
Protein sequence
MAVLSGVLQV LIFPRPALYF LCWIAVAPLI IAILRARDPD AVQLLAEGGS SFLAPASLKQ 
GFFLGYASGI VWYLGSCYWV FHVMHLYGGL NVFVSAILLI MFAMYLGLYH GLFGILLALS
ARKRQGYSLR ALVLAPFLWV SVELARTYVT GFPWNLLGTA QVDNIPLARI ATFTGVYGLS
FEIALVNAAF AAAALVARRQ RATMLVAAIC ATIGLQVWRL VPMQAPPPNR YAVLVQENIP
VKDIEWTEPA LNSTLADLAK FSVAPNNATA DDPGLIIWPE SPAPFFVTDY HFRGAMMNIA
RETHSYVIAG SIGIENTGRE DVQPNVYNSA VLITPDAKWS ARYDKNHLVP FGEYVPFASL
LSFAKSLTHE VGTFKPGHER KLLDLGKQQV GVFICYESIF PNEVRQFADN GADLFINISN
DGWFGDTGAP GQHLNMARMR AIENERWLLR STNTGITASI DPYGRIAAVQ PRNVRAYMQA
PYAYLSGKTF YTEHGDWFPI CCVIISLAAL VIRHRPEAEM PQPQPV