Gene Acid345_1783 details


Gene Information

Locus tag: Acid345_1783
Symbol:
ID: 4072843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2162460
End bp: 2163500
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 54%
IMG OID: 637983791
Product: acyltransferase 3
Protein accession: YP_590858
Protein GI: 94968810
COG category: [I] Lipid transport and metabolism
COG ID: [COG1835] Predicted acyltransferases
TIGRFAM ID:
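
The gene and protein lengths listed above can be checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length_bp(2162460, 2163500)
print(gene_len)                     # 1041
print(protein_length_aa(gene_len))  # 346
```

Both values agree with the Gene Length and Protein Length fields of this record.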


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACA GAATTACTCA ATTAGACGGG CTGCGGGCAT TCGCTTTTCT CGCCGTATTT 
GTCCATCACG CATTCAAGGC GCCATTGCTG TGGATGGGGG TCGACCTCTT CTTCGTGATG
AGCGGTTTCC TCATCACGCG TATTCTCCTA GGTTTGAAGG AGAAGGGAAA AGGCTTCTTC
GGCCCGTTTT ACGCTCGTCG CGCCCGACGA ATTTTGCCCC CCTACCTCTT GTTTATCGTG
TTCAGCACCA TCGTCATTCC AGTTGCATGG AAGCACATCT GGTTCTACTA CGCTTTCTTC
GCGGCAAACG TGGCCGAAAT GCTTGGTGTT GGTGGTGGCC GCGCCCTCGC TCCTCTGTGG
TCCCTTGCAG TCGAAGAGCA GTTCTACTTG GTGTGGCCGT TCGTTGTGCT CCTAACTTCC
CGCAAGAACC TGAAGCGGAT CTGCCTTGTC GGCGTCTTCG CGTTGCCGAT CATTCGCGCC
TTGGCCACTC CTTACGTTTC GTTTTCTGTC ATCTACCACA TGACGTTTTT CCGAATGGAC
CTTTTGATGG CAGGAAGTTT CGTCGGAATG GTGTGGGATG AGGACAGGAG CAAAGTAATG
TCCTATCAAC GGATGGCGGG AGTTATGACC CTCATCGTCC CCCTCGCAAT TCTCGCGTTG
GCCAGGATCC CAAGTTTTCG CACCGGCGCC AACTCGGTTC TCTTTAACGG AGTCGGCTAC
AGTCTCGCGT TAGTCATGTT CACGTCCGTC CTGGTGTATT CGTTGAGCAT TCGCGAAGGC
ATGGTTTTTC GGCTGCTCAC ATCGGGCCTC CTTACCTATC TGGGGAGAAT CAGCTACATG
ATGTATCTCG TTCATGCCAC GATGATCGTG CTCGTGGAAA AGCTGGCTCT TCACGGTCAC
AAAGGACTGG TCGTCGAGAG CGCGATCGCC CTCGCGGCAA CCGTTGCGTT CGCGTCACTC
TCTTGGAAGT TTGTCGAAGG ACCGATTCTC AACGCCGGCC ACGTCCCGCC CAAACTTACA
AATGTCGCGG CGGCCAGCTA G
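
The GC content field of this record can, in principle, be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming the sequence uses only the letters A, C, G and T and that whitespace is layout only. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence: 6 of 20 bases are G or C.
print(gc_percent("ATGAACAACA GAATTACTCA"))  # 30.0
```

Passing the full gene sequence gives a value that can be compared with the GC content field after rounding.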
 
Protein sequence
MNNRITQLDG LRAFAFLAVF VHHAFKAPLL WMGVDLFFVM SGFLITRILL GLKEKGKGFF 
GPFYARRARR ILPPYLLFIV FSTIVIPVAW KHIWFYYAFF AANVAEMLGV GGGRALAPLW
SLAVEEQFYL VWPFVVLLTS RKNLKRICLV GVFALPIIRA LATPYVSFSV IYHMTFFRMD
LLMAGSFVGM VWDEDRSKVM SYQRMAGVMT LIVPLAILAL ARIPSFRTGA NSVLFNGVGY
SLALVMFTSV LVYSLSIREG MVFRLLTSGL LTYLGRISYM MYLVHATMIV LVEKLALHGH
KGLVVESAIA LAATVAFASL SWKFVEGPIL NAGHVPPKLT NVAAAS
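
The protein sequence corresponds to the gene sequence translated codon by codon. This is a minimal sketch, assuming the amino acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons). The helper `translate_cds` is hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the one-letter amino
# acid string used for the standard code and for table 11; '*' marks stops.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # whitespace is layout only
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("ATGAACAACA GAATTACTCA ATTAGACGGG"))  # MNNRITQLDG
```

The last 21 bases of the gene sequence (AATGTCGCGG CGGCCAGCTA G) translate to NVAAAS followed by the TAG stop codon, which matches the end of the protein sequence.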