Gene Acid345_2535 details


Gene Information

Locus tag: Acid345_2535
Symbol:
ID: 4072179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2994396
End bp: 2995544
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 62%
IMG OID: 637984552
Product: lipid-A-disaccharide synthase
Protein accession: YP_591610
Protein GI: 94969562
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0763] Lipid A disaccharide synthetase
TIGRFAM ID: [TIGR00215] lipid-A-disaccharide synthase
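
The start, end, gene length and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record, but both agree with the listed values). The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 2994396
END_BP = 2995544


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1149 382
```

Both results match the Gene Length (1149 bp) and Protein Length (382 aa) fields.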


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAG TCCTCATCTC CGCCGGCGAA GCCTCCGGCG AAATGTACGG CGCCGCGCTC 
TTAGACGCCC TGCGCAAGCT CTCGCCCGAC CCGGTCGAGG CCTTCGGTTT GGGCGGTGAA
AAAATGCGCG CCGCCGGCTG CGACATCATC GTGGACTCCA AAGATGTCGC CGTCGTCGGC
ATCGCCGAGG TCGTCGCCCA CCTCCCGCGC ATCTACGGCG AATTTCACAA GCTCCTGCGG
GAAGCCGACC GCCGCAAGCC CGACGTCGCC GTCTTAATCG ACTTCCCCGA CTTCCACTTC
CGCCTCGCGA AGGCGCTCCA CGCACGCGGC ATCCCGGTCG TCTATTACGT CAGCCCGCAG
CTCTGGGCCT GGCGCCGCGG ACGCATCAAG CTCGTCCAAC GCTACGTCAA GAAGATGCTG
GTCATCTTCC CCTTCGAGGA GCAGTTCTAC CGCGAGCACA ACGTGGAAGC CGAATTCACC
GGCCATCCTC TCGGCGAGTT AAGCGTCACC GTCGATCCCC GCACCGAATT CGCCGTCCGC
TACGGTCTCG ACCCCGCCAA GCCCTGGGTC GGCATCCTCC CCGGCAGCCG CCGCAAGGAA
GTCCAGATGA TCCTCCCCAC CCTCATCGAC GCCGCAAAAA AACTCGGCCC AGCCAACGAG
TACCTGCTTC CCGTCGCATC TACTCTCGAC GCGGGCTGGA TGCAGGCGCA ATTGCTTGCA
ATTCCTCAGC CGCCGCGGGT CACATTAACC AGTGACGCAC GTCAAACACT GGTACAGAGC
CGCGCAGCCA TGGTCGCCAG TGGTACCGCT ACGGTGGAAG CCTCGGTACT CGGCACGCCG
TTCGTAATGG TGTACCGCGT CGCGCCGCTT AGTTGGAGAG TCGGCCGCCG ATTGGTGAAG
TTAGATCGTT TTGCGATGCC GAATCTAATC GCCGGACGCG AAGTCGTCCG CGAGCTGGTG
CAAGAAAATT TTACGGCCGA CAAAGTTGCG GCCGAAGTAA GCGCCCTGAT TGAGGATGGC
CCGCGCCGCG CGCAGGTATT GAAGAATCTG GCCGAAGTTC GAGAGCACTT GCAGTCAGGC
CGAACGAATG AGTCGGCGGC AGAACGCGCA GCCCGGTCAG TTTTATCAGT TGCGCAGCGG
AAGGATTGA
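
The GC content field can in principle be recomputed from the gene sequence once the display spacing is removed. The sketch below assumes GC content means the fraction of G and C bases over the full sequence length; how the source database rounds or treats ambiguous bases is not stated. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to display 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, exactly as displayed.
first_line = "ATGCTGAAAG TCCTCATCTC CGCCGGCGAA GCCTCCGGCG AAATGTACGG CGCCGCGCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence instead of the first line gives the value to compare against the GC content field.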
 
Protein sequence
MLKVLISAGE ASGEMYGAAL LDALRKLSPD PVEAFGLGGE KMRAAGCDII VDSKDVAVVG 
IAEVVAHLPR IYGEFHKLLR EADRRKPDVA VLIDFPDFHF RLAKALHARG IPVVYYVSPQ
LWAWRRGRIK LVQRYVKKML VIFPFEEQFY REHNVEAEFT GHPLGELSVT VDPRTEFAVR
YGLDPAKPWV GILPGSRRKE VQMILPTLID AAKKLGPANE YLLPVASTLD AGWMQAQLLA
IPQPPRVTLT SDARQTLVQS RAAMVASGTA TVEASVLGTP FVMVYRVAPL SWRVGRRLVK
LDRFAMPNLI AGREVVRELV QENFTADKVA AEVSALIEDG PRRAQVLKNL AEVREHLQSG
RTNESAAERA ARSVLSVAQR KD