Gene Acid345_0725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0725 
Symbol 
ID4069797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp888152 
End bp889981 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content57% 
IMG OID637982731 
Productglycosyl transferase family protein 
Protein accessionYP_589804 
Protein GI94967756 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCTA GTACGACTCC GGCCCAACCT GTTGTTGTGG GCCGCTCGCA ATCTTCTAAA 
GCCAAGCTGC ACATCGTGAT CATCAGCCTC TTCTGGCTGG TCATTTACAT CCCGGGCCTC
TTCACGCCTG CGCTCCTCGA CGACGCCGAC TCCATCCACG CCGAAGCTGC ACGCGAAATG
ATCACGCGCC ACGACTGGAC CACGCTCTAC ATCGATGGCC TGCGCTATCT CGAAAAAGCC
CCACTGATGT ACTGGGGAAT GGCCAGCAGC TTCAAGATGT TTGGGGTCAC CGAATGGACC
GCCCGACTGC CTCTCACGCT CGGCGTCCTC GCAACCCTGC TCGCCACGTA TGCCATCGGC
AAGCGTAACC TCGGCGAACG CGCGGGATTC TGGGCCGCCA TCATCCTCGG GACGGGCGTC
GGAACTTATA TCTTCACGCG CATTCTCATC CCCGATCTAC TGGTTGGTCT CTTCCTCACC
ATCGGCTTTG ATTTCTTTTT GCGCGGGATT GATCAGGAAA AGCCTTCTAT AGCCTCGGCT
GCCGGCCTCG CCGCTGCTGC CGCGCTGAAT ATCCTGACGA AGGGCTTCAT CGGCGTCATC
TTCCCCATTG GGATCATCGT TGTCTATCTG TTCCTAACGC ACAACCTGAA GCACCTGCTG
AAAATGCGCT GGCTGCTGAT GATTGGCGTG CTGCTGGTGA TTGCCGCGCC GTGGCACATC
CTGGCGAGTC TGGCGAATCC ACCACAGGGA CAGGCGCGCG GCTTCTTCTG GTGGTACTTC
ATCAACGAGC ACATCCTGCG CTACCTCGGA AAGCGTGTCC CGAAGGACTA CGACACGGTT
CCGTTGGCGA TCTTCTGGTC GCTGATGGTG CTGTGGCTGT TGCCGTGGTG TGCTTTCGCG
CTTCAGGCCA TTGCGCGTGT ACCGCGCAAA CTACGTGAAC TCGACCGCCG CGGACGCGCG
CTGGTGCTTT TCACCATCTG GATGGTGTTG ATCCTCTTTT TCTTCAGCTT CTCCACACGC
CAGGAGTACT ACACGATTCC GGCGCTGCCG GCCCTGGCTC TGATCACTGC GGACTGGCTC
GTAGCTGAGG ACGAGTCGCC GGAGAAGAGT TCGCTTCGCA AGTGGGGGAT GATCGGCTCC
GGATTCCTGC TGTTTCTTGG CATCGCGTTT GCCGGCACTG CGTCATACAT TCTGCATCTT
TCGGAATCCG TGCCGCCGGG ATCCGATCTC GCCGATCTTT TGAGGAAAAA TCCGAACGAC
TACGCCATGT CGCTCGGTCA CGTCCTTGAT CTGACGCCGC GCGCGATTGG TCTGTTCCGC
CTGCCGATGG GACTGTTCGC GGGGTCGTTC TTCGTGGGCA GCATCGCGAA CTTCTGGTTG
CGTTGGAAGC GCAAAGCCAA CGCCGCGAAC TGGGCGCTCG CATTGATGAT GTTCCCGGTT
CTCTACTGCG TTCACGTGGG GATGGTGGAC TTCGCACCGA TTCTCTCTTC GAAGACGCTC
GCCGTAGCCA TCGACAAGCA ATGGCAATCA GGCGATGTCA TCGTAGTAAA CGGGCCCTAT
GAAGAAGCCT CGACCTTGAA TTTCTATACT GGGAAACTAA TCCATATCAT TAACAATCGG
GAACACGGAA ACGTCTATAA TGGCGCGCTG TACCCTGACG CTCCGCCCAT CTTTGAAGAT
GATGCCTCGT TTCAGAAGTT GTGGAATGGA CCTCAGCGCA TCTTCGTGTG GACGCAAGAG
GAAAAGGCTC TGTCCGTGCA ACGTCTCGGC AATAGTTACG AGATCGCGCG GAGCGGCGGT
AAGCTCATCC TCAGCAATCG TCCGAACTGA
 
Protein sequence
MISSTTPAQP VVVGRSQSSK AKLHIVIISL FWLVIYIPGL FTPALLDDAD SIHAEAAREM 
ITRHDWTTLY IDGLRYLEKA PLMYWGMASS FKMFGVTEWT ARLPLTLGVL ATLLATYAIG
KRNLGERAGF WAAIILGTGV GTYIFTRILI PDLLVGLFLT IGFDFFLRGI DQEKPSIASA
AGLAAAAALN ILTKGFIGVI FPIGIIVVYL FLTHNLKHLL KMRWLLMIGV LLVIAAPWHI
LASLANPPQG QARGFFWWYF INEHILRYLG KRVPKDYDTV PLAIFWSLMV LWLLPWCAFA
LQAIARVPRK LRELDRRGRA LVLFTIWMVL ILFFFSFSTR QEYYTIPALP ALALITADWL
VAEDESPEKS SLRKWGMIGS GFLLFLGIAF AGTASYILHL SESVPPGSDL ADLLRKNPND
YAMSLGHVLD LTPRAIGLFR LPMGLFAGSF FVGSIANFWL RWKRKANAAN WALALMMFPV
LYCVHVGMVD FAPILSSKTL AVAIDKQWQS GDVIVVNGPY EEASTLNFYT GKLIHIINNR
EHGNVYNGAL YPDAPPIFED DASFQKLWNG PQRIFVWTQE EKALSVQRLG NSYEIARSGG
KLILSNRPN