Gene Acid345_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1038 
Symbol 
ID4073125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1303989 
End bp1306046 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content58% 
IMG OID637983045 
Productglycosyl transferase family protein 
Protein accessionYP_590115 
Protein GI94968067 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCAG CCGACACGAC CCTCAGAGCC GGAGCCGCGA GAAAGGCCTA CTCTGCGCCC 
TCCGAGCGCT GGACTCACTC CCCATTTTGG ATGGTGCTAA TCGGCTTTGT CGCTCGTGTG
CTCTACATCT GGCTAGCGCA CACCTACAAG TTCCGCGCCA ACGACGAAAA TTTTTCGTTC
GGCTGGGAGA TCGGCCGCAT CGCCCAATCC ATCGCGACCG GCCACGGATT CTCCTCACCG
TTTTACGGCG ATACCGGGCC CACCGCCTGG GCCGCGCCGG TGTATCCGTA CATCGTTGCC
GGCGCTTTCA AGATTTTCGG TCTCTACACG CGCTCCGCGT CCTTCGCGCT GCTGACCTTC
AACAGCTTTT TCTCCGCCCT CACCGCGTGG ACGGTCTACC GCATCGCCCA GCGGATGTTC
AACGAGACCG TTGCCGTGTG GTCAGGTTGG CTCTGGGTCT TCCTGCCGAA CCTCGCCTAC
TGGGGCGTAC GCATGGTGTG GGAGACAAGT CTTTCGACCT TCCTGCTCAC CGCGCTCATT
CTGCTCACGC TCAAGATGGA AGGCGACGAT CGCTTCGGCC GCTGGATTTT CTGGGGCGCA
CTCTGGGGAT TCGTCGCACT CACAAACACA TCCATCTGCT CGTTCACTCC ATTTGCCGGA
TTATGGCTCG TCTACAAATT GCATCAGCAA CGGAAGTCGT GGTTCCTGCC GGCGCTTGCC
AGCGCGATCG TCTTCTTCGC CGTGCTCTCG CCCTGGCTGA TACGCAACGA TCGCGTCTTC
CACAAACCGT TCCTACTGCG CGGAAACTTC GGTGTCGAGT TACATTGCGG CAACAACGAC
ATGGCTGAAG GCCTCTGGGT CGCCCTCGCG CATCCCACGC AAGGCATCCC GGTCTATCAG
CGTTACAAAG CCATGGGTGA AGTCGCCTAC GTTGCGCAGA CCCAGAAAGA CACGGTCGCC
TGGATCAAAG CGCACAAAGA AAGATTCGCG GTGCTGACCT TCCGCCGCTT CGTCTATTTC
TGGAGCGGAC CCCCGCGTAT CTCAAAAGTT TTTTACGAAG AAGAATTGAA AAATGCGGGC
TATCTCGCGA CGGCATGCCT CGGATGGTGG GGCTGTTTTC TCGCGCTCAA GAAGCGCATT
CACGCCGCCG GGCTCTTCGC CTGTCTTCTG ATTTTCTACC CGGCGGTTTA CTATATTACT
TTCCCGAATC CGCGCTACCG GCATCCGCTT GATCCGGTCA CTCTGATCCT GGGGGTCTTC
TTGGTTTCAC AAACTCGGCC GGATACCGAG GACACAACTG TGCCCGAATC GCTGGCTACA
ACCGAGCCTC AATCGCCGGT GTTCCACACG CTGTCCATCA TCATCCCTGT CTACAACGAG
CGCCGCACTG TGATGCAGCT GCTCAACGCA GTCGCACGCC AGCCCATCGG CGAACTGCAC
AAAGAGATGG TGATCGTGGA TGATTTCTCC ACCGACGGCA CCCGTGAGTT CCTCCAGGAG
ATGGATCTTC CCTCACTTTT CGGCAAGCAA GGCATCACGG TCAAGGTGGT TCTGCATGAA
ATGAACAAGG GAAAGGGCGC CGGCGTTCGC ACCGCCCTCG AACATTGCAC CGGCGAACTC
GTTCTCATGC AGGATGCCGA CCTCGAATAC GATCCGGCCG ACTACCCGGC CCTGCTCAAG
CCCTTGCTGG ACGGTCATGC CGACGCCGTC TTCGGCAACC GCTTCCACTA CGGATCGCAT
CGCGTTCCGC GTTTCTACCG CTTCGTTCTG AATCGTGTCT TCTCCATCAC CTGCAACATG
CTCACTGGCC TCGCGCTCAA CGACGTCACC GCCTGCTACA AGGTCCTGAC TCGCGATCTG
TTCAATCGCC TTAACCTGAA GAGCGATCGT TTCAGCTTCG AAACCGAACT CACGGTGAAA
CTCGCGAAGC TCAAAGCGCG CATCTACGAG GTCCCCATCG TCTATCACGG CCGTACTTAT
GCCGAGGGTA AGAAGATCAA TTGGACCGAT GGCGTGGCAG CGTTTTACCA CCTGATTCGC
TTCCAGCTCA AAGACTAG
 
Protein sequence
MTSADTTLRA GAARKAYSAP SERWTHSPFW MVLIGFVARV LYIWLAHTYK FRANDENFSF 
GWEIGRIAQS IATGHGFSSP FYGDTGPTAW AAPVYPYIVA GAFKIFGLYT RSASFALLTF
NSFFSALTAW TVYRIAQRMF NETVAVWSGW LWVFLPNLAY WGVRMVWETS LSTFLLTALI
LLTLKMEGDD RFGRWIFWGA LWGFVALTNT SICSFTPFAG LWLVYKLHQQ RKSWFLPALA
SAIVFFAVLS PWLIRNDRVF HKPFLLRGNF GVELHCGNND MAEGLWVALA HPTQGIPVYQ
RYKAMGEVAY VAQTQKDTVA WIKAHKERFA VLTFRRFVYF WSGPPRISKV FYEEELKNAG
YLATACLGWW GCFLALKKRI HAAGLFACLL IFYPAVYYIT FPNPRYRHPL DPVTLILGVF
LVSQTRPDTE DTTVPESLAT TEPQSPVFHT LSIIIPVYNE RRTVMQLLNA VARQPIGELH
KEMVIVDDFS TDGTREFLQE MDLPSLFGKQ GITVKVVLHE MNKGKGAGVR TALEHCTGEL
VLMQDADLEY DPADYPALLK PLLDGHADAV FGNRFHYGSH RVPRFYRFVL NRVFSITCNM
LTGLALNDVT ACYKVLTRDL FNRLNLKSDR FSFETELTVK LAKLKARIYE VPIVYHGRTY
AEGKKINWTD GVAAFYHLIR FQLKD