Gene Acid345_4480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4480 
Symbol 
ID4070963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5317651 
End bp5319207 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content59% 
IMG OID637986519 
Productglycosyl transferase family protein 
Protein accessionYP_593554 
Protein GI94971506 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.963285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAC CGGCGCAAAC GGCCCCCTTG CAGAACTCCA GCTTCACGCA TTCGCTTCTG 
TTTCCGGCAT GGTGCGTTGC CGCCGTCATA TTCATCCTGC ACATCGTCAC CGCGCCGTAT
TACGGATACT TCCGCGACGA ACTCTATTTC ATCGCGTGCA GCGATCACCT TGCCGCGGGC
TACGTAGACT TCGCGCCACT CGCTGCGTGG ATCCTCAAGG CCAACCGCGT CATCTTCGGA
GATTCGCTCT ATGCGTTGCG CTGGCTGCCG GGACTCGCGC ACGCGGCGCT GGTCGTTCTT
ACCGGCATGC TCGCACGGGA GTTAGGCGGC AAGCGCTTTG CGGTGCTCCT GTCGGCAATT
GCCGTCGGGT TCACGCCAGT GATCCTCTGC GATAGCACTC GCTATTCGAT GAATCCCTTC
GAGCCGCTCT TCTGGATGAG CGCACTCTAC CTGCTGATTC GCGTTATCAA CGGCGGGGAT
GAGCGTCTTC TACTCGGCCT CGGGGTGGCC TGCGGTCTCG GCGTGGAGAA CAAACACTCG
ACCATCTTCT TCATGGCGGC CCTGCTGCTC GCGCTCGCGC TTACACCGCA GCGAAGGTTC
TTCGCGAGCA AGTGGTTCTG GGGTGCAGTC GGAATCACGA TCGCTCTGGC GCTTCCCAAC
TTGATCTGGC AAATCCAGCA CGACTATCCG ACGTACGTTG ATCTGCACAA CGTGAAGGTC
ATGCACAAGA ACGTTGAATT GCCGCCACTG CCGTGGATCA AGCAGCAGAT CGTGATGTTG
AACCAGGCCC TCGCACTGCT GTGGATCCCG GCGATCGGGT TCCTGCTCTG GCATCGCGAT
GGGAAAAAGT ATCGCGTGGT CGGCCTGACC TTTCTCTTCT TCTTTCTCGA ACTCATGCTG
ATGAAGGGCA AGGATTACTA TGTCGCGCCC ATCTATCCGG TGATGTTTGC TGCAGGCTGC
GTGCTGTGGG AGACATTGTC GGAGGTGCGT TTCCGTTGGA TACGGCGGAC GCTGGCGGTC
GTGACCGTAG TCGCAAGCCT AGTCGCCGTC CCGATCGTTG TACCTATACT TCCGCCGGAG
AAAGCCAACG CGTATATTCG CGCCCTCGCT GGGGACGGCC AGAAGACGGA AGTCGGTATG
CACTCGCAGC TTCCGCAATA TTTCGCCGAC GAATTCGGCT GGCCTGAACT GGTTGAAAAG
ACCGCGCAGC TTTATCATTC GCTCCCGCCA GAAGAGCAGG CCAAGACCGC GATTCTCGGC
GGAAGTTATG GTGACGCCGG CGCTATCGAT TTCTTCGGCG CGAAATATGG GCTGCCGAAA
TCCATCAGCG CGCACCAGAA CTACTGGTAC TGGGGTCCGC GAGACTATAC CGGCGAGTCG
GTGATCATCC TGCACTGGCG GCGCTCCTCG GTTGAGAAGC ACTGCTCCTC GGTCGTGGAA
GGGCCGACGC TCGATCATCC CTGGGCGATG GAAGAGGAGC ACTACACGAT CTGGCTCTGC
AAAGGCATGA AGCCAGGATT GCAGGAGTTC TGGCCCGACC TGAAAAACTG GAACTAG
 
Protein sequence
MASPAQTAPL QNSSFTHSLL FPAWCVAAVI FILHIVTAPY YGYFRDELYF IACSDHLAAG 
YVDFAPLAAW ILKANRVIFG DSLYALRWLP GLAHAALVVL TGMLARELGG KRFAVLLSAI
AVGFTPVILC DSTRYSMNPF EPLFWMSALY LLIRVINGGD ERLLLGLGVA CGLGVENKHS
TIFFMAALLL ALALTPQRRF FASKWFWGAV GITIALALPN LIWQIQHDYP TYVDLHNVKV
MHKNVELPPL PWIKQQIVML NQALALLWIP AIGFLLWHRD GKKYRVVGLT FLFFFLELML
MKGKDYYVAP IYPVMFAAGC VLWETLSEVR FRWIRRTLAV VTVVASLVAV PIVVPILPPE
KANAYIRALA GDGQKTEVGM HSQLPQYFAD EFGWPELVEK TAQLYHSLPP EEQAKTAILG
GSYGDAGAID FFGAKYGLPK SISAHQNYWY WGPRDYTGES VIILHWRRSS VEKHCSSVVE
GPTLDHPWAM EEEHYTIWLC KGMKPGLQEF WPDLKNWN