Gene Acid345_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1220 
Symbol 
ID4068560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1504889 
End bp1506718 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content57% 
IMG OID637983229 
ProductTPR repeat-containing protein 
Protein accessionYP_590296 
Protein GI94968248 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00113386 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00622178 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACCACA AGCGTCAACG AACAGATTTC AACTTTTCTC ACGAATTCTG TCCGGCGAGC 
ACCTGCAAAC CCCATGGTAT CATCCGCCTC ATCCAGCAAA TGTCTTACGT ACCGGCCCCC
GACAACCGCC AACCCGGCTT GCTAAGCGGG TCCCGCGCCA TTTTGGCACT CATTCTCGGA
GTCACTGCTC TCGTCTACGC CGGTACCCTG CAATTTGGCT TTACCTACGA CGACACCCCA
CAAATCGTAA CCAATCCACG CATTTCCTCA TGGAGTTACC TGCCCAAGTA CTTCACGGAG
CACGTCTGGG CGCAGATTTC TGCCTCCGGA ACATACTACC GGCCCTTATT TTTGTTGTGG
TTACGTCTGA ATCACGCGTT ATTCGGAGTC CAGAACCCGT TTCCGTGGCA CTTGACGACC
GTGCTTTTGC ATCTTCTCGC GGTGACGTTG GTTTTCCGGC TCCTGATCAA GACATTCAGC
CTCGAAGTCG CCGGAATCGC GACTTTCGTG TTCGCGCTGC ACCCCGGACA CGTTGAATCC
GTCGCGTGGA TCTCCGGCTG TACTGAGCCG CTCATGACCT GTGCGCTCGT CGGCGCAATC
CTCTGCTGGA CGAATCGCAG AAATTCTCGG GGTGCGCTTT GGCTCGCGGC CTCATGGCTG
CTCTGTCTCG CCAGCCTGTT GATCAAAGAA ACCGCAGTGC TGTTGCCGGT CCTGATCTTT
GTCTATTCAC TTTTCGAAGA CCGTGAATCG CCGCGCTTCG ACGCTTACAA GCGCGCAGTC
CTGAACACGC TGCCCTTCGC GGTAATCACG GTCGCGGAAT TGATGGTTCG CGCTCGCGTG
CTTCGCGGCG GCGTCGCCGA CGAACCGCAT CCGGCGATGC AAACGCTGCT GACAGTTCCA
TCAGCGATTT CTTTTTACAT TCAACATCTG TTCTGGCCCG TGAAATTGAG CCCTTTCTAC
GGCCTCGAGC TGATGCAAAA ATTCAGCGCC GTTGTCGTGA TTCCGGCGGC TCTCGTCGCT
CTCTGCGCCG TCGCACTTCT TCTTGTCCTC GCATTTCGCT CGCGCACATT GCTGATCGCC
GTCGCATGGC TTGTTTTACC TGTGCTGCCT GCGCTCATTG GCATCCGCCT CTTTGATAGC
AACGATATCG TTCACGACCG CTATCTCTAT CTCTCGACGA TCGGACTAGG ACTTCTTCTT
GGACTTGCGA TCTCCCGGCT TCCGGCGGCG GGCACAGAGA TTTTCCGCCT CCCACGCGCG
CAATTTGCGT GCATCGCGCT TATCGCGATC GCAATTGCTG CTGGAACCGC GCTGGAAATT
CGCCCCTGGA GCAACAATCT CGCCTTGTTC CTCCGTGGGG TCGACGTCGC ACCCAACAGC
ACGCCGGCCT ACAGTCATCT CGCCTTCGAG GTTTACAAGC GCGGCGATGC CGCGGATGCC
GAGCGCCTCT ACAAACACGC CGTGGCGCTT GGCCCCAACG ACTGGCCCGC GAATTTTGGC
TTGGCTATGA TCGAAATGCG CATGTCGAAC TGGAACGAGG CCGACCGATT TTTCCAACGC
GCAATCGAGA TCAGCCCTTC GGTGAGCAAT GGAAGCTATC TTCTGCAAGC ACGGGTCCGG
GTCGAAATGC AGCATTACGA TGCCGCCGAA AAAAGCGTGC GGGAAGCGAT CGACAACTGG
CCGAACATCG CGAGCCAGCA TTTATTATTG GCTCAGATCC TGACAAAGCA AGGGCGCATT
GAGGAAGCCC GCTCTGAATA TCAGAAGGAG TTGACGCTTA ATCCCACCTC CACCGAAGCA
CGGATGGGAC TCGCGGAAAT TGGGCAGTGA
 
Protein sequence
MNHKRQRTDF NFSHEFCPAS TCKPHGIIRL IQQMSYVPAP DNRQPGLLSG SRAILALILG 
VTALVYAGTL QFGFTYDDTP QIVTNPRISS WSYLPKYFTE HVWAQISASG TYYRPLFLLW
LRLNHALFGV QNPFPWHLTT VLLHLLAVTL VFRLLIKTFS LEVAGIATFV FALHPGHVES
VAWISGCTEP LMTCALVGAI LCWTNRRNSR GALWLAASWL LCLASLLIKE TAVLLPVLIF
VYSLFEDRES PRFDAYKRAV LNTLPFAVIT VAELMVRARV LRGGVADEPH PAMQTLLTVP
SAISFYIQHL FWPVKLSPFY GLELMQKFSA VVVIPAALVA LCAVALLLVL AFRSRTLLIA
VAWLVLPVLP ALIGIRLFDS NDIVHDRYLY LSTIGLGLLL GLAISRLPAA GTEIFRLPRA
QFACIALIAI AIAAGTALEI RPWSNNLALF LRGVDVAPNS TPAYSHLAFE VYKRGDAADA
ERLYKHAVAL GPNDWPANFG LAMIEMRMSN WNEADRFFQR AIEISPSVSN GSYLLQARVR
VEMQHYDAAE KSVREAIDNW PNIASQHLLL AQILTKQGRI EEARSEYQKE LTLNPTSTEA
RMGLAEIGQ