Gene Acid345_4601 details


Gene Information

Locus tag: Acid345_4601
Symbol:
ID: 4071546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5448410
End bp: 5449606
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 55%
IMG OID: 637986641
Product: TPR repeat-containing protein
Protein accession: YP_593675
Protein GI: 94971627
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
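
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions about how this record is encoded, not statements taken from it. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5448410
end_bp = 5449606

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1197, matching "Gene Length"
print(protein_length)  # 398, matching "Protein Length"
```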


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.458402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTGA TCGGGGCTGT GTGCTTGTTG TCGTTGTGCG CGAGCGTTTT TGCGCAGGAC 
CTTTCGGACA AGGCGGAGTA CGACAAGCTG AAAGCGCACG CGACGGAGCT CTTCAATCAG
AACAATTTCC TCGCAGCTTT GCCGGAGCTC CAAAAACTCG CAGACCAGAA CCCGAAAGAT
TATGCAGTGC TGGAGGCGCT AGGTTTTGCG CTCGCCAGCA AAGCGCTTCT GGAAACCGAT
GCCGACCAGC GTAAGGCCGA CCGCATTGCT GCGCGCAAGC ACCTGCTGGA GGCCAAAAAA
CTCGGCGATA ACAGCGAGAT GATCAACTAC CTGCTAGAAA CGACCCCGGA AGACGGCACC
CCGCGAAAGT TCTCCGACAA CAAAGAGATC GAACGGCTGA TGCAAACCGC CGAAGCGCAT
TTTGCGAAGG GAGAACTCAA CGAGGCAAAG GCCGGATATC TCCAGGTGCT GCTGCTCGAT
CCCGAGAATT ATGCAGCGGC GTTGTTCACT GGAGATGTGT ATTTCAAGGA TGGCAAGTAC
TGCAGCTCCA TCCAGTGGTT CCAGAAAGCG ATTGAGATAG ACGCCAACAC CGAAACCGCC
TACCGATACT GGGGCGATGC ACTCGACCAC CTGGGCCAGA AAGACGAAGC GCGACGAAAG
TTTATGGAGG CGGTGATCGC CGACCCGTAC AACAATCGTC CATGGCAACA CTTGTACCAG
TGGATGAAAA CGCAGGGCCA CGAACTGACG GTTCCCAAGA TACAACCGCA GGCCTCGGTG
AACGTGGAAT CGGACAAGAA AATCAATATT ACGGTGAACT CAGGTAGCGT CGAGAAGCAC
GATGGCAGCG CTGCGTGGAT GACATATGGA ATCGGCCGCG CGGCTTGGCA AGGTGAGAGG
TTCAAGAAGG AATTTCCGAA CGAGCCGAAG TATCGCCACA CGCTGCGCGA GGAGAATCAT
GCACTCTCGC TCGTCGTAAG CTCGGTGAAG AGTCAAAAAG ACATCAAACA GCTTGACCCG
CAGCTCGCAA CACTGGTGAA GATATCCGAC GCCGGACTGC TCGAGCCGTA CATCCTGCTC
AATGCGGCAG ACCAAGGCAT TGCCCAAGAC TACGCGCCGT ATCGCAAGGA ACACCGCGAT
CTGCTCTACA AATATCTCGA TACGATTGTT GTCCCGCAGT TGAAGCCGGG GCTCTAG
 
Protein sequence
MRLIGAVCLL SLCASVFAQD LSDKAEYDKL KAHATELFNQ NNFLAALPEL QKLADQNPKD 
YAVLEALGFA LASKALLETD ADQRKADRIA ARKHLLEAKK LGDNSEMINY LLETTPEDGT
PRKFSDNKEI ERLMQTAEAH FAKGELNEAK AGYLQVLLLD PENYAAALFT GDVYFKDGKY
CSSIQWFQKA IEIDANTETA YRYWGDALDH LGQKDEARRK FMEAVIADPY NNRPWQHLYQ
WMKTQGHELT VPKIQPQASV NVESDKKINI TVNSGSVEKH DGSAAWMTYG IGRAAWQGER
FKKEFPNEPK YRHTLREENH ALSLVVSSVK SQKDIKQLDP QLATLVKISD AGLLEPYILL
NAADQGIAQD YAPYRKEHRD LLYKYLDTIV VPQLKPGL
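
The relationship between the gene sequence and the protein sequence listed above can be reproduced with a small translation routine. The following is a minimal sketch, not the tool the database itself uses. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies
# slowest, third base fastest). Assumption: table 11 shares these
# codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence, copied from the record.
print(translate("ATGCGCTTGA TCGGGGCTGT G"))  # prints MRLIGAV
# Last four codons of the gene sequence, ending in the TAG stop codon.
print(translate("CCGGGGCTCTAG"))             # prints PGL
```

Passing the full gene sequence to `translate` should give the protein sequence shown above once the grouping spaces are removed from both, and `gc_fraction` on the same input can be compared with the GC content field.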