Gene Acid345_4645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4645 
Symbol 
ID4070802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5503522 
End bp5504571 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content58% 
IMG OID637986685 
ProductTPR repeat-containing protein 
Protein accessionYP_593719 
Protein GI94971671 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.656867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGC AAGCACTAAT CCTATTCTTC GTCGCGCTCG TGCTGGGGAT GGTCCCGGCT 
GCGATGGCGC AAACCGGTAC CGTGAAAGGC TATGTCAAAG ACAAGGGCAC GCCGATCGTG
GGCGCGCAGG TCTTGTTCGA GAATCTTGAC AACGGCCGCA AGATGACCCT GAAGACCGAC
AAGGCCGGCA ACTTCTTCAG CATCGGCGTC GCCATCGGTA GCTACAAGAT CACCATCACC
GCCGACGGCA AAACCATCTG GAACACTGCG AAGTATCCGG TCGGCGGCGG CGACGGCAAT
CCCGAGTTGA ACATCGACTT GGAGAAGGAA CGCGCCGCAC AGGCAACCGC CAATCCGGCC
AATGCGGAAG CGGTGAAGAA GGCGGAAGAG AACAAGAAAG AGAATGAGAA GATCGGCAAC
CTCAACACCA TGTTGAAGGA AGCCCAAGCC GATATGCAGG CCAAGAACTT TGACGCGGCG
ATCCAGATCA TGGAGAAGGC GACCGCGCAA GACGCAACCC ATGACATCAT CTGGGCCGTT
CTCGCCGATG CGTATCTCGG CGCGAAAAGA TACCCTGACG CGGTGAAAGC CTACGAAAAG
GCGATCGCGC TCGATCCCAG CAAAGCGCCC GTGCATAACA ACTATGCGCA GGCACTTGCC
AAGACAGGAC AGTCGGACAA GGCCATCGCC GAGTACGATG CGGCTGCCAA GCTCGATCCA
GCCCATGCCG GCTCGTTCTA CTTCAATGAA GGCGCCGTCT TGACCAATGC TGGAAAGACC
GACGACGCCA ACGCGGCCTT CGATAAAGCG ATCGCTGCCG ATCCCACCAA GGCAGACGCC
TATTACCAGA AGGGCGTGAA CCTGATGGGC AAGGCGACGC AGAAGGACGG GAAGTATGTT
GCGGCGCCGG GCACCGTCGA GGCCTTCAAC AAGTACCTCG AACTGTCCCC TGACGGACCG
AACGCTCAGA ACGCGAAAGA TATGATCGCG GCTCTTGGCG GCACAGTCGT CACCGGCTAC
AAGGCCGAAA AGGGCAAGAA GAGCAAGTAG
 
Protein sequence
MRKQALILFF VALVLGMVPA AMAQTGTVKG YVKDKGTPIV GAQVLFENLD NGRKMTLKTD 
KAGNFFSIGV AIGSYKITIT ADGKTIWNTA KYPVGGGDGN PELNIDLEKE RAAQATANPA
NAEAVKKAEE NKKENEKIGN LNTMLKEAQA DMQAKNFDAA IQIMEKATAQ DATHDIIWAV
LADAYLGAKR YPDAVKAYEK AIALDPSKAP VHNNYAQALA KTGQSDKAIA EYDAAAKLDP
AHAGSFYFNE GAVLTNAGKT DDANAAFDKA IAADPTKADA YYQKGVNLMG KATQKDGKYV
AAPGTVEAFN KYLELSPDGP NAQNAKDMIA ALGGTVVTGY KAEKGKKSK