Gene Acid345_4603 details

Gene Information

Locus tag: Acid345_4603
Symbol:
ID: 4070760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5450933
End bp: 5453101
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 60%
IMG OID: 637986643
Product: TPR repeat-containing protein
Protein accession: YP_593677
Protein GI: 94971629
COG category: [G] Carbohydrate transport and metabolism; [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase; [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
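
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5450933, 5453101)
print(length)                  # 2169
print(protein_length(length))  # 722
```

Both values agree with the Gene Length and Protein Length fields listed in this record.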


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.524488
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTGGTC TTTTCATGAA CCCAAAAGCT CTCTCGTCCA CTTTGGCCGT GCTGCTGTTG 
TCTGCCACGG CGAGCCTTGC ACTGCCGAAT GGGCCTACTT TGTTGGCTCA GCAGAAGCAG
GACGCGCCGA CGCCAACCGC TCCGGCCAAG CCGAATGCGG CCCAGCCAAC TTCGAATCCG
GAAGCCGACG CCAAGAAACC AGCCAATCGC GCTGACGCCT ACTACCACTA CACGATGGCG
CACATGTACG AGGAAATGGT GGCGACGTAC GGCCGCGCCG AGTACGCGAA CAAGGCGATT
GAGGAATACC GCGCGGCGAT CACCGCCGAT CCGTCGTCGG ACTATTTAAA CGCCGGACTG
GCCGATCTTT ACTGGCGCAC GGGCCGCATT CGCGACGCGG TGCTGGAAGC GCAGGAGATC
CTGAAGCGCG ATCCGAAGAA CGTGGATGCG CACCGCCTGC TCGGGCGTAT TTACCTGCGT
TCTTTGGGCG ACATGCAGAG CGGCAATAAC CAGTCTCGCG ACATGCAGCG GCTGGCGATT
GAGCAGTACG AAGAGATCGT GAAGCTCGAT CCGACGAGCG TGGAAGACCA TCTGCTGCTC
GGTCGCCTGT ATTCGTACAG CAACGACCTG ACCAAGGCCG AGAAGGAATT CAAGACCGCC
GTCCAGATCC AGCCGGATTC CGAAGAAGCT GTGACCATGC TGGCGTACCT CTATACGCAG
GAAGGCGACA CCAAGAAAGC GCAAGAGGTG CTGAGCAACA TTCCCGACGA CGATCGCAGC
GCGAAGCTGT ATTCGACTCT GGGCTACACC TACGAAGAGC AGAAGGATTA CAAGAAGGCG
ATCGAGGCAT ACCGCAAGGC GGTGATGCTC GATAAAGAGA ACCTCGACTC CGTCCGCGGG
CTGGCGCAGA ACCTGTTGAA TGACGGGCAG CTCGATGCCG CGCTGGAGCA GTACAAGATC
ATCGTTGATC AGGACCCGAG TGACGCTCAG AGCTACCAGC ACATCGCGGA GATCGACCGC
CGCAACGGCA AGTTCGAAGC TGCGCTCGAT GCGCTGAAGA AGGCCTCCGC ACTAGTGCAG
GATTCGCAGG AGATCCCGTA CAACATGGCG GTGATCTACG AGGGCCAGGG CCGCTACGAA
GACGCCATCA ATACCATCCA GCAGCTTCTT ACGAAGACCG ATAAGCCGGA TGCGTCCTAC
AGCTCGGCCG ATCGCAGCAA CCGTTCGATC TTCCTCGAGC GGCTGGGCAA CATCTATCGT
GAGGCGAACA AGCCGCAGCA GGCGGTGGAG ACCTTCCGGC GGATGATCGC GCTCGGTGAC
GATCCGGCCT CGCGTGCTTA CCAGGAGATG GTGGAGACCT ATCGCGATAA TCGCGATTGG
CCGTCAGCCA CAGCAGCGGC GCAGGAAGGC GCAAAGAAAC TTCCCAAAGA TCGCGGGCTG
CAACTAGTTC TCGCCGCGCA ATTGGCGGAT GAAGGTAAGG CCGACCAGGC GCTGAGTATT
GCGAAGTCGC AACTCAATGG CAAGGCCGCC GATGACCGCG AAGTGTACGT GTCTCTGGCG
CAGATGTACA CGCGGTTGAA GAAGTATCCC GAAGCGGAAG ACGCGATCGC GCAGGCGATG
AAGCTTGCAG GCACGCAGGA TGAACGGAAC TACGTCACGT TTGTGCAAGG CTCGATCTAC
GAGCGTGAGA AGAAATTCGA ACAAGCGGAA GAGGCCTTCC GTAAGGTCAT CAATGCCGAT
CCGAAGAACG CCGGCGCGCT GAACTACCTG GGTTATATGC TGGCCGACCG CGGCACGCGC
CTCGAAGAAG CGCTTGGCAT GCTGCGCAAG GCCGTGCAGA TGGAACCGCA GAACGGCGCG
TATCTCGACT CGCTGGGCTG GGCCTACTTC AAGATGGGCA ACTACGAGCA GGCGGAAGAG
AACCTGCGCA AAGCGTCCGA CAAGATCGGC AGCGATCCGA CGGTGCAGGA CCACCTTGGC
GATCTTTATC AGAAGACGGG GCGCCTGAAG CTGGCGGCCA CGCAGTGGGA ACGCGCGCTG
GACCAGTGGA ACCACTCAGT GCCGGCAGAA GTTGACGCCG ATGATGTGGC GAAGGTGCAG
AAGAAGCTGG AGTCGGCGAA GATCAAGCTG GCACAGCAGA CCTCGACGAC TTCTAACACG
AAGCAGTGA
 
Protein sequence
MAGLFMNPKA LSSTLAVLLL SATASLALPN GPTLLAQQKQ DAPTPTAPAK PNAAQPTSNP 
EADAKKPANR ADAYYHYTMA HMYEEMVATY GRAEYANKAI EEYRAAITAD PSSDYLNAGL
ADLYWRTGRI RDAVLEAQEI LKRDPKNVDA HRLLGRIYLR SLGDMQSGNN QSRDMQRLAI
EQYEEIVKLD PTSVEDHLLL GRLYSYSNDL TKAEKEFKTA VQIQPDSEEA VTMLAYLYTQ
EGDTKKAQEV LSNIPDDDRS AKLYSTLGYT YEEQKDYKKA IEAYRKAVML DKENLDSVRG
LAQNLLNDGQ LDAALEQYKI IVDQDPSDAQ SYQHIAEIDR RNGKFEAALD ALKKASALVQ
DSQEIPYNMA VIYEGQGRYE DAINTIQQLL TKTDKPDASY SSADRSNRSI FLERLGNIYR
EANKPQQAVE TFRRMIALGD DPASRAYQEM VETYRDNRDW PSATAAAQEG AKKLPKDRGL
QLVLAAQLAD EGKADQALSI AKSQLNGKAA DDREVYVSLA QMYTRLKKYP EAEDAIAQAM
KLAGTQDERN YVTFVQGSIY EREKKFEQAE EAFRKVINAD PKNAGALNYL GYMLADRGTR
LEEALGMLRK AVQMEPQNGA YLDSLGWAYF KMGNYEQAEE NLRKASDKIG SDPTVQDHLG
DLYQKTGRLK LAATQWERAL DQWNHSVPAE VDADDVAKVQ KKLESAKIKL AQQTSTTSNT
KQ
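
The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. A minimal sketch of that cleanup, together with a simple GC-content calculation, is shown here. The helper names are hypothetical, and the input is only the first displayed row of the gene sequence, so its GC fraction is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed row of the gene sequence, copied as shown on the page.
first_row = "ATGGCTGGTC TTTTCATGAA CCCAAAAGCT CTCTCGTCCA CTTTGGCCGT GCTGCTGTTG"
seq = clean_sequence(first_row)

print(len(seq))                  # 60
print(seq[:3])                   # ATG, the start codon
print(f"{gc_content(seq):.0%}")  # GC fraction of this row only
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` gives the figure that can be compared with the GC content field.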