Gene Acid345_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1575 
Symbol 
ID4069013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1923577 
End bp1924788 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content58% 
IMG OID637983584 
Producttype II secretion system protein 
Protein accessionYP_590651 
Protein GI94968603 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.377421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0567328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAT ACGTCGTAAA GCTGGCCGAC GAACGCGGCC GCATCCAGGA AAAAACCGAG 
AGCGCACACT CCGAGGCCGA GATCCGGGAC CGCTTTTCGC AGGCCGGATA CCTGGTGTAC
TCGGTCAAGG CGAGGGGTAC AGCCGTTGGC ATCCGCCTGC CGTTCCGCCG CAAGGTGAGC
GCGCAACAGT TCCTAATCTT CAACCAGCAA TTCCTGACAT TAGTTCGCGC CGGACTGCCG
ATTGTGCAGT CGATGGAACT CCTGATGCGC CGGCAGAAGA ACCAGTACTT TCAAAAGGTT
CTTGAAGACG TTCGGGATCG GCTGAAGGGC GGCTCGTTGT TGTCGGAGGC GTTCGAGGCG
CAGGGAATTT TCCCGAAGAT CTATACGACG ACTCTGCTCG CCGGCGAGAA GAGCGGCAAC
CTCGAAGAAG TAGTTGGCCG TTACATTGCG TTCCAGCGTC TACTGCTTTC ATTCCGTAAG
AAGCTGATTG CATCGCTGAT CTATCCATCC ATTCTCGTCT GCGGCGTGGT GGTGCTGTTC
TCCATGCTGA TTACGTGGGT GGTTCCTCGA TTCGCATTGT TATTCCAGGA TTTAGGTTCG
GACTTGCCGG CGATCACGAA GTTCGTTCTG GCGTTTGGTA ATAACGCGCA GACTTGGGCA
CCGTTCGTCC TGGTCGGCGC AATTGTGTTG GCGATCGTTT TTTTTCGTTG GAAGAAAACC
GAGTCCGGTT CGCTGATGTG GGACCGGTTC ATGATGTCGC TACCGATTTT CGGACAGATT
TGGCTGAAGT CGCAGGTGTC GACATTCTCG CGCATGTTGT CCACGTTGCT CGGCGGCGGC
CTGCCGTTAG TGCCGTCGTT AGAGACGGCG GCGGCTTCGA TTGGAAGCAA AACTTTGGCC
CGGGGGATTC GTACGGCGAG CAAGAGCGTG CGTGAGGGCA GATCGCTGGC CCGGAGCCTG
GAAGCGACGG CAGCGTTTCC GGATTTGTCG GTAGAGATGA TTGAAGTGGG CGAGTCCACG
GGCGCGTTGC CGCAAATGCT GGTGTCGGTG GCGGAGTTCT ACGAAGAAGA CGTGCAGAAC
GCGCTGGCGG CGGCGATGTC GCTGGTGGAG CCGGTAATCC TGATCATCAT GGGCATGGTC
GTGGGGTTCA TCCTGATAGC ACTCTATCTG CCGATTTTCA GCATCGGAAT GGGCGGGGCT
GCGGGACACT AG
 
Protein sequence
MAEYVVKLAD ERGRIQEKTE SAHSEAEIRD RFSQAGYLVY SVKARGTAVG IRLPFRRKVS 
AQQFLIFNQQ FLTLVRAGLP IVQSMELLMR RQKNQYFQKV LEDVRDRLKG GSLLSEAFEA
QGIFPKIYTT TLLAGEKSGN LEEVVGRYIA FQRLLLSFRK KLIASLIYPS ILVCGVVVLF
SMLITWVVPR FALLFQDLGS DLPAITKFVL AFGNNAQTWA PFVLVGAIVL AIVFFRWKKT
ESGSLMWDRF MMSLPIFGQI WLKSQVSTFS RMLSTLLGGG LPLVPSLETA AASIGSKTLA
RGIRTASKSV REGRSLARSL EATAAFPDLS VEMIEVGEST GALPQMLVSV AEFYEEDVQN
ALAAAMSLVE PVILIIMGMV VGFILIALYL PIFSIGMGGA AGH