Gene Acid345_1576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1576 
Symbol 
ID4069014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1924798 
End bp1926438 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content58% 
IMG OID637983585 
Producttype II secretion system protein E 
Protein accessionYP_590652 
Protein GI94968604 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.22471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0615675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA ACACAGCAAT TTTCGTCAAC AACGCGACCA CCGCAGCCAA GGAAGAAGAG 
CGCGGACGCG ATCTCGCGCG GCGCTATCGC TGCGAGTTCG TGGACCTGAA GTCGTTTCGG
ATGCACCAGG ACCTGTTCCG CAAGATTCCA GTGGAACTGA TGTTTCGCTA TAACTTCATC
CCTCTGGAAG AACTGCCGGA CGGCCGCCTC GAAATCGCGA TCGACGATCC CAGCCGGCTG
ATGATGATTG ACGAAGTCGG TTTGCTGTTG CGGCGCGAGA TCGTGACCAA GGTTTCGACG
CTCTCCCAGA TCACCGACAT CCTGAAGAAG ACGGAGCAGT CGCAGCGCGT TCTGGAAGAG
GCGAGCGAAG ACTTCGCCGT CCACGTTATT CGCGACGACG ACGAGTCCGA CGAAACCATC
TCGATCGAGA AGTTGACGGC GGAAGGGGAC ATGAGCCCCA TCATCCGCCT GGTGGACACG
ACCATCTTTA CCGCGTTGCA ACGGCGTGCG TCCGATATTC ATATCGAGAC GCAAGATGAA
TCCGTGATCA TTAAGTACCG TATTGACGGC GTGCTACAGA AGGCGATGCA ACCGATCGCG
AAGGAACACC ACTCGACGAT CATCTCGCGT ATCAAGGTCA TGAGCGAGTT GGATATAGCC
GAGCGTCGTG TACCGCAGGA CGGGCGCTTC CGCGTACGCT ACCTGGGCCG CCAGATTGAT
TTCCGCGTTT CCATCATGCC GTCTATCCAC GGCGAAGACG CGGTGCTCCG TGTGCTCGAC
AAAGAGAGCA TGAGCGAGAA GTTCCACAAG CTGACGCTTG ATGTGGTCGG GTTCAGCGAG
GACCACATCA AGACGTTTCG CCGGTACATC AACGAGCCGT ACGGCATGGT GCTGGTGACG
GGGCCTACCG GTTCCGGCAA GACGACGACC CTTTACGCTG CGCTGAATGA AATCAAGACG
GAAGAAGACA AGCTGATCAC GATTGAAGAT CCGGTCGAAT ACCAGATCCG CGGCGTTACG
CAGATTCCGG TGAACGAGAA AAAGGGTCTG ACTTTCGCTC GCGGCCTGCG TTCGATTCTG
CGTCACGATC CGGACAAGAT CATGGTCGGC GAAATCCGCG ACACCGAGAC GGCACAAATC
GCGATTCAGT CCGCGCTGAC CGGTCACCTT GTGTTCACGA CAGTCCACGC GAACAACGTG
GTGGACGTAC TGGGGCGGTT CCTGAACATG GGCGTGGAGG CCTACAACTT TGTGTCGGCA
CTGAATTGCA TCCTGGCGCA GCGGTTGGTG CGCGTCATCT GCGACCACTG CAAGCGCAAG
GTGCGCTACG ACCTCGAAAC CCTGGAGAAC AGCGGTCTCA ACCCGGCAGA GTGGGGAGAC
TTTGAATTCA GCGAGGGCCC GGGCTGTATC GAGTGCGCCG GCACCGGGTT CCGCGGCCGA
ACGGCGATCC ATGAACTACT TGAACTGACC GATCGGATTC GCGAAATGAT TCTCGACAAG
AAGCCGAGCT CGGAGATCCG CAAGGCGGCG CGCGAGGACG GCATGATTTT CCTGCGCGAG
TCGGCGCTGG CGAAGCTGCG CGATGGGATC ACGACGCTAC GCGAAATCAA TAAGGTCACG
TTCATCGAGG CCTCGAGATA A
 
Protein sequence
MAENTAIFVN NATTAAKEEE RGRDLARRYR CEFVDLKSFR MHQDLFRKIP VELMFRYNFI 
PLEELPDGRL EIAIDDPSRL MMIDEVGLLL RREIVTKVST LSQITDILKK TEQSQRVLEE
ASEDFAVHVI RDDDESDETI SIEKLTAEGD MSPIIRLVDT TIFTALQRRA SDIHIETQDE
SVIIKYRIDG VLQKAMQPIA KEHHSTIISR IKVMSELDIA ERRVPQDGRF RVRYLGRQID
FRVSIMPSIH GEDAVLRVLD KESMSEKFHK LTLDVVGFSE DHIKTFRRYI NEPYGMVLVT
GPTGSGKTTT LYAALNEIKT EEDKLITIED PVEYQIRGVT QIPVNEKKGL TFARGLRSIL
RHDPDKIMVG EIRDTETAQI AIQSALTGHL VFTTVHANNV VDVLGRFLNM GVEAYNFVSA
LNCILAQRLV RVICDHCKRK VRYDLETLEN SGLNPAEWGD FEFSEGPGCI ECAGTGFRGR
TAIHELLELT DRIREMILDK KPSSEIRKAA REDGMIFLRE SALAKLRDGI TTLREINKVT
FIEASR