Gene Acid345_1392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1392 
Symbol 
ID4068927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1689374 
End bp1691089 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content57% 
IMG OID637983401 
Producttype II secretion system protein E 
Protein accessionYP_590468 
Protein GI94968420 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.025124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAGA GGTTGGGCGA TCTTCTTGTT CGCGAGAAGG TCATCACCGC CGAGCAGTTG 
GAACAAGCAC TCAGGGAACA AGGCTCCAGC GGCACGCGTC TTGGTGCGGC CCTGGTGAAG
CTTGGTTTTC TCTCGGACGA TGACGTCACC AACTTCCTCT CCCGTCAGTA TGGCGTACCC
GCCATCAACC TCAACTATTT CGAGATCGAT CCTTCCGTCG TAAAACTCAT TCCTTACGAC
ACTGCGAAGC GTTACCAGAT CCTTCCCTTG AGCCGCGTCG GCGCCTCGCT GACCATCGCC
ATGGTGGATC CCACCAACGT CTTCGCAATG GACGACATCA AGTTCATGAC CGGCTTCAAC
ATTGAGCCGG TGGTTGCCAG CGAAAGCGCG ATCCTCGAAG GCATCGAGAA GGCCTACAAC
ACCGCTCCGG AAGAAGATCT TGAGTCGGTG ATGGCCTCGA TGGGTGAGGG TGAGGCCTCC
GATATTGAAG TCCAGGCGGA CATGGAAGAG GCTGACTCCG CCGACCTCGA GCGCGCCGCC
GAAGAAGCTC CGATCGTCAA GCTGGTGAAC ATGATCCTCA CGGAAGCCGT GAAGAAGGGC
GCCAGCGACA TCCACATGGA GCCCTACGAA AAGGAATATC GCGTACGCTT CCGGATTGAC
GGCATTCTCC AGACGATGAT GAATCCGCCG ATGAAACTTC GCGACGCGAT CATCTCGCGC
GTGAAGATCA TGGCAAAGCT CGACATCAGC GAAAAGCGCC TGCCGCAAGA CGGCCGCATC
ATGCTGAAGA TGAACCTCCA GGGAAAGAAG AAAGTGCTCG ACTATCGCGT CAGCACCCTG
CCTACCCTGT GGGGCGAAAA AGTCGTTCTC CGACTGCTCG ACAAAGAGAG CCTGCGTCTC
GACATGACCA AGCTCGGCAT GGAGCAGGAA TCGCTCGACA AGTTCACCAA AGCTATCTTC
AAGCCGTACG GGATGGTGCT GGTCACCGGT CCCACGGGAT CCGGTAAGAC GAACACGCTG
TACTCCTCGA TTTCGCAGCT CAACAAGCCC GACACCAACA TCATGACCGC TGAAGATCCG
GTCGAGTTCC AGTTGCACGG TGTGAACCAG GTGCAGATGA AGGAACAGAT CGGCTTGAAC
TTCGCGGCGG CCTTGCGCTC CTTCCTGCGT CAGGACCCCA ACATCATTCT CGTCGGTGAG
ATCCGCGACT TTGAAACCGC GGAAATTGCG ATCAAGGCCG CATTGACCGG CCACTTGGTT
TTGTCGACGC TGCACACCAA CGGCGCGCCC GAAACCATCA GCCGCTTGAT GAACATGGGT
ATCGAACCAT TTCTTGTCGC GACTTCAGTG CACCTGATTG CTGCGCAGCG CTTGATCCGC
CGCATTTGCA GCAACTGCGC CGAAGTCCTC GACCTGCCGC CGCAAGCGTT GATCGAAGCC
GGCTATTCGC CGGCCGAGTC CAAGACGGTG AAGATCAGCA AGGGCCGCGG TTGCAGCAAC
TGCAACAACA CGGGATATAA GGGCCGTACC GGCCTTTATG AAGTAATGGA GATTGACGAC
GAAATCCGGG AATTGATCCT GGTCGGCGCT TCGGCGCTGG AGTTGAAGAA GAAAGCGATC
GAGAAAGGCA TGATCACGCT GCGTCGCAGC GGCTTGATCA AAGTTTCACT GGGGATCACG
ACGTTGGAAG AAGTCGCACG TGAAACCGTG CACTAA
 
Protein sequence
MSQRLGDLLV REKVITAEQL EQALREQGSS GTRLGAALVK LGFLSDDDVT NFLSRQYGVP 
AINLNYFEID PSVVKLIPYD TAKRYQILPL SRVGASLTIA MVDPTNVFAM DDIKFMTGFN
IEPVVASESA ILEGIEKAYN TAPEEDLESV MASMGEGEAS DIEVQADMEE ADSADLERAA
EEAPIVKLVN MILTEAVKKG ASDIHMEPYE KEYRVRFRID GILQTMMNPP MKLRDAIISR
VKIMAKLDIS EKRLPQDGRI MLKMNLQGKK KVLDYRVSTL PTLWGEKVVL RLLDKESLRL
DMTKLGMEQE SLDKFTKAIF KPYGMVLVTG PTGSGKTNTL YSSISQLNKP DTNIMTAEDP
VEFQLHGVNQ VQMKEQIGLN FAAALRSFLR QDPNIILVGE IRDFETAEIA IKAALTGHLV
LSTLHTNGAP ETISRLMNMG IEPFLVATSV HLIAAQRLIR RICSNCAEVL DLPPQALIEA
GYSPAESKTV KISKGRGCSN CNNTGYKGRT GLYEVMEIDD EIRELILVGA SALELKKKAI
EKGMITLRRS GLIKVSLGIT TLEEVARETV H