Gene Acid345_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1390 
Symbol 
ID4068925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1686925 
End bp1688139 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content59% 
IMG OID637983399 
Producttype II secretion system protein 
Protein accessionYP_590466 
Protein GI94968418 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.387943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0225351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTAT TTACATTCAC TGGCAAGAAC GCCACCGGCG AGAAAGTCAC CGGCGAGCGT 
GTCGCCGAAA ACAAGCAGGC CCTGGCCAGC AATCTGCGGC GTGAACGAAT CCAGCCCGTC
ACCATCAAGG AGAAGGGCAA AGAATTCGTC ATGCCGACAT TTGGCGGCGG CAGCGTCAAG
ACCAAGGACA TCGCCATTTT CTTCCGGCAG TTCTCGGTCA TGATTGATGC CGGCCTCCCG
CTGGTGCAGT GTCTTGAGAT TCTCGCGGGC AACCAGGAAT CTCAAGCCTT CCAGAAGGCG
CTTAACGGCG TCCGGACAAC TGTGGAAGGC GGCTCGACCC TGGGCAACGC CATGCGTGGC
TACCCCAAGA TTTTCGACGA CCTTATGGTC AACATGGTGG ACGCCGGCGA AACCGGCGGT
ATTCTCGACA CCATTCTTCA GCGTCTCGCG ACCTATGTAG AAAAGGCCGT GAAACTGAAG
GCGGCCGTCC GCTCGGCGTT GATCTACCCG GTCTCGGTCA TCACGATTGC GGTTTTGATC
GTCGGCCTGC TGCTGTGGAA GGTCGTCCCG ATTTTCGCCA ACCTCTTCGT TGGCCTCGGT
GCTCCCCTTC CCCTGCCTAC GCGAATCGTC ATCGGCATCA GTAACTTCCT CGGAAGTTTC
TGGTGGATGG TGCCGATCAT GGTAGCTGCC GTGTTCTTCG GAGTCCGTGC ATTGCGCTCC
GACCCGCGTG GCCGCTACTT GACCGACAAT TTTCTGCTCC ACATTCCGAT TATCGGCATG
CTGCTGCGTA AGATCGCCGT CGCCCGCTTC ACCCGTACCC TGGGCACGCT GATCACCTCC
GGCGTTCCGA TTCTCGAAGG CTTGAACATC ACCGCCCGCA CCTCCGGTAA CCGCGTGGTG
GAAGAAGCGC TCTACAAGGT CCGCAAGTCG ATCGAAGAAG GCCGCACCAT CGTCGATCCG
CTTCGCGAAT CCGCTGTCTT CCCCAACATG GTTACGCAGA TGATCGGCGT CGGTGAGGCC
ACCGGTGCAA TGGATGCCAT GCTCCAGAAG ATCGCGGACT TCTACGAAGA CGAAGTGGAC
GCCGCGACCA AGGACTTGCT GACGTTGCTC GAACCCATCA TGATCGTGTT GCTCGGCATC
ATGATCGGCG GCGTAGTCGT TTCGTTGTAC CTGCCGCTCT TCTCGATGGT GGCGAAGCTC
TCCGGCGGCG GTTAA
 
Protein sequence
MPVFTFTGKN ATGEKVTGER VAENKQALAS NLRRERIQPV TIKEKGKEFV MPTFGGGSVK 
TKDIAIFFRQ FSVMIDAGLP LVQCLEILAG NQESQAFQKA LNGVRTTVEG GSTLGNAMRG
YPKIFDDLMV NMVDAGETGG ILDTILQRLA TYVEKAVKLK AAVRSALIYP VSVITIAVLI
VGLLLWKVVP IFANLFVGLG APLPLPTRIV IGISNFLGSF WWMVPIMVAA VFFGVRALRS
DPRGRYLTDN FLLHIPIIGM LLRKIAVARF TRTLGTLITS GVPILEGLNI TARTSGNRVV
EEALYKVRKS IEEGRTIVDP LRESAVFPNM VTQMIGVGEA TGAMDAMLQK IADFYEDEVD
AATKDLLTLL EPIMIVLLGI MIGGVVVSLY LPLFSMVAKL SGGG