Gene Acid345_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1473 
Symbol 
ID4069623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1780498 
End bp1782924 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content62% 
IMG OID637983482 
Producttype II and III secretion system protein 
Protein accessionYP_590549 
Protein GI94968501 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID[TIGR02515] type IV pilus secretin (or competence protein) PilQ 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.401194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.578419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTGA AGCAACTACT GGGTGTTTCC GTAACGCTGC TCGCGCTGGC TGGCGCTGCA 
GCGGCCGCCG GGTCGCAGTT GACCAACGTA AACGTCGCGT CGGAGGGAAC TTCGACTGTC
GTTACGCTGC ACACGACCGG CGCCTTCACG CATAACGAAT ATCGACCTGC AGACAATTTG
ATGCTGGTCG ATTTGACGGG TGTTTCCGCC GGACAACTCC AGGAGCGCAT GCGCAACCTG
GATGCTGCCA GCGTGAAGTC GTACCGCGTG CTGACTTATA CGGGCACCAG CGGAACGGAA
GTCACCCGGG TTGAATTGGC TCTTGTGCCG GGAGCCGCTG TTGAGGTCGA TAAGAAAGAC
GCCGACCTCA CATTAAAGAT CTCGGGCGGG GAAGTAGCGG CTGCGGCTCC GGTGAGGGCA
CCGGCAGCAG CGCCCGTTGC TCCTGCGCCT GTTGCTACGG CAACCGCAGC TCCCGCAGCG
AACACCACGC CGGTCATGAT CCGGCAGGTG AACGTGACGC GCGGCGCGAA TGGCATGGAA
GTCGCCATCT CGCCACGCAC AGCGGCTGCT CCGATTACGC AGACGCTGTC GGGCCCCGAC
CGTCTCGTGA TCGATCTGCC AAACGCGATC CCGGCCGTAC GCACCAAGCA GATCGCCGTC
AACAGCTCTG ACATTAAGGG CGTCCGCATC TCGCGTTACC AGGAGAATCC TCCGGTTACG
CGCATTGTTG TGGACATGAC CAGCGCGCAT GATTTCCAGC TGGTACCCGG CGAAAAGGAA
CTGGTCGTAA AGCTGACGCC CTCGATGGCG AAGGCTGCTC CAGCTCCAGT GGTCGAGAGC
AAGCCCGCAG CGACGGAAGT CGCAAAGGCT GACGCTCCGG CTGCGATCCC GGCTGCCGCA
CCTGCCGCTA CCGACACCAA GCCCACCGCC ACACCGGCGC CGAGCTTCGT GATGGTTGAC
GCGCAGAACC CGGTCAACGT TCCGAAACCT GCTCCGGCCG TTCGCTCGTT GGAAGCCGCC
TCTGTGATGG CTACGACCAT CGCCAAGGAC AAGGTTGACA ACGTTCTTCC GCCGACAGCA
TCGGAAGCAC CAGCCGGCGC CGTGAACCTG GCGCAGGAAC AACAGACGCA ACAGCACAGC
GCTCAGCCGG CGTCGGGTCC CCGCTACACG GGCGAACCGA TCTCGGTGAA CCTGAAGGAT
GTCGACCTGA AGGACTTCTT CCGCCTGATC CACGAAATCA GCGGCTTGAA CGTCGTTCTC
GACCCCAGCG TCAAAGGCAC CGTAACCCTG GTGCTCGACG ACGTACCGTG GGATCAGGCC
CTCGACATCG TGTTGCGGAA CAACGGCCTT GACCGTCAGC TCGACGGCAA CGTCCTTCGC
ATCGCCTCAA TCGAAACGCT GCGCACGGAA GCCGTTGCCC GCCGCCAGCA GCTCGAAGCG
CAGGCCCTCG CGGTGGACAA GGTCACCGTC ACCCGTTTCC TGAGCTACGC TCGCTCGGCC
GATCTCGTCC CGACACTGAA GAAGTTCCTC AGCGCTCGCG GCGACATCAT TGCCGACGGC
CGTATGAATG CTCTCGTGAT CTCGGATATT CCGGGCGTCA TCCCGAGCTT GGACCGCTTG
ATCTCGCAAC TTGACCGCAA GACGCAGGAA GTTGAAATCG AAGCCCGCGT CGTGGCGGCA
ACCCGCACCT TCGTACGTGA GCTTGGTATT CAGCTCGGCG GCGGCTGGGG CGGCGCCTCC
ACATCGGTCA GCGGCAACCC GAACGCTGGT TTCGGTAGTA CGAACTTCAA GAACCCGGCC
GGACAGCCGA TTACCGGGGC TCCGTTCCCG ATTTCCACGA CCGGCGCTTT CCCGCTCTTC
ACCAACTTCC AGGTGTTGAA CCCGACAGCC GGCCTTGAAC TCATCAACAT CGGCCACTCC
TATGGCCTCG ACCTCTACCT CAGCGCTGCT GAACAACGCG GCCTGTTGAA GATCCTGTCG
AGACCGCGTG TCGTGACCCA GAACAACGTG ACTGCTCTCG TCCGCCAAGG TTTCCGTCTG
CCGGTCGTTA CGCTGTCGCA GTTGAACGGT CCTCCGACCG TCACCTACGT GGACGCTTTC
CTCCGCCTGA CGGTTACGCC CCAGATCACA GTTGAGAACA CCATCTTCCT GGCAGTCGAC
GTGGAAAACA CCACGCCTGA CTTCACCCGG ACGGTACTCG GCAACCCGAT CCTGTTGACG
CAACAGACCA CCACCCAGGT CCTCGTGACC GACGGCGGCA CGGTCACCAT CGGCGGCGTG
GTTCAGACGC AGAACACCTT GACGAGCCAG GAAGTTCCGA TCCTCGGCGA CATCCCGATC
CTCAAGTACT TGTTCATGCA CAAGACCACC AACACCCAGA CCCAGGAACT GATCTTCTTC
ATCACTCCGA AGATCGTTCA AACATAG
 
Protein sequence
MRLKQLLGVS VTLLALAGAA AAAGSQLTNV NVASEGTSTV VTLHTTGAFT HNEYRPADNL 
MLVDLTGVSA GQLQERMRNL DAASVKSYRV LTYTGTSGTE VTRVELALVP GAAVEVDKKD
ADLTLKISGG EVAAAAPVRA PAAAPVAPAP VATATAAPAA NTTPVMIRQV NVTRGANGME
VAISPRTAAA PITQTLSGPD RLVIDLPNAI PAVRTKQIAV NSSDIKGVRI SRYQENPPVT
RIVVDMTSAH DFQLVPGEKE LVVKLTPSMA KAAPAPVVES KPAATEVAKA DAPAAIPAAA
PAATDTKPTA TPAPSFVMVD AQNPVNVPKP APAVRSLEAA SVMATTIAKD KVDNVLPPTA
SEAPAGAVNL AQEQQTQQHS AQPASGPRYT GEPISVNLKD VDLKDFFRLI HEISGLNVVL
DPSVKGTVTL VLDDVPWDQA LDIVLRNNGL DRQLDGNVLR IASIETLRTE AVARRQQLEA
QALAVDKVTV TRFLSYARSA DLVPTLKKFL SARGDIIADG RMNALVISDI PGVIPSLDRL
ISQLDRKTQE VEIEARVVAA TRTFVRELGI QLGGGWGGAS TSVSGNPNAG FGSTNFKNPA
GQPITGAPFP ISTTGAFPLF TNFQVLNPTA GLELINIGHS YGLDLYLSAA EQRGLLKILS
RPRVVTQNNV TALVRQGFRL PVVTLSQLNG PPTVTYVDAF LRLTVTPQIT VENTIFLAVD
VENTTPDFTR TVLGNPILLT QQTTTQVLVT DGGTVTIGGV VQTQNTLTSQ EVPILGDIPI
LKYLFMHKTT NTQTQELIFF ITPKIVQT