Gene Acid345_2476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2476 
Symbol 
ID4072100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2930163 
End bp2933171 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content59% 
IMG OID637984493 
Productsurface antigen (D15) 
Protein accessionYP_591551 
Protein GI94969503 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.738372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGTGC GATTGCGTTT CTGGTGCAGA CACGGAAGCT GGCGACGAGG TGCGTTTGCG 
TGGCTACTGG CCATCGCGCT GGCTTGCGCC TGTTCTGCCC GCCTTGTCGC CCAGGACATC
CCTGCTTCCA CCAACGGGCC AGCCGCAAAA CGTATCGCTG AAATCCGGTT CCGGGGCGCG
AGCCTAGCCA GCACCGACCC TCTTACCGAG TACCTGACCG TGAAGGTGGG CGATCCCTTT
ACCCGCGCCG CGGCGAGCGC AAGTATTAAG GCGCTTTTCG CTACCGGCCT CTTCTCCGAC
ATCTCCGCTG AAACCGATCC CGCTCCCAAC GGCGACGTCG TTCTCACCTT CGCCTTGCAA
TACCGCTATT TCATTGGTGA TGTGAACCTC AATGGCCGCC CGAAAGGTGC GCCAAGCCTG
CGCCAGCTCC TCAACGCCAC TAAGCTCGAA CTCGGACACG CCCTCACCGA CACCGGGATT
AAGCAGGCCA TCACGCAAAT GACCGGCGTC ATGGAAGACA ACGGCTACTA CGAGTCGAGT
TTCACTTACA CGCTGAAGAA GCATGAAGAC AGCCGGCAGG CGGAAGTATT TTTCCATCTC
GCTCCCGGCC CGCTCGCGCG TGTCGGCAAA ATCGAGGTCC ACGGCGAGTC GGGATTCACA
CAGGAGGAAG TCGAGTCCAT CACCAAGATC AAGCCCGGCG CCAAGGTCAA GGCGTCCGAC
GCCACCCGCG CCCTCGAGCG CCTCCGCAAG AAGTATCAGA AGCGCGACCT CCTCGAAGCC
CAGGTCACGC TCGCGCGACA GTCCTACAAC CACGACAGCG ACACCGTGGA TTTCATCTTC
ACGGTGCAAC GCGGGCCGGT GGTGAAGATT GACGTTGAAG GCGCGAAACT CAGCCGCGGT
AAGATCAAGC GCTATGTTCC TGTGTACGAA GAGAACGCAG CCGACGACGA TCTCCTGAAC
GAAGGAACCC GCAACCTCCG CGACTACTAT CAGTCCGAGG GTTATTTCGA CGTCAAGGTC
AACTACTCGC GAAACCGCAC CCTCGATAAC CAGAAACTCG ACATCGTTTA CAACGTTGAC
GCTGGTGAAC GCCACAGCCT CCAGTCAGTG GACGTCCAAG GCAACAAGTA CTTCCCGAAA
GATACGATCC GGGAGCGCCT TAGCGTTCAG ACCGCGACCA TGCTGCTGAC GCACGGCAAG
TTCAGCCAGG CCATGCTGGC GCGCGATGTT GCCGCCATCA CCGCCCTCTA TAAGACCAAC
GGTTTTCAAG ATGTATCCGT AAAAGCCGAC GTCGAGGACA ACTATCGCGG TAAAAGTGGT
GACCTTCGCA TCGTCTTCCG TATCGACGAA GGCGAACAGT CGCGCGTCCA CACTCTCACC
GTAATCGGCA ATCTCGCGAT TCCAACCGCC GAGTTCCAGC CGCAGCTCTC ACTCGACGAA
GGCCAGCCTT ATTCCGAGTA CGCCGTCGCA GCCGATCGCG ACGCGATCAT CAGTTACTAC
TTCAATCGCG GCTTTCCCAA CATGGACATG AAGATCACCA CGCTCCCTTA CAGCGGCGAT
CCTCACTCCA TGGACCTGAC TTATGAAATT CACGAGGGCA CCCGCGTCTT CGTAGACCGC
GTTTATGTTT ACGGCTTGCA CTACACGCGT CCCGGAGTTG TTGCCAAGCG CATGCACGTG
CATGACGGCG ACCCCCTCAG CCAGCTGGAT ATGCTCGATA CCCAACGTCG CCTCTACGAC
CTGGGAATCT TCAGCGAAGC CAACGTCGCC ATCCAGGACC CCGACGGTAC CGCCCAGCGC
AGGAACGTAA TCTTCCAGCT CGACGAGGCG CGCCGCTGGA CATTCAACTA CGGCGTCGGC
TTCGAGTTCG CCACCGGCAG CAGCCAGGGA TCGTCCAACA CCCCGAACGG CACCACCGGA
TGGAGTCCAC GTTTCTCGTT CGAACTGACG CGTCTCAACG TCTTTGGACG CGACCATACC
TTCGTCATCA AAGCTCGCTA CGGAAAACTC GAACAGCGCG GGCTCGTCAG CTACACCGCG
CCGCGCCTAT TCGCAAAGGA AAACTGGCGC CTCTCTCTGA CTGGTTTCTA TGACAAATCC
GCCGACGTAC TGACCTTCAC CTCGGAGCGC GCCGAAGGCT CCATCCAGGC CGAGCAGGTC
ATCAGTAAGA CATGGACGAT GCTTTACCGA TACAGCTATC GTCGCGTCAA CGTAGATCCC
ACCACGCTGC AAATCGATCC CGCACTCATT CCCCTGTATT CGCAACCGAC GCGCATCGGC
ATGCCGGGAG TCACCGTCAT CTACGATCGG CGCGACGACC CAATCGACGC CCACAAAGGC
ATGTACACCA CCGCCGATAT CGGTATTGCT TCCACCAGGC TCGGTTCTGA AGAGGACTTC
AGTCGCATCC TCGTACAGAA CTCCAGCTAC TACCAGTTCG GTGCGAAACA CTGGGTCTTC
GCGCGGTCCT TGCGTATTGG TCTCGAGTCG CCATACCAGA ATTCCACTCT CGTCCCGCTG
CCGGAACGCT TCTACGCAGG TGGCGGCAAC TCTCTCCGCG GGTACTCCAT TAACCAGGCC
GGCCCGCGCG ATCAGTTCAC CGGATATCCC ATCGGAGGCA ACGCGCTTTT CGTGAACAGC
CTTGAGTTGC GCATGCCACC ACCAACCTTG CCCTTCGTAG ACGACAACCT CAGCTTTGTC
TTCTTTCACG ATATGGGCAA CGTCTTCGAC ACCGTCTCCC ACATGTGGAC TGGCCTCGGG
CGGTTGCATC AGCCAACCAT TGCGGCCTGC TCGCAGAAGC CTGCCGACGG CTCAAACCCG
CCGCCCTGCG ACTACGGTTA TCTTGCGCAA GCGGTCGGAC TCGGGATTCG CTACCATACG
CCGGTTGGGC CCGTCCGATT CGACATCGGC TACGCCATCA ATCCGACGCG CTACCCGATC
CTGAACGACA ACTCGACATC GTCCACCACG CGGGTCAACG TCTTCTTTAG CATCGGTCAA
ACCTTCTGA
 
Protein sequence
MLVRLRFWCR HGSWRRGAFA WLLAIALACA CSARLVAQDI PASTNGPAAK RIAEIRFRGA 
SLASTDPLTE YLTVKVGDPF TRAAASASIK ALFATGLFSD ISAETDPAPN GDVVLTFALQ
YRYFIGDVNL NGRPKGAPSL RQLLNATKLE LGHALTDTGI KQAITQMTGV MEDNGYYESS
FTYTLKKHED SRQAEVFFHL APGPLARVGK IEVHGESGFT QEEVESITKI KPGAKVKASD
ATRALERLRK KYQKRDLLEA QVTLARQSYN HDSDTVDFIF TVQRGPVVKI DVEGAKLSRG
KIKRYVPVYE ENAADDDLLN EGTRNLRDYY QSEGYFDVKV NYSRNRTLDN QKLDIVYNVD
AGERHSLQSV DVQGNKYFPK DTIRERLSVQ TATMLLTHGK FSQAMLARDV AAITALYKTN
GFQDVSVKAD VEDNYRGKSG DLRIVFRIDE GEQSRVHTLT VIGNLAIPTA EFQPQLSLDE
GQPYSEYAVA ADRDAIISYY FNRGFPNMDM KITTLPYSGD PHSMDLTYEI HEGTRVFVDR
VYVYGLHYTR PGVVAKRMHV HDGDPLSQLD MLDTQRRLYD LGIFSEANVA IQDPDGTAQR
RNVIFQLDEA RRWTFNYGVG FEFATGSSQG SSNTPNGTTG WSPRFSFELT RLNVFGRDHT
FVIKARYGKL EQRGLVSYTA PRLFAKENWR LSLTGFYDKS ADVLTFTSER AEGSIQAEQV
ISKTWTMLYR YSYRRVNVDP TTLQIDPALI PLYSQPTRIG MPGVTVIYDR RDDPIDAHKG
MYTTADIGIA STRLGSEEDF SRILVQNSSY YQFGAKHWVF ARSLRIGLES PYQNSTLVPL
PERFYAGGGN SLRGYSINQA GPRDQFTGYP IGGNALFVNS LELRMPPPTL PFVDDNLSFV
FFHDMGNVFD TVSHMWTGLG RLHQPTIAAC SQKPADGSNP PPCDYGYLAQ AVGLGIRYHT
PVGPVRFDIG YAINPTRYPI LNDNSTSSTT RVNVFFSIGQ TF