Gene Acid345_0248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0248 
Symbol 
ID4073098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp260801 
End bp263581 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content57% 
IMG OID637982249 
Productsurface antigen (D15) 
Protein accessionYP_589327 
Protein GI94967279 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCTAC TGTTTTGCGG GATCGCACAA GCGCAAGAGG GAGTCATCGT CGACATCCGG 
GTCCACGGGA ACCGTCGCAT TCCCGCCGAC ACAGTTAAGT CCCGTATGTT CACGCACGCC
GGGGATGTAT ATGACCAGAG CTCCCTGGAA CGAGATTTCA ATGCTCTTTG GAACGCGGGG
TATTTCGATG ATCTGCGGCT TGAACGGGAA CAGACGGACA AGGGCTGGAT CATTCACGTT
TACGTCAAAG AGAAGCCGAC GATCCGTGAA ATCAAGTACG AAGGCCTGAA CTCCGTCACC
CAGAGCGACG TCCTCGATAA ATTCAAAGAA CGCAAGGTTG GTCTTTCCCA GGAAAGTCAA
TACGACCCGA CCCGTGTGAA GCGGGCGGAA GTGGTCTTGA AAGAACTGCT CGCGTCGCAC
GGCCGCCAGT TTGCGACCAT CCGCACGGAA GTCCGGCCGA TTCCGCCAGC GGCAGTTTCG
ATTACGTTCG TAGTGAAGGA AGGACCGAAG GTCAAGGTCG GCAAGATCAT CTTCGAGGGC
AACGCACACG TGAAAGCGCG CGAATTGCGC GCGGCGATGA AGAACCTGAA GCCGATTGGC
ATTCCGAAAT CGATCTTCCT GGAAAACCTG TTCGCCCGGA CCTTCGACTC GACCAAGCTC
GAAGAAGATG CCGAGCGCGT CCGCTACGAC TACCAGACGC GCGGCTACTT CAAGGCGATC
GTCGGCGATC CGAAGACCAA GATCCGCGAC GTGAGCGGCA TCAAGTGGTA CATGCCGTGG
AAGAAAACGG ACGGCAAAGT GGTGGACATC ACCATGCCGA TCGAGGAAGG CGATCGCTAC
AAGCTGAAGG AGATCACCTT CAGCGGCAAT AAGGCGATCA GCAACACCAG GGCGCTCCGC
GAAATCTTCA AGATGAAGGA TGGCGACTGG TTCGATGCGG AACTGGTGCG CAAAGGCCTC
GACGACTTGA AGAAGGCCTA CGGCGAGTTC GGCTACATCA ACGCCACTGC CGTGCCCGAT
ACGCAATTCG ATGATGTCAA CAAGAGCATC ACGCTGAAGG TTGATCTCGA CGAAGGTAAA
CAGTTCTCGG TGCGCCGTAT CGAGTTCGTC GGAAATACGA CGACACGCGA TAAGGTCATT
CGCCGCGAGT TGGCGCTCGA AGAAGGCGGC ATCTACAACA GTCGGTTGTG GGAGATGAGC
CTCCTGCGCC TGAACCAACT CCAGTATTTC GAGCCTTTGA AGGCGGAAAC CGACTCCGAA
ACCAAGCAGA ACAACCAGGA CAACACCATC GACCTGACGT TGAAGGTGAG GGAGAAGGGC
AAGAACTCCA TTGGTTTGAC GGGTGGCGTC AGCGGCCTGG CGGGATCGTT CATCGGCGTG
AATTACACGA CGAACAACCT GCTCGGCAAA GGCGAGACGC TTCAGCTTGA GGCCAACGTC
GGACAGTTTG AGCGCAACAT CCAGTTCGGC TTCACCGAGC CGTATGCGTT CGACCGTCCG
CTGCAATTGG GCGCGGTGGT GTTCAGCAGC AAGTACGACT ACAACTACGC CAAGCAGCTG
GCACTTTCCA CCGGCCAGCA GTTGAACCTG TCGCAGAGCG TGCAGGACAC CCTGCAGAAC
TACTCGCAAT CCACGACGGG CTTCACGCTT TCGTCGAGCT ACCCGCTGCA CCGCTCGTTC
AAGCGCGTAG GGCTCTCGTA CACGTTCAGC GATTCGTCGG TCCAGACCTT CTCGACGGCT
TCGACGCAAT ACTTCCAATA CTTGGCATTC CGCAGCGTTA CCGGTCCGAA CGCGCTCGAA
GGCATTCTCA CCAGCAAGGT GACGCCGAGC TTCACCTGGA ACCGCATTGA CAATCCGCAG
CGTCCACACC GCGGTAGCAG CTTCTTCCTG GCGGCGGACA TTTCGGGCCT TGGCGGCAAC
GTACAGATGA TTCGGCCGGT AACCGAGTAC AAGCGCTTCA TCCCGGTGAA CAAGGGACGC
AACGTGTTCG GCTTCCGTGT ACAGGGATCG TTTGTGACGG GCTACGGCGG CAACGTGGCG
CCACCGTTCG AACGCTTCTA CATGGGCGGT GAAAACGATC TGCGCGGCTT CGACATCCGC
TCGGTATCGC CGACGGCGTT CCTTACCGAC TTCACCTCGA TTGCCTTGAC GAATCCAGAC
GGCACGACAG TTCCAATCGA TCCGGCTCAT CCGAATAAGG GTGCATACAC GATTGCGATT
CCGGTGCAGC GCATTATCTA TCCGGGTGGC GACACCAGCG TAGTCACCAA CCTCGAGTAT
CGCGTACCGA TCGCCGGTCC GGTGACCATC GCAGCCTTCG TGGATACCGG CTGGGACATG
GTGCTGCGCA ACGACCAGCT TCGCATTAGC GATCAGCAGT ACAGCACGTT GACCAACACG
ATCTTTGGCT GCGTGTACAA CCCGCTGATC CCGCTAACCG TGGGCTGTAC CGGTGGTGGG
GATGCGAGCC ACTTCCTGAC AGGGCTCAGC CAGAACATCA CGCCGATCGA CAAGACCAAC
TACCAGACGC GTATGTCTAC GGGTTTGGAG TTGCAGGTCA TCATGCCGAT CGTGAATGCG
CCGTTCCGAA TTTATTACGC GTATAACCCG TTCCGCCTGG ATACGACGAC AACGACACCG
TCGCCGATCA CCCGTTCGAT GTTCCCGGAT GGCGCCGCGG GCGATTACAC GTACAAGCAG
GCAATCTCGT TGTATAACCC AACTTACGTG CTGCGCGAGC CTTTGAAGAC GTTCCGCTTC
ACGGTGGCTA CAACGTTCTA A
 
Protein sequence
MTLLFCGIAQ AQEGVIVDIR VHGNRRIPAD TVKSRMFTHA GDVYDQSSLE RDFNALWNAG 
YFDDLRLERE QTDKGWIIHV YVKEKPTIRE IKYEGLNSVT QSDVLDKFKE RKVGLSQESQ
YDPTRVKRAE VVLKELLASH GRQFATIRTE VRPIPPAAVS ITFVVKEGPK VKVGKIIFEG
NAHVKARELR AAMKNLKPIG IPKSIFLENL FARTFDSTKL EEDAERVRYD YQTRGYFKAI
VGDPKTKIRD VSGIKWYMPW KKTDGKVVDI TMPIEEGDRY KLKEITFSGN KAISNTRALR
EIFKMKDGDW FDAELVRKGL DDLKKAYGEF GYINATAVPD TQFDDVNKSI TLKVDLDEGK
QFSVRRIEFV GNTTTRDKVI RRELALEEGG IYNSRLWEMS LLRLNQLQYF EPLKAETDSE
TKQNNQDNTI DLTLKVREKG KNSIGLTGGV SGLAGSFIGV NYTTNNLLGK GETLQLEANV
GQFERNIQFG FTEPYAFDRP LQLGAVVFSS KYDYNYAKQL ALSTGQQLNL SQSVQDTLQN
YSQSTTGFTL SSSYPLHRSF KRVGLSYTFS DSSVQTFSTA STQYFQYLAF RSVTGPNALE
GILTSKVTPS FTWNRIDNPQ RPHRGSSFFL AADISGLGGN VQMIRPVTEY KRFIPVNKGR
NVFGFRVQGS FVTGYGGNVA PPFERFYMGG ENDLRGFDIR SVSPTAFLTD FTSIALTNPD
GTTVPIDPAH PNKGAYTIAI PVQRIIYPGG DTSVVTNLEY RVPIAGPVTI AAFVDTGWDM
VLRNDQLRIS DQQYSTLTNT IFGCVYNPLI PLTVGCTGGG DASHFLTGLS QNITPIDKTN
YQTRMSTGLE LQVIMPIVNA PFRIYYAYNP FRLDTTTTTP SPITRSMFPD GAAGDYTYKQ
AISLYNPTYV LREPLKTFRF TVATTF