Gene Acid345_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2804 
Symbol 
ID4071807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3318764 
End bp3321781 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content56% 
IMG OID637984822 
Productsurface antigen (D15) 
Protein accessionYP_591879 
Protein GI94969831 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGAC GTGTCGCTGG TGCAGTGCTG TTCCTTGCTT GCTTGTGTGC GACGAGAATG 
TGGGCACAGG GATCGGCTGA CCCCGGCACC CAGATCATGC AGTCGTATGA GGGCCAGAAT
GTTTCCGTGG TAGAGATCGC AGGCCAGCCT GACATTGATA CCAGCAAATA TGAGTCCGTG
CTGAAACAGC ATCAGGGACA GCCGTTCTCG ATGGAAAAGG TCGCTGCAAC AGTCGAGGCT
TTGAAAAAGA CCGGCGACTT TAAGGACGTG ATTCTCGACC TGAGGCCGGA GACCTCAGGT
GTGCACGTGA TGTTCATTGC GCAGCCGGCG TATTACATCG CTCTCTATGA CTTCCCCGGT
GCGCTAAAGA ACTTTCCTTA TTCCCGGCTT ATCCAGGTGG CGAACTACCA GTCACAGGAA
CCGTATTCAA AATTAGATAT TGAGACAGCG CAGAAATCGC TGGAGAAATT CTTCCACCAG
GTTGGGTTCT TTGAGGCCAC GGTACAGCCC GAAATCCGCT TGAATAAAGA GCATGGAATC
GTCAACGTTG ATTTCACTAC GACGCTGAAA CGTCACGCCA AATTTGGAGA AGTTAAAATC
GAGGGAGCCA CGCTACAAGA CACTGCCTTT CTCCAAGGCA AAGTTCGCGG ATTCATGGCG
CGCACGCGCG GGGCACAGAT CAGACCGGGC AAGCCGTATT CAAGCCGAAA GATCCAGCTT
GCAACGAACT ATTTGCAAGG AGCGCTGGCG AAACAACAAC ATCTTGGGGC GGATGTGAAG
TTTGTTACGG CAGAATATGA TCCGGCAACC AACGTCGCGA ATGTCGTCTT CCACGTGAAG
GTCGGGCCAA AAGTTGAGGT GGACATTGTC GGAGCGCATT TGTGGCCGTG GACGAGGAAG
AAGCTCATCC CGATCTATAT TGAGGGTTCT ATTGATGAGG ACCTGATCGA AGAGGGTGAA
CGGAATCTGC ACTCGTATTT CCAATCAAAG GGCTTTTACG ACGCGGTCGT AAAAACGGAT
GTGAAGCAGA ACGCGGAACT CACGACGATC AGTTACACCA TCACCAAGAA TGAAAAGCAT
AAAGTGGAAC GCCTCGATGT TGAGGGAAAC AATTCGATCT CCTCAAAAGA CCTGCTGAAC
AACTCCCAAG TGGAGAAGGC GCACTTCTAC AGCCATGGAA AATTCAGCGA GGATCTGGTG
CGGAAGTCCT CCGCCAGCCT GCGCGCTGTG TATCTGAACG CCGGTTACAG CAAGGCGAAG
GTCACGCCAC GGGTGACGCG CGATCGCGGC AACATCGTGG TGACGTACGT GGTGGAAGAG
GGACCACGGG ATTACGTTGC CGAGCTGCAC ATTGTTGGTA ACGATACGGT GCCGATGGAG
CAGCTTTCCC CCAAGGGACT ACAGCTTGGC GTGAACCGTC CATACTCGCC GCTCTTTGAG
CAGCAGGACC GCAACAATAT CGCCGCGCAT TACCTGACGA ACGGCTATCT CACGGCGGGT
GTGACTTCGA AAGCCGTACC GGTGAGCAAG TCCGATCCGC ATCATTTGAT CGTGACCTAC
AAAATTCATG AGGGTCCGAA GGTCACGACG GCACGGATCA TTACGGTAGG AAAACAGCAG
ACAAAGCAGG AGATCGTGGA CCGTGCTTCT GTCGTGAAGG TCGGCGTGCC GCTCAGCGAA
GCAGACCTGC TTTCTTCCGA GAGCCGCTTG TATGCGATGG GAATCTTCGA CTGGGCGCAG
GTGGACCCGA AACGCGGAAT CACGAGCCAG AACCAGGAAG AGGTACTGAT CAAAGTCCAC
GAGACCAAGC GGAACACGAT TACGTACGGC TTCGGATTTG AGGTGATCAA CCGCGGCGGC
AGCGTGCCGG GAGGAACGGT GAGCGTTCCG GGCATTCCGC CGGTTGGCCT GCCGCAGGGG
TTCCGGACCA GCGAATCTAC GTTCTGGGGT CCGCGGGGAA CGTTTTCCTA TACGCGTCGC
AACGTGCGCG GCCTCGCTGA GAGTTACACG TTGGGGGCGT TTGCCGGACG CCTGGACCAG
CGAGTGTTCG GCAACTACAC GATTCCCTAC CTCCTCGCGT CGAGTTGGAG CGGAGCGTTC
CAAGTCTCGG GCGAACACGA TGCGACAAAC CCAGTCTTCA GCGCCCTCAA TGGTGCAGCG
GGGTTCCAGG TCCAACGCTA CCTCGACGCC AAGCAAACGA AGCAACTCTT CTTTCGATAC
AAGTTTCAGT ACACCGACTT GAGCAACATC TTCCCGGGCT TCGAAATCCT GGTTCCGGAG
GAAGACCGCC GGGTGCGGCT CTCGACACTA TCCACTTCAT TTGTGCGCGA CACGCGCGAC
AACATACTGG ACGCGCACAA GGGCACATAT GGCACGGTAG ACCTCGGAAT TACACCGCAG
GCGCTGGGAT CGAGCGAAAC GTTCGCGCGC TTCCTTGGTC AATTCGCATT TTACAAACAG
ATTCCGCACG GAATTATCTG GGCCAACAGC TTCCGCTTGG GGATAGAGAC GCCGTTTGGC
GGCAGTCATG TTCCGACGAG CGAACTATTC TTCAGCGGCG GCGGCAGCAC ACTGCGCGGC
TTCCCGCTGA ACGGTGCTGG TCCGCAGCAG TACACGACGG TATGCGGCGA CCCGAACGAC
ACGTCCACCT GCGGCCCGAT TACGGTGCCG ACGGGCGGTA AGCAGCTGAT CATCGTGAAT
TCGGAACTGC GGATTCCGCT CAACCAGCTC TACAAGGGGC TGGGAATCGT GCCCTTTTAT
GACGGCGGAA ACGTCTATAA GCACGTAGGG TTCAGTAGTT TTTCTACCAA CTGCAATGCG
GCCGCTACGA CTAGCACCGG CAGCAATGGA CAAACCGTTA CGCTGGTGGA ACCCTCGTGT
TTCACCAGTT CGATTGGATT GGGAGTGCGC TACAACACGC CGATTGGACC GGTACGACTG
GATGTTGGCC ACAACTTGAA CCACATAACT GGAATCAAAT CGACCCAGGT ATTCATCACC
TTGGGACAGG CATTCTGA
 
Protein sequence
MRGRVAGAVL FLACLCATRM WAQGSADPGT QIMQSYEGQN VSVVEIAGQP DIDTSKYESV 
LKQHQGQPFS MEKVAATVEA LKKTGDFKDV ILDLRPETSG VHVMFIAQPA YYIALYDFPG
ALKNFPYSRL IQVANYQSQE PYSKLDIETA QKSLEKFFHQ VGFFEATVQP EIRLNKEHGI
VNVDFTTTLK RHAKFGEVKI EGATLQDTAF LQGKVRGFMA RTRGAQIRPG KPYSSRKIQL
ATNYLQGALA KQQHLGADVK FVTAEYDPAT NVANVVFHVK VGPKVEVDIV GAHLWPWTRK
KLIPIYIEGS IDEDLIEEGE RNLHSYFQSK GFYDAVVKTD VKQNAELTTI SYTITKNEKH
KVERLDVEGN NSISSKDLLN NSQVEKAHFY SHGKFSEDLV RKSSASLRAV YLNAGYSKAK
VTPRVTRDRG NIVVTYVVEE GPRDYVAELH IVGNDTVPME QLSPKGLQLG VNRPYSPLFE
QQDRNNIAAH YLTNGYLTAG VTSKAVPVSK SDPHHLIVTY KIHEGPKVTT ARIITVGKQQ
TKQEIVDRAS VVKVGVPLSE ADLLSSESRL YAMGIFDWAQ VDPKRGITSQ NQEEVLIKVH
ETKRNTITYG FGFEVINRGG SVPGGTVSVP GIPPVGLPQG FRTSESTFWG PRGTFSYTRR
NVRGLAESYT LGAFAGRLDQ RVFGNYTIPY LLASSWSGAF QVSGEHDATN PVFSALNGAA
GFQVQRYLDA KQTKQLFFRY KFQYTDLSNI FPGFEILVPE EDRRVRLSTL STSFVRDTRD
NILDAHKGTY GTVDLGITPQ ALGSSETFAR FLGQFAFYKQ IPHGIIWANS FRLGIETPFG
GSHVPTSELF FSGGGSTLRG FPLNGAGPQQ YTTVCGDPND TSTCGPITVP TGGKQLIIVN
SELRIPLNQL YKGLGIVPFY DGGNVYKHVG FSSFSTNCNA AATTSTGSNG QTVTLVEPSC
FTSSIGLGVR YNTPIGPVRL DVGHNLNHIT GIKSTQVFIT LGQAF