Gene Acid345_0423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0423 
Symbol 
ID4069649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp490961 
End bp494449 
Gene Length3489 bp 
Protein Length1162 aa 
Translation table11 
GC content57% 
IMG OID637982427 
Producthypothetical protein 
Protein accessionYP_589502 
Protein GI94967454 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000804756 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0470711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGGC TCAACAAAGT TCGCCTGGTG GGGTGTTTGC TGTTCCTCTC CCTGCTAGCG 
GCAAATTACT TAGAAGCGCA ACAGGTGACT GCTGCAATTA CAGGCACAGT TACGGACCCG
GCCGGTGCGG CGATCAATGG CGCGACAGTT ACAGCCAGAG ACGTAGAGCG CGGCACGGTG
ACAACCGTCA AGACAAATGA TTCCGGCGTC TTCAACTTCC CGCGCGTAGC GATCGGTACA
TATGAAGTGC GCACCGAAGC CAGCGGCTTC GAAATCGCAG TGCAACCGCC GTTCACGCTG
GTGCTGAACC AAACCGCGCG TCTAACCTTC CAAATGAAGA TTGGTAAGAC GACGGAAACC
ATGGAGGTGA GTGCGGAAGC GCCCCAACTC CAGACCGACA CCACCCAGGT CAGTACGCTG
ATCGACGCGA AGACGAACGA CAGTCTCCCG CTTGCCACGC GCAACTTCAT CCAGTTGACG
CTGCTATCCC CGGGCGCTCT GTCGGTAGAT CCGCAGAGCA TGAACACGGG ATCGAACGTC
GCGGAAGAAG GCGGCCGTCC GTACATCAAC GGCAACCGCG AGCAGGCGAA CAACTTCCTG
CTGGATGGAA TTGACAACAA TCAATCGTCC GAAAACCTAG CGGGCTTCAC ACCGTCACCG
GATGCGATCG CGGAGTTCAA CGTAATTACC CAAAACGCGC CGGCAGAATT CGGAAACTTC
AACGGTGGTA TCGTCAGCGC CACCATCAAG TCTGGAACGA ATTCGTTCCA CGGAAATGTG
TTCGAGTTTT TCCGCAACGA CATTTTCAAC GCAAACAAGT GGGAAAATGG ATTGCATAAG
GGAGATCCCG CCTATTTCAA CGCGGACGGT TTCGACAGTA ACGGTGTAGC CTTTACCCCG
AAAGTGCGTT GGAATATGTT CGGGGTAACC TTCGGCGGAC CCATAATCAA GAACAAGTTG
TTTTTCTTCG TTGATTATCA GGGCGGACGC CTGGATCACC CGTCAACCGC AGGTACGTTT
GGCGTCTTGA CGCCGGCTCA AATCGGCGGC GATTTCTCTA GCCTGCTAAG CCTTTCGACC
CCGGTTCAAC TCTACAATCC TTGCGCAGCG GGAACCGGCG TTTCCGGAAA TCCCTGCCAG
TTAGTCCCGG TGGCAAGTCG CCAGCCATTC GCAGGGAACA TCATCCCATC GAACATGCTG
GACCCGACCT TCGCCGCGCT CACCACCAAC AGCCTCTATC CGAAGTCGAT CGCGTCGGAT
CCGACCTCTG GATTCGGGTT GGCCTCAAAT ATCACGGGGC AACAGTACAA CACCGACCAA
GGCGACCTGC GACTCGACTA CAACCTAAGC CAGAAAGATC ATCTTTTTGC GCGCATGTCG
AAGGGATACC AAACCGATCC TTCGACAAAC AGTATTCTGT TGCTCGGCGA CACCTTGAAC
CAGGCGTGGC TCAACAACTT CGCCTTCAAC TGGGACCACA ATTTCTCTCC CAGTCTCCTG
AATGAAGTGC GTTTCGGCTT GAACTGGGTG AAGTTCACGA ACGGAGCCCA CACCTTCGAT
AGCTCCGTTG GTCAACTCGG TAACACTATT GGCATTGAGA ATGGAAATCC GGGCGGGATC
GATGGCCTCC CCGCGATGTC GTTCGGCGGT GGCGGGATTA CGAACCCAGG CGTCGGTTCA
ATCCCGACCA TCGGCTCGGC CAATGTAGTT GAAAACTTCG CGTCGACGGT AACGCAGTTC
GACGACGTTC TGGAGTACAC CCACGGCCGT CACGTGATCA AGGGCGGCTT CCAGATGAAC
AACTATCGAA TCAATGTGTT CTACAGCGGA AACGGCGGCG AACTCGGTCA ATTGCTGTAT
GGAACGACGT ACAGCTCGAG CCTGGATGCC GGCGGTACGC CGGTCGGCGG AAACGGCGTC
GCCGATTGGG CGTTAGGTCT CCCAGAACTT GTCGGCCGTG GAACCAGCAC TGGAGGTTGG
CATCAGCGCG ATTGGCTCTA TGCCGGCTTT ATCCAGGATG ACTGGAGAAT CACGGACTCC
TTGACGCTGA ACCTTGGTCT GCGTTACGAA GCCCGAACTC CCTGGACCGA ACTCAATGAT
CGGCAGGTGA ACGTCAACGT CGCCTCTGGC GCGTTGGAAT ACGCTGGCAA TACCCCGGTC
GTAGGGGTGG GATCGAACGG TTTCAGCGAG GGCCTATACG ACTCTTCCTA CGGCCTTTCC
GCCTTCCAGC CCCGCTTCGG CTTTGCCTAT TCACCGAGAT CCATGGACGG AAAGTTTGTT
GTCCGCGGTG CGTTCTCGAT TTCTTCCTAT CTTGAAGGAA CGGGGACCAA TCTGCGCCTG
ACGCAAAACC CCCCCTTTAC TCCGGCGCAG GTGGAAGCCA ACAATGCCAC CACCGGAATG
CCCTACACTT CTGCAACGGG ATTCACGACG GCAGCACCTC CGGGAGGCGA TCCCTTCCAG
AACGCGACGA TGTTGGCGTG GTCTGGAACA GTTCAGCCAG CGGTAGCCAA GCAGTGGAAT
TTGACCGTGC AGCAGGAACT AGCGAAGAAC CTCACGCTGC AACTTGGCTA CGTCGGCCAG
GCAACACAGC ACTTGATGGT TCCCGAATGG TTGGTACAGG GCGTCCTGAA CGGGGATGGC
TCCGTAACGC CCAGCGCTTT CGCAGGCGGT ACGAATGCTG ACGGCACCCT CGGACCCAAC
CACTTCGGCA ACGTAAAGGA CACGGCTTCC AACGGCAGCA TGAACTACAA CGCGCTGCAG
GCGGTGCTGC AACAACGCTT CAATCACGGT CTCGATTATC AGATTTCGTA TACCTACAGC
AAGTGCATGA CGAACAACGA CGGGTACTAC GGAACCTGGG GAGCGAACAC CGAAACTACT
CCGGCTGCAA ACTACTGGCA AAACCTCTAC GACCCGCAAG CCGACTACGC GCAGTGCTAT
TGGGACGCTA AGCACGTGAT CAGTGCCTAC GCGACTTACG AACTGCCATT CGGCAAGGGT
AAACAGTTCG GCGGCAACAT GAATCCGGTC CTCAACGCGG TTGTTGGCAA CTGGCAGATT
GCTCCGATCG TCTCGTGGCA CACCGGTTTC CCGATCGCGC TTTATGGCCC AGACAACTCT
GGAACCAATT CACCGGCTGC TCGTCCAGAT TGCAACGGGC CGGTGCAATA CATCCAGCAC
ACGGTGGATG GCGGCTATCA ATGGGTGAGC CCGAGTGCGT TCTCGGCAGC TCAGCCTGGA
ACCTTTGGCA ATTGTCCCGC CCAGGGGCCA GTAGTCGGGC CACACTACAC CGATGCCGAC
ATCAGCATGC AGAAGAACTT CCCGATCACG GAGCGCTACC GCTTGCAGTT TAGGGCAGAC
TTCCTGAATG CCTTCAACCA TCCGAATTTC GCGCACCCCG ACAACACCGT GGGCGACACG
ACGTTCGGAC TCATTACCGG AACTCAAGAC GCGAGACAGA TCCAGTTCGC GTTGAAGTTC
TACTTCTAA
 
Protein sequence
MSWLNKVRLV GCLLFLSLLA ANYLEAQQVT AAITGTVTDP AGAAINGATV TARDVERGTV 
TTVKTNDSGV FNFPRVAIGT YEVRTEASGF EIAVQPPFTL VLNQTARLTF QMKIGKTTET
MEVSAEAPQL QTDTTQVSTL IDAKTNDSLP LATRNFIQLT LLSPGALSVD PQSMNTGSNV
AEEGGRPYIN GNREQANNFL LDGIDNNQSS ENLAGFTPSP DAIAEFNVIT QNAPAEFGNF
NGGIVSATIK SGTNSFHGNV FEFFRNDIFN ANKWENGLHK GDPAYFNADG FDSNGVAFTP
KVRWNMFGVT FGGPIIKNKL FFFVDYQGGR LDHPSTAGTF GVLTPAQIGG DFSSLLSLST
PVQLYNPCAA GTGVSGNPCQ LVPVASRQPF AGNIIPSNML DPTFAALTTN SLYPKSIASD
PTSGFGLASN ITGQQYNTDQ GDLRLDYNLS QKDHLFARMS KGYQTDPSTN SILLLGDTLN
QAWLNNFAFN WDHNFSPSLL NEVRFGLNWV KFTNGAHTFD SSVGQLGNTI GIENGNPGGI
DGLPAMSFGG GGITNPGVGS IPTIGSANVV ENFASTVTQF DDVLEYTHGR HVIKGGFQMN
NYRINVFYSG NGGELGQLLY GTTYSSSLDA GGTPVGGNGV ADWALGLPEL VGRGTSTGGW
HQRDWLYAGF IQDDWRITDS LTLNLGLRYE ARTPWTELND RQVNVNVASG ALEYAGNTPV
VGVGSNGFSE GLYDSSYGLS AFQPRFGFAY SPRSMDGKFV VRGAFSISSY LEGTGTNLRL
TQNPPFTPAQ VEANNATTGM PYTSATGFTT AAPPGGDPFQ NATMLAWSGT VQPAVAKQWN
LTVQQELAKN LTLQLGYVGQ ATQHLMVPEW LVQGVLNGDG SVTPSAFAGG TNADGTLGPN
HFGNVKDTAS NGSMNYNALQ AVLQQRFNHG LDYQISYTYS KCMTNNDGYY GTWGANTETT
PAANYWQNLY DPQADYAQCY WDAKHVISAY ATYELPFGKG KQFGGNMNPV LNAVVGNWQI
APIVSWHTGF PIALYGPDNS GTNSPAARPD CNGPVQYIQH TVDGGYQWVS PSAFSAAQPG
TFGNCPAQGP VVGPHYTDAD ISMQKNFPIT ERYRLQFRAD FLNAFNHPNF AHPDNTVGDT
TFGLITGTQD ARQIQFALKF YF