Gene Acid345_3831 details


Gene Information

Locus tag: Acid345_3831
Symbol:
ID: 4071115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4529575
End bp: 4533000
Gene Length: 3426 bp
Protein Length: 1141 aa
Translation table: 11
GC content: 58%
IMG OID: 637985854
Product: hypothetical protein
Protein accession: YP_592905
Protein GI: 94970857
COG category:
COG ID:
TIGRFAM ID:
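
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(4529575, 4533000)
print(gene_len, protein_len)  # prints: 3426 1141
```

The result matches the Gene Length (3426 bp) and Protein Length (1141 aa) fields of this record.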


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0885568
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.451604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTCT TCGGGCAACG TATGGTGCTG CATATCGCTT GTGTGTCGTT GGTACTCATC 
GGGCTCGTGA ACACAGTTCC GGCCCAAACC GCTTCCACAG GCGCCATCGC CGGTACGGTG
ACGGATCCGG CTGGAGCCGT CATCCCGAAC GCCACGGTCA CGGCGACCGA CGCCCGCACG
GGCGAAACTC GCACCACTAC AACCTCCAAC ACCGGCGCCT ACGTGGTTTC CCTCCTTAAT
CCCGGCACGT ATGTGTTGGC GGTCACTAAG ACCGGCTTCA AGCGCGCGGA GCGGCCCGAC
ATCACCGTCC ATATCACGGA GACCGTCGCT GACAACGTCC AGATGGCAGT TGGGTCGCAG
AATGAAACCG TTTCCGTGAA CGATATGGGC GAACTGCTGA AGACCGAAGA TAGCTCTCTT
GGTAACGTCG TGGATCAGCG TCAAGTCGCG AACCTGCCCT TGGTCACGCG CAACTATCAG
CAAATCCTCG GCCTATCACC GGGCGTCTCG GCTGAGATTT TCAATGCCGG CGAGATCGGT
CGCGGCGGCG TGGATGGTGC TCTCGTGACC GGCGGCGCCA GTTATTCCGA CAACAATTTT
CAGATGAATG GCGTCAACGT CAACGACCTC CAGGGGAGCG GTCACTTCAG CGGTGGCGTG
TCCGCTCCGA ACCCTGACAC CATCGAGGAA TTCAAGGTCC AGACTGGCCA ATACGACGCT
TCGTTCGGAC GGAACGCGGG CGCCAACGTG AACGTGCTGA CGAAGTCCGG CACCAATCGC
TGGCATGGCA GTGGTTGGGA GTTCTTCCGC AACGAGGCCA TGAATGCCAA TGATTACTTC
CGCAAACAGA CCGACCAGCC TCGTGCCGAG CTGCGGCAAA ACCAGTTCGG TTTCACGTTT
GGCGGCCCTA TCGTCAGGGA CAAACTTCTT TTCTTCACGT CCTATCAGGG AACGCGCCAG
AACAACGGCA TCGATCCGAG CTGTTCGAGC AGCGTAACGC TGCCTGTGTT GACCGATGAT
CGCTCCAATG CAGGGTTGGC AGCAGCCGTT GGAGCGACGA CAGCGTTCGG CGGTATGGAC
CCGTATACCG GAAATCCAGT AACTGCGGCG AACATCAGTC CGCAGGCCGC GGCGCTCTTC
AATGCGAAGC TTTCGAATGG GCAATACCTG ATTCCCAACC CGCAGGTCAT CAAAACCGAT
CCTGCGACTG GCTTGCCCGA AGGCTTTTCC ACGTATAGCG TGGCGTGTCC CTATCACGAA
GACCAGTTCA TGGTGAACCT CGACTGGCTG CAGAACTCCA AGAGCACGTT CCAGGAACGC
TTCTTCTACG CGGACAGTGA AGCGACATCC ACGTTGCCGC AAACCCAGAC AGTTGGCGAT
CAAGTTCCCG GTTCTCCCTC GAAGAACCCG CAGAACTTCC GCGATTTCTC GCTCAGCCAT
ACCTATGTGT TCACCTCGGC ACTGGTGAAC CAGGCACAAA TTGGATTCAC CCGCAACCTG
GCCGGCACCA ACCAGTCGTT CCCGCTGAAA TATTCCGACA TTGGTGTGAC TGCACCCGGA
TTTGACGATG CACGTGCAAA CATCTCGGTG CTCGGCGGCT TCGATGAAGG CGGCAACGGC
CAGACGACCG TCATCGCTCA GAACAACTAC ATCTTCCAGG ACACGCTCTC CTGGTTCCAC
GGACGTCACT CGTTCCGCTT CGGCGGGAAC ATCACGCGTT CACAGGACAA TATCTCCGAG
TTCGCGTTTG CCGGCTATAC GATCTTCCTC GACTATCCGG GCTTGATGAT TGGCGACGGT
CCCTTCAATC CTTACCAGTC TGTCGACCTC GCGGGCATCA CCCAGCGCGG CTACCGCGTG
TGGGACGGGT CGCTCTACGC GCAGGACGAT TTCAAAGTTA CCCAGAGACT CACCCTCAAT
CTGGGCTTCC GCTATGAGCG ACTCGGTGAT GTCGGAGAGA ACGCGGGCAG AAATGCCAAC
GTGAATCCTT CGCTGGTGAA TCCGAATCCA GGAGCCGCTG GAAGTCTCGA AGGCATCATT
GTTGCCAGCA ACTTTTCGGG GCAGATTCCC GACGGAGTCA CTCGCGCCAG CAACGATCTC
GCGATCAACG GTGACGGGCA GAACACGTGG AACCCACGCA TCGGTTTCGC ATGGATGTTG
CCCGGCTCAG ATCGCTTCGT TTTACGCGGT GGTTACGGCC TTTACCGCCA GCGAATTACC
GGCCAACCCT ACTTCCAGCT CGAGACCAAC CAGCCGTGGG GACAGTATCG AGCTGCCGTA
GGGACTGCAG GCTTCGCCAA TCCGTTCGGC CCCGATCCCG GAGCATTCCC GCAGTTCTTC
CCGTACTCAG CTCCCGTGGA ATACCTTCCC GGACAATTTG CTGCCACTAC CACGCTCTCT
CCGTTTGCCT TGGCGCAGAA CCTCCGCCCG CCGCTGTTCC AGCAATACGG ACTGAATTTG
CAGGCGCAGA TCACCAAGTC AACGGTGGTA CAGGTGGGCT ACGCCGGCTC GCACGGCACG
CACATGCTCC TCTACAACAA CCTGAACCAG GCATCGGCGG CGAGTGCCGA CAATCCGGTG
CGCGGTCAGA CCGACACTAC CTTAGGCAAC TTCTACGCCC GGATTCCCTA TGAAGGCTTC
GGTGCCCTCT ACTACGACCA GAGCACCGGC TACTCGTGGT ACAACGCGCT GCAGGTCAGC
GTGGAACATC GATTGAGCCA CGGATTGCAG TTCCTGGCCT CATATACCTA TGCCAAAGAC
CTCACCAGCG TGTGGGGCGC CACGACCGGC GCGAACGGCG GAACACAGGT TGGCGATAAC
TTCAACCCGA ACCGCGACCA CGGTCCGGAC ATCTTTATTC GTCCCCACCG TTTTGTGCTC
TCGTACGTTT ACGAAATTCC CGGGTTCCAC GACCATGGCT GGGCGAGCGC GCTGCTGTCA
GACTGGAAAG TTGCCGGCGT GACGACGCTT CAATCCGGAC ATCTCCTGCC GGCGCTCGAC
GTGAATCCAA CGAACGTCTA CACCCAGGGT TATAACTACG ACTTCGCGAC CATGACACCC
GGGTGTTCGC TGAGCAAAGG CGGTTCTGTT ACTGGTCGCC TGAACGGATG GATCGACACA
ACCTGCTTCA CTTCCGCTCC TCCCGCATCG GCGGATGGCG GCACGGGTTT CGGAAACACT
TCGCTGGGAC TGTTCAAGGG CCCGGCGCAA GCGAGCTCGG ACCTCTCGTT GATCAAGGTC
TTCCCAGTAC GTCGGTTGAG TGAAGCTGCC AATTTCGAGT TCCGCGCGGA AGCCTTCAAC
GTTTTCAACC AGGTCAATTT CGCCGATCCC GATAACGTCT TCACCGATGG TCCAAGTTTT
GGAACCATCA CGAAGACGCT GTCCAACCCG CGCATTCTGC AGTTGGCGCT GAAGTTCTCT
TTCTAA
 
Protein sequence
MTLFGQRMVL HIACVSLVLI GLVNTVPAQT ASTGAIAGTV TDPAGAVIPN ATVTATDART 
GETRTTTTSN TGAYVVSLLN PGTYVLAVTK TGFKRAERPD ITVHITETVA DNVQMAVGSQ
NETVSVNDMG ELLKTEDSSL GNVVDQRQVA NLPLVTRNYQ QILGLSPGVS AEIFNAGEIG
RGGVDGALVT GGASYSDNNF QMNGVNVNDL QGSGHFSGGV SAPNPDTIEE FKVQTGQYDA
SFGRNAGANV NVLTKSGTNR WHGSGWEFFR NEAMNANDYF RKQTDQPRAE LRQNQFGFTF
GGPIVRDKLL FFTSYQGTRQ NNGIDPSCSS SVTLPVLTDD RSNAGLAAAV GATTAFGGMD
PYTGNPVTAA NISPQAAALF NAKLSNGQYL IPNPQVIKTD PATGLPEGFS TYSVACPYHE
DQFMVNLDWL QNSKSTFQER FFYADSEATS TLPQTQTVGD QVPGSPSKNP QNFRDFSLSH
TYVFTSALVN QAQIGFTRNL AGTNQSFPLK YSDIGVTAPG FDDARANISV LGGFDEGGNG
QTTVIAQNNY IFQDTLSWFH GRHSFRFGGN ITRSQDNISE FAFAGYTIFL DYPGLMIGDG
PFNPYQSVDL AGITQRGYRV WDGSLYAQDD FKVTQRLTLN LGFRYERLGD VGENAGRNAN
VNPSLVNPNP GAAGSLEGII VASNFSGQIP DGVTRASNDL AINGDGQNTW NPRIGFAWML
PGSDRFVLRG GYGLYRQRIT GQPYFQLETN QPWGQYRAAV GTAGFANPFG PDPGAFPQFF
PYSAPVEYLP GQFAATTTLS PFALAQNLRP PLFQQYGLNL QAQITKSTVV QVGYAGSHGT
HMLLYNNLNQ ASAASADNPV RGQTDTTLGN FYARIPYEGF GALYYDQSTG YSWYNALQVS
VEHRLSHGLQ FLASYTYAKD LTSVWGATTG ANGGTQVGDN FNPNRDHGPD IFIRPHRFVL
SYVYEIPGFH DHGWASALLS DWKVAGVTTL QSGHLLPALD VNPTNVYTQG YNYDFATMTP
GCSLSKGGSV TGRLNGWIDT TCFTSAPPAS ADGGTGFGNT SLGLFKGPAQ ASSDLSLIKV
FPVRRLSEAA NFEFRAEAFN VFNQVNFADP DNVFTDGPSF GTITKTLSNP RILQLALKFS
F
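
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the tool that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (table 11 differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Codon table built in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACGCTCT TCGGGCAACG TATGGTGCTG"))  # prints: MTLFGQRMVL
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field of the record.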