Gene Acid345_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0445 
Symbol 
ID4071692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp523023 
End bp526544 
Gene Length3522 bp 
Protein Length1173 aa 
Translation table11 
GC content57% 
IMG OID637982449 
Producthypothetical protein 
Protein accessionYP_589524 
Protein GI94967476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0367904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTGTG CGTTAACTCG GGTGCCAGAG CTGTGTCTGC GCGCCGTGCA TCGATCGCGT 
TTTGCGATCC TGTGCGCGCT AGTGCTTTTC TTTTCAGCTC ACGTTTTCGG ACAAGAAGCG
ACCGTGGTCG GGACAGTTAC CGATCCTACC GGCGCGAATG TTCCTAACGT GACGATCACC
ATCACCCATA TCGAAACGGG TCAAGTCCGT AACGTCACGA CTAACAGCGA AGGACAGTTC
GCAGCGGCAG CGCTGCCCAT CGGACACTAC AACATCACTG CGCAGTCCGA AGGCTTCGGT
GTCGCGCAGA AAACCAACGT CGCCCTCAAC GTGGGCGATC GTACCCGCAT TGACTTTGCT
CTTGCCGTGG GGTCCACGAA GCAGACCGTA ACGGTGGAAG CGAATGCAGT TCAAGTGCAA
ACCCAGACCG GCGAAGTCAG CACGGTCATC GACGGTAAAC AAGTTGCGCA ACTGTCTACC
AATGGCAGAA GCGTTTACAC CCTATACGCA TTGACTCCGG GCGCCTCGAG TGTGCAAGGA
GACTTCATCA CGCCAACGCC GGTAAGCGGC GACAGTAACG TGAGCATCAA CGGCCAGCGG
CCAGGCCACA ACCTGCAGAT CCTCGATGGC GGCGAGAACC TCGATCGCGG CGGCAGCAGC
GCGAGTGTGA TGCCTTCGCT CGAAGCGATC GCAGAATTCA CGAACATGAC GTCGAACTAC
AGCGCCGAGT ATGGCCTGTC GTCCGCCGCA ACCATTACCA CCGCAGTCAA GTCGGGCACC
AACCGGTTCC ACGCGTCGGC GTGGGAGTTC CTCCGCAACG ATGCTTTGGA CGCACGTAAC
TACTTCAACC CGGCGCCGGC GAAAGTTGCC GAACTCCGCT ACAACAACTT CGGCTTCAAT
GTCGGCGGTC AAGTGCCGCT CTGGAAAGAC CATCCGACAT TCTTCTTCTA CAACCAGGAA
TGGCGCCGGT TGGTGCAGGG TGGACAGTTG AACCAGACTG TGCCGCTCGC GAGCGCGTAT
CCGGATGCGA ACGGAACGGG AACGGGCGCA GTGATTCCGA CAACCCTGCC GAACGGGGAC
CCGAACATCA TCACCGTTCC GACGGGCATT GCGAACTTCG GGGCGAATTG TTCCGGCGCC
GTGCGTGCAT CGTTGATTCC GGGCCAACCG TTCCCCGCGA ATACCATCCC AGATTGCCTG
ATCGATCCGA ACTCGCAATC GTTGCTGAAG GCAGGTATTT TCCCCTTGCC GACGAACGGC
GTGCAGTTCC AAGGCGGAAA CAACTCGCCG ACGAACGTGA CGGAAGAGAT CGTTCGTATT
GACCACCAGT TCTCGTCCAA GTTCTCGATC TTCGGGCACT GGATTGCTGA GCAGGTGTCG
CAGACGTATG GCACCACGCA GTGGAGCGGC GACAACGTCC CGACGATCAG CGACGTTTTC
GGCAACCCGT CATTCAGCGG GGTGATCCAC ACCACATACA CGATCAGCCC AACCCTGCTC
AACGAGGCGT CGTTTAACTA CAACGGCAAC CGCATCCACA TCATCCCGCA GGGACTGGTT
TCGGCGCCCG GCGACTTCAC CTTCAATCGC TTGTTTACCG GTCCGAACGA TCAGACCCGT
ATTCCGTCCA TCGCGCTGAG CGGCAGCACG GGCACCAACT ACACCTCCAA CTGGACGCCA
TGGAACAACG CCGCAAACGA CTACCAGGTT CGCGACGATG TCACGTGGTC CAAGGGCGCT
CACCAGCTCA AGATGGGCGG CAGCTGGGCC TTGTACACCA AGGTTCAGGA TGCTTTCGCC
AATACCCAGG GCAATTTCTC TTTCAACGGT GGATTTAGCG GCAATGACTT TGCCGACTTC
CTCCTCGGAT ATGCACAGCA GTACACCGAG GATGCGGTGA AGATCAGTGG GAATTGGAAC
AACGTATCCT GGGCAGCCTA TGTTCAAGAC AACTGGCGCG TAACTCACCG GTTGACCCTG
AACCTCGGAC TTCGTTGGGA CGGTGTACCT CACACCTACG AAGCGAACAA TCAATCGTCC
AACTTCTACC GCAACCTCTA CGATCCCGCG AATGCGGCGA CGTTCGATGC GAACGGGAAC
ATCTGCAGCG CAAACTCGGT TCCTGCTTGT CCAGGTGGAC CCAGCCCGGG CTTGGGTACA
AGTTCGAACC CAATCCTCAG CGGTGTGGGG TTCTATGTGA ACGGCATCGG CATCGGTGGT
TTGAACGGCG TTCCCAATGG ACTGGTGAAC AACCATTGGG CAGCGTTCGG ACCTCGTCTG
GGCTTCGCCT ATGACCTTAC TGGACAGGGC AAGAGCGTGG TTCGTGGCGG CTTCGGCATC
ATGTATGAGC GCATTCAGGG CAACGATATG TACAACGGTG CCGTGAACCC GCCCAACGAC
TTGCAGCCAC TGCTAAACAG TGTCTCCCTC TCTAACCCGG GATACAACAT TAAGACTGGC
AATTCGATCA CAGCAGCGGC CCTGCCGGTG TTGCCACTCG TGGTGAGCGG TATTGATTCG
GAAAACTACA AACTTCCCGT CAACTACCAG TACAGCTTCG GGTTCCAGCA GTCGCTCGGT
GAAGCGTCGG TCCTCGGTAT CTCGTATGTT GGCAGCCAGT CACGCCATCA GAACGATTAT
CGCCAGATCA ATCTTCCGGC GATCACCTCG TTGCCCGCTC TGTTAGCGAG CAACGGCGCC
GGCATCAATC AACAAATGCC CTACCTCGGT TTCGGCGATA TTCGCCTGGC CGAGAACGAG
GCGAACGCCA GCTACAACTC CCTGCAGGTC GACCTGCGAG GTAACTTGAA GCGGGACTTC
CAGTATCAGT TCGGCTACAC ATGGTCGAAG GCCATTGACG CAGCCACCAG CAACGGAAGC
GGCGGCGATC TCAACAACGT AACCAACCCG TACGTCGGAT GGCAGTACGA TAAAGGACCC
TCTCCGTTCG ATCGTACGCA TATCGCGTTC GCAAACTTCG TGTACTCGAT TCCTCTGTTC
AACAACAACG ACAGCCGTGC AGTAAGAACG ATTCTCGGCG GCTGGCAGGT GTCGGGAATT
GTGACCATGG AGTCGGGTGC GCCCTTGGCA ATCGGCTTGA GCGGTAGCAA TGTCTCGAGT
TTGTTCAACG GCGGCTTCGT AACGAACAGT GGTAATCGGC CCAACGTGAA TGGAGCGGTC
ACCTACCCGC AATCCGTGAA CGAATGGTTC GACACCTCCG CATTCTCGGC TCCCACTTGC
ACGACCGGAC CGGACTGCTG GGGCGACCTC GGCCACAATA CGGTCCGCGG TCCTGGGCGT
GACAACTGGA ACCTCTCACT GTTCAAGAGC TTCACTCTGA ACGATCGCGG CAGTCGGATC
GAATTCCGGG CAGATTCGTT TAACACCTGG AACCACACGC AGTTTAAGGG TGATTACAAC
AACGGCGGCA TCAGCACAAA CTTCGGCTCC GGTAACTTCG GGGCGGTAAC CGCCGCATTC
GACCCGCGCG AATTTCAGTT CGGGCTCAAA CTCGTGTTTT AA
 
Protein sequence
MYCALTRVPE LCLRAVHRSR FAILCALVLF FSAHVFGQEA TVVGTVTDPT GANVPNVTIT 
ITHIETGQVR NVTTNSEGQF AAAALPIGHY NITAQSEGFG VAQKTNVALN VGDRTRIDFA
LAVGSTKQTV TVEANAVQVQ TQTGEVSTVI DGKQVAQLST NGRSVYTLYA LTPGASSVQG
DFITPTPVSG DSNVSINGQR PGHNLQILDG GENLDRGGSS ASVMPSLEAI AEFTNMTSNY
SAEYGLSSAA TITTAVKSGT NRFHASAWEF LRNDALDARN YFNPAPAKVA ELRYNNFGFN
VGGQVPLWKD HPTFFFYNQE WRRLVQGGQL NQTVPLASAY PDANGTGTGA VIPTTLPNGD
PNIITVPTGI ANFGANCSGA VRASLIPGQP FPANTIPDCL IDPNSQSLLK AGIFPLPTNG
VQFQGGNNSP TNVTEEIVRI DHQFSSKFSI FGHWIAEQVS QTYGTTQWSG DNVPTISDVF
GNPSFSGVIH TTYTISPTLL NEASFNYNGN RIHIIPQGLV SAPGDFTFNR LFTGPNDQTR
IPSIALSGST GTNYTSNWTP WNNAANDYQV RDDVTWSKGA HQLKMGGSWA LYTKVQDAFA
NTQGNFSFNG GFSGNDFADF LLGYAQQYTE DAVKISGNWN NVSWAAYVQD NWRVTHRLTL
NLGLRWDGVP HTYEANNQSS NFYRNLYDPA NAATFDANGN ICSANSVPAC PGGPSPGLGT
SSNPILSGVG FYVNGIGIGG LNGVPNGLVN NHWAAFGPRL GFAYDLTGQG KSVVRGGFGI
MYERIQGNDM YNGAVNPPND LQPLLNSVSL SNPGYNIKTG NSITAAALPV LPLVVSGIDS
ENYKLPVNYQ YSFGFQQSLG EASVLGISYV GSQSRHQNDY RQINLPAITS LPALLASNGA
GINQQMPYLG FGDIRLAENE ANASYNSLQV DLRGNLKRDF QYQFGYTWSK AIDAATSNGS
GGDLNNVTNP YVGWQYDKGP SPFDRTHIAF ANFVYSIPLF NNNDSRAVRT ILGGWQVSGI
VTMESGAPLA IGLSGSNVSS LFNGGFVTNS GNRPNVNGAV TYPQSVNEWF DTSAFSAPTC
TTGPDCWGDL GHNTVRGPGR DNWNLSLFKS FTLNDRGSRI EFRADSFNTW NHTQFKGDYN
NGGISTNFGS GNFGAVTAAF DPREFQFGLK LVF