Gene Acid345_0352 details


Gene Information

Locus tag: Acid345_0352
Symbol:
ID: 4069594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 385617
End bp: 388730
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 59%
IMG OID: 637982355
Product: hypothetical protein
Protein accession: YP_589431
Protein GI: 94967383
COG category:
COG ID:
TIGRFAM ID:
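
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(385617, 388730)
print(length, protein_length_aa(length))  # 3114 1037
```

Both results match the Gene Length and Protein Length fields of the record.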


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.197071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.320234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCC GCTCGTTGTG TCTCTGCTTT CTCCTGGGAG TTTCCGCTGC AATCGCGCAG 
CCCACACCAG CTTCCACAGC GACCAGCCAA GCGGCCATCG TGCCTCACCT CATCCGCTTT
ACTGGACAAG TGAAAGACGC CAACGGAACG GTCGGCATCA CGTTCACACT CCACAAATCA
CAGAGTGACA ACGCCGCGCT CTTCACGGAA ACACAAAACG TGAAGCTTGA CGGCGAGGGA
AGGTACACCG TTCTCCTCGG GGCAACCAAG GGCGACGGCA TTCCGATGGA ACTCTTCACG
TCCGGCGAAG CCCAGTGGCT CGCGATTCGC GTCGAAGGTC AAACGGAACA GCGTGTGCTG
CTCGTCAGCG TTCCCTACGC GATGAAGGCG GCGGAGGCCG AGACTCTAGC GGGACACGCC
GCGACAGACT TTGTGACCGC CGACCGGCTC ACCAGCACGG TTCAGCAACA AATGCGCCAG
CAAGCCTCGA CCACAACCAC AGCGAAAGAC GCACCGACAG GGAAGCGCGG TAATGTGGTG
ACCAACACCG CCACCAACTT CGCCGACGCA ACCAGTACCC AGGTTGTGCT CGTGACCCAG
AGCGGCGCTG GTTCAGGACT CGTCGCCAGC GCGGTATCCG GGAATGGCGT CGCCGGATCC
ACCACCACCG CGGCTGGCTT CGGGGTGTCG GGCGCCAACT CGGCTACGAC AGGCGTAGCG
ATCGGCGTAC GCGGTACCAC CGTTGCCGAT AGCGGTATCT CGGTCTTCGG AACCGCCAGC
GGAACGGCCG GCAGCGCGAC CGGCGTAAAG GGCATCACGG GAGCTCCGAA CGGATTTGGC
GTCTTCGGCC AGAACACGGC GACCACTGGG CCAGCCGTCG GTTTCCGTGG CACGACCGCA
TCAACCAGCG GCATCGGAAT TTTCGGCACC GCCACCGCCG CCACCGGCAC GGCAATTGGT
CTGCGAACCT CGGTAGCCAG TCCGGGCGGG ACCGCCGCTG TTCTTCAGAA CACCGCCAGC
GGAAAGTTGA TCAGCGGGCA ATCGGGTGCC ACAAATACCG AGGTTTTCTC GGTGGACGGT
GCAGGGAACA CCGTGAGCGC CGGTGGTGTC CAGGCGGCGA CGATGAATGT GGTGAACACC
ACAGTGCGTC AGCCGTTTCA ACTGAATGGT ACCGGCATCC TCGGCATCGG TGATCCTACC
GAGTTGAACG TGTTTGTTGG GCGCGATGCC GGTAAGGTCA ACGTCGCCGA CTTTCCCACT
GGGGCCGGAA TTGGAAACAC CTTTGTGGGA AACGGCGCCG GCGAGCACAA CATCGATGGA
AGCAATAACA CATACGTCGG CCTTTTCACC GGTGGCGCGA TCCACTCTTC AGACAACACG
GCGCTCGGTG ACAGTGCCGG AGCGGGAGAC GGTGCGAGAA ACACTGCCAT TGGCAAAGCC
GCAGGCGCAG GCGTCCACGA TGACAACACC ACTCTCGGGT ACGAAGCTGG GTTTGGGAGT
AGCGGTGCAC GCAACGTGGT AATCGGCGCA AGTGCAGCTT CAGATTTCTT TTCCGGTAAC
GAAAATGTTG TTGTCGGAAT GCAGTCGGCG CTCCATCTTT CCACTGGTTC GCACAACACG
TTCCTAGGCG CGGGGGCCGG AGCTTTGACC TCCACTGGAT CTTTGAACGT GATGATCGGC
CAGAACGCCG GCACTGCGTC TAGCGCAGGT AGTGGAAATG TCTATATCGC AAGCAACGGC
TGTAACCCAT CGCCGTGCAA TGAGAACAAT ACGATACGCA TTGGCGGGGA CTCCGGCCTT
GGAACCGGTC ATACCGCGGC CTTCTTTGCG GGCATCAATG GCCATGCAAT TAGCACGGGT
TCGCCCGTGT TTATCGACTC GAACAATCAG CTCGGCACCG GGCCTGCGAC GCTGCCACCG
TCGGCCGGCT CCTCCTTTTA CATCCAGAAC AACACCGGCA GCCCGCAAAC CTCAGCCAGC
TTCAACATTG ACGGCAACGG TTTCGCGGGC GGTCTCCTTC AAGGTGGGTT CGTAAACGCG
ACCAGCACTG CGGCGAATAA ACCATACCGC GAGAATGGTG TCCCTTTCCT CGGTATCGGA
GTCGAAGGTC AGAACAATGT CTTCCTCGGA GAATTGGCGG GCCAAAGCAA TGTGAGCGGG
AGCGGACTCA ACAACACCTT CGTTGGTGCG TCAGCCGGTA ATTCAAATAC CGGAGGAGAT
AGCAACACCT TTCTCGGCAG CTCCGCGGGA CAATCAAATG TGAGCGGCGG CTTCAATACT
TTCGTTGGCG TTGATGCTGG TTTAAGGAAT ACGACCGCTT CCGGAAACAC CTTCATCGGT
CAAACTGCCG GTATCGAAAA TTCAACCGGC GCCTCGAGTG TTTTTGTTGG TCATAGTGCC
GGTGCCAACA ACACAACGGG TGGCCATAAC GTTTATGTTG GAACGACTGC TGGCCTCGAC
AATTCGACGG GAGGCTTGAA CACTTTTGTC GGCGATGGAG CCGGCTTAAC CGACACAGGG
AACGCCAATA TCTTTGTTGG CGCCAACGCC GGTGGCAATA ACACCTCGGG CGACAACAAC
CTCTACATCG GCAATGTCGG GTGCACGTCG CCCTGCACCG AGAGCGCTAC GATTCGCATC
GGCAACACCC AGACCTCGGC CTTCATGACG GGTATTGCCG GGAAGACCTC ATCGAGTGGC
ATTACGGTCC TGATCAACTC GACAGGGAAA CTCGGTACCA CCACGTCGTC CCGCCGCTTC
AAACAGAACA TCGCGAACAT TCCCGACAGC AGCAAGCTCT TCCAGTTGAG GCCGGTCACC
TTCTTCTATC GCCCCGAATA CGATGACGGC ACCCACGTGC GGCAGTATGG CTTGATCGCC
GAAGAGGTCG CGAAGATCTA TCCGGACCTC GTCGTCTTCG ACAACCAGGG CAAGCCGTAC
ACGGTGCGAT ACCAGTTCCT CGCCCCGCTC CTTCTCGACG CCATGCAGAA GGAACACGCC
GTGGTCGCCG CGCAGCAGAG CGTTATCGCT TCACAACAGA AACGCATCGA CGAACTCTCG
CAGCGTCTCG CACGCCTGGA GGAAACCGTA AACCGTATTT CCGCGGCGCA CTGA
 
Protein sequence
MKLRSLCLCF LLGVSAAIAQ PTPASTATSQ AAIVPHLIRF TGQVKDANGT VGITFTLHKS 
QSDNAALFTE TQNVKLDGEG RYTVLLGATK GDGIPMELFT SGEAQWLAIR VEGQTEQRVL
LVSVPYAMKA AEAETLAGHA ATDFVTADRL TSTVQQQMRQ QASTTTTAKD APTGKRGNVV
TNTATNFADA TSTQVVLVTQ SGAGSGLVAS AVSGNGVAGS TTTAAGFGVS GANSATTGVA
IGVRGTTVAD SGISVFGTAS GTAGSATGVK GITGAPNGFG VFGQNTATTG PAVGFRGTTA
STSGIGIFGT ATAATGTAIG LRTSVASPGG TAAVLQNTAS GKLISGQSGA TNTEVFSVDG
AGNTVSAGGV QAATMNVVNT TVRQPFQLNG TGILGIGDPT ELNVFVGRDA GKVNVADFPT
GAGIGNTFVG NGAGEHNIDG SNNTYVGLFT GGAIHSSDNT ALGDSAGAGD GARNTAIGKA
AGAGVHDDNT TLGYEAGFGS SGARNVVIGA SAASDFFSGN ENVVVGMQSA LHLSTGSHNT
FLGAGAGALT STGSLNVMIG QNAGTASSAG SGNVYIASNG CNPSPCNENN TIRIGGDSGL
GTGHTAAFFA GINGHAISTG SPVFIDSNNQ LGTGPATLPP SAGSSFYIQN NTGSPQTSAS
FNIDGNGFAG GLLQGGFVNA TSTAANKPYR ENGVPFLGIG VEGQNNVFLG ELAGQSNVSG
SGLNNTFVGA SAGNSNTGGD SNTFLGSSAG QSNVSGGFNT FVGVDAGLRN TTASGNTFIG
QTAGIENSTG ASSVFVGHSA GANNTTGGHN VYVGTTAGLD NSTGGLNTFV GDGAGLTDTG
NANIFVGANA GGNNTSGDNN LYIGNVGCTS PCTESATIRI GNTQTSAFMT GIAGKTSSSG
ITVLINSTGK LGTTTSSRRF KQNIANIPDS SKLFQLRPVT FFYRPEYDDG THVRQYGLIA
EEVAKIYPDL VVFDNQGKPY TVRYQFLAPL LLDAMQKEHA VVAAQQSVIA SQQKRIDELS
QRLARLEETV NRISAAH
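
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated in code. This is a minimal sketch, not the database's own pipeline: it builds the codon-to-residue mapping shared by NCBI translation tables 1 and 11, ignores the alternative start codons that table 11 permits (this gene starts with ATG, so they do not matter here), and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order (same mapping for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAGCTCC GCTCGTTGTG TCTCTGCTTT"))  # MKLRSLCLCF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.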