Gene Acid345_3218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3218 
Symbol 
ID4070430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3807170 
End bp3810397 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content59% 
IMG OID637985239 
Producthypothetical protein 
Protein accessionYP_592293 
Protein GI94970245 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.188502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGA AGCTCCTGCT GCTGGTTGTA TTGCTCGCGT CCAGTTTCAC CCTCCTAGCT 
CAAACATTCA GAGGCGGCAT CGAAGGCACA GTCACAGATG CCTCTGGTGC AGCCATCCCC
GGCGCACAAG TCACCGCCAA CGATCCCGCA ACCGGCACCT CGCGTAGCGC GACGACCGAT
GGATCCGGCA ACTACGTGTT TACGGAAATG CCGCTTGGCG CTTATGACGT TACAGTCGAG
CACGATGGAT TCCGCAAGCA GGTAATTCGC GGGGTGAAAG TGGAAGTCGG CGCTCCGAAC
CGCGCCAACG CCACGCTCAC TCCGGGCCAG GTGAAGGAAA CGATTGACGT CTCGGCAGAG
ATTCCGGTAA TCGAGACGCA GGCCGACACC ACGGGCGACA CGATTTCCGG CGACCAGGCG
AAGGACCTGC CGGTCAACGG GCGTGATTTC ACCAAGCTCT TCCAGTTAGT CCCAGGCGCA
GCGGGTGACC CGAGCGGTAT CAATGACTCG CCTGGTTCAT TCGGCATGGT CAGCATCAAC
GGCAACCGTG GACGCTCCAA CAACTACCTG CTCGACGGTA CCGACATGAA TGACGGTTAC
CGCAATCTGC CGGCCATCAA CGAGGGCGGC GTGTTCGGTA CACCTGCAAC GATTCTTCCC
ATAGACGCCC TTGCCGAAGT TCCGGTGATT TCGAATACGG AAGCCGAATA CGGACGCAAC
TCCGGCGCGG TGGTGAACAG CGTCACGCGC TCTGGGACGA ACGCGCTGCA CGGCAGCGTG
TACGAGTACT TCCGCAATAA CGGACTGGAT GCGCGCAACT ACTTCAACAG CTCCGGCCCG
CAGGACGCGT TCCACAACAA TCAGTTTGGC GGATCGCTGG GCGGCCCGAT CATCAAGGAC
CGTACGTTCT TCTTCTTCTC TTATGAAGGA CAGCGTGAAA GCGGCGGCAT TCCAACGCCC
GAGACAGTGC CGACGCTGGA CCAAATCGGC GCCTACACTG CCGGTGGCGG CGTGGTAAAT
CCCGTGATCG CGAGCCTGCT CGCTCGCAAT CCGTGGGGAA CCTTGCCGCA ATCAGACGGC
AACGTGCTCT TGACGAATCC GTTCACGAAC ACAGTCGATA GCCTGATCGC AAAGATCGAT
CACCACTTCC TGGGCGCCGA TAAGCACGAT CTCATCACGG GCCGTTATTA CTACGGCAAC
AGCAGCCAGA GCTTCCCGTT GGCGCTGGTC GGCGGCGGCG TAACTCCAGG TTTCAACACC
ACGACGCCAA CGCGGGTGCA GATTGTTTCG CTGTCGTACA CGCACATCTT TTCGCCGAAG
TTCCTGATGG AGTTCCGCGG CGGCTGGAAC CGTTTTGCGG AGCAGTTCTT CGCGCAGGAC
AAGAGCTTCG ATCCGGCGTC GATCGGGTTG TACTCGGCAT CGCCGAGCGC TACGGCGAGG
GACGGGGGAT TGCCGCTGAT GACGTTCGGC GATGGCACTG GCAGCATCGG CGCGAACCTT
TCTGTGCCGC GCGGACGCGT CGATACAAAC ACGCAGTTCT TTACCAATGC GTCGTATAGC
ACCGGCAAGC ACAATTTCAA ATGGGGCTAT GAATTTCGCC GCACGTTCGT GAATGGCTAC
TTCGACGCGG CCTATCGCGG CCGAATCCAC TTCAACTCGT TCGACGATTT TCTCGCGGGC
ACGCCTGCGG ATTCAGGAAA CCACTCGGCC ACGGGGTACT CCGCGCGCCA CACCTTCGAG
AACAACCACG CATTCTATTT CCAGGACAAC TGGCGGCTGA CCAACCGGCT GACGGTCAAC
TACGGATTGC GCTGGGATTA TTTCGGCGTG ATCGGTGAGC AAAACAACCT GTTCAGCTTC
CTCGACGTCC CGACCGGAAA CCTGAAACAG GTGGGAGCAA ATGGCGGCCC GAGCACGCTC
TACCCGAAGG ACTTCAACAA CTTTGGGCCC CGCCTGAGCC TCGCCTATGA CGTCTTCGGT
ACCGGCCATA CGGTGGTCCG CGCCGGCTAT GGAATGTTCT ACGACGCATT CTCGCAGGAC
TTTTTCGTAG GACAGTTGCC GTGGAACACC TTCAATCCCG GTCCCGCATA CAACGCGGTT
CCCGGCGCCG AGATTGACTT TACCGGCAGC GTGAATCCGA TCGATCCGAA TCCTGCAAAC
CACACGCCGA TATTCACCGG CTACGGTGCC ACGGATGTGT TCAGCGTGGA CCAGCACCTG
CGGACGCCGT ACATCCAGAG CTACAACGTC AACGTTGAAC AAGAGATCCG CAACGGTGTG
GCGGTGAGCC TGAGCTACGT TGGATCGCAG GGCCGCAAAC TGTTCCGCTA CATCGATCTT
AACCAGGTCA ATCCGGCCGA TGGCTCGATC GCGTATCCGC AGTATTACTA CGTGAACCAG
TTCCAGTCAT CGGCGGCTTC GGGTTACAAC GCGCTCCAGG CGCAGTTCAA AATCTCGAGC
TGGCACGGAC TGACCTCGAC GATGAACTTC ACGTGGGGCC ACTCGATCGA CAATGCCAGC
GACGGTCAGG ACTATGTGAC CAACGCTACG CAGCCGGACA ACAGCTTCAA TCCTGGCGCC
GAGAGAGCTA ACTCTAACTT CGACTTGCGT AAGGCGTTCA AGTGGTATTA CACGTACGAA
CTGCCGAAGT TCGAGACAGC GAAGTGGATC ACGAACGGGT GGGCGCTCAA CGGTGTACTG
TCGCTCGCTG ATGGGCAGCC GTTCAACGTG ACCTGGCTCG ACAACTTCAA TTACGACATC
AACGGAACGG GCGAGTACTT CGGCCGCCCG GACTTGGTTG GAGATCCTTG GGCAGGCACG
CATGGACCGG CCAATTTCCT TAACCTCTCG GCGTTCGCAG CGCCTTGCAA CTGGGACAAC
GTGAACGGTG GCTGTATCGA CGGCCAGCAC ATTGGGAGCT TGAGCCGCAA CGCGTTCCGC
GGTCCGGCGT ACAAGAATTT CGACTTCTCA GTGTCGAAGA CGTTTGCCTT CACGGAACGA
GTAAACGCTC GCTTCGGCGC GGACTTCTTC AACATCTTCA ACCATCCGAA CTTCTCCAAC
CCGGTGCTTC CGAATTACGT GGTGGACGCG GCTTACAACG GAGACGCGAG CGGCGTGGGA
CATGGATTCC TGCCGATCAC GGCGACTCCT GATGTAGGCG GTGGCAATCC GTTCCTCGGT
GGCGGCGGCC CGCGCGACAT CCAGTTGTCG CTCAAAGTCA CGTTCTAA
 
Protein sequence
MRAKLLLLVV LLASSFTLLA QTFRGGIEGT VTDASGAAIP GAQVTANDPA TGTSRSATTD 
GSGNYVFTEM PLGAYDVTVE HDGFRKQVIR GVKVEVGAPN RANATLTPGQ VKETIDVSAE
IPVIETQADT TGDTISGDQA KDLPVNGRDF TKLFQLVPGA AGDPSGINDS PGSFGMVSIN
GNRGRSNNYL LDGTDMNDGY RNLPAINEGG VFGTPATILP IDALAEVPVI SNTEAEYGRN
SGAVVNSVTR SGTNALHGSV YEYFRNNGLD ARNYFNSSGP QDAFHNNQFG GSLGGPIIKD
RTFFFFSYEG QRESGGIPTP ETVPTLDQIG AYTAGGGVVN PVIASLLARN PWGTLPQSDG
NVLLTNPFTN TVDSLIAKID HHFLGADKHD LITGRYYYGN SSQSFPLALV GGGVTPGFNT
TTPTRVQIVS LSYTHIFSPK FLMEFRGGWN RFAEQFFAQD KSFDPASIGL YSASPSATAR
DGGLPLMTFG DGTGSIGANL SVPRGRVDTN TQFFTNASYS TGKHNFKWGY EFRRTFVNGY
FDAAYRGRIH FNSFDDFLAG TPADSGNHSA TGYSARHTFE NNHAFYFQDN WRLTNRLTVN
YGLRWDYFGV IGEQNNLFSF LDVPTGNLKQ VGANGGPSTL YPKDFNNFGP RLSLAYDVFG
TGHTVVRAGY GMFYDAFSQD FFVGQLPWNT FNPGPAYNAV PGAEIDFTGS VNPIDPNPAN
HTPIFTGYGA TDVFSVDQHL RTPYIQSYNV NVEQEIRNGV AVSLSYVGSQ GRKLFRYIDL
NQVNPADGSI AYPQYYYVNQ FQSSAASGYN ALQAQFKISS WHGLTSTMNF TWGHSIDNAS
DGQDYVTNAT QPDNSFNPGA ERANSNFDLR KAFKWYYTYE LPKFETAKWI TNGWALNGVL
SLADGQPFNV TWLDNFNYDI NGTGEYFGRP DLVGDPWAGT HGPANFLNLS AFAAPCNWDN
VNGGCIDGQH IGSLSRNAFR GPAYKNFDFS VSKTFAFTER VNARFGADFF NIFNHPNFSN
PVLPNYVVDA AYNGDASGVG HGFLPITATP DVGGGNPFLG GGGPRDIQLS LKVTF