Gene Acid345_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2031 
Symbol 
ID4073200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2432517 
End bp2433905 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID637984045 
Producthypothetical protein 
Protein accessionYP_591106 
Protein GI94969058 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CTTACATTGA ATGTTTTTCC GGGATCAGCG GGGATATGTT CCTCGGAGCC 
CTGGTGGATG CCGGGGTCTC CGCGGAATTG CTGCGACAGA CCGTGCGCGG GCTGAATCTT
GGGGCGGAGC TGCAAGTTGC GCGGGTAGAC CGCTGCGGAA TCACATCGAC GAAGGTTGAT
GTAGTGGTGA ATGGCGAGCC TGACCGTCCG CGGGAAAACG AGCAACCGGT GCACCATGTG
CACTCGCATC AGCACGGACA CGAGCACCAG CACGAACATC ATCATGCCGA TGGGACCGTA
CATGCGCATT CCCACGCACA TGACGATGAG CCCGGGCACA CCCACGAGCA CCCGCACGAG
CATGAGCACG AAGATAAGGA AGATAAGCAT GACCACGCGC ATGGGCGGCA TCTGAGCGAG
ATCAAGACGA TCATTGCCGG GAGCGCGATC AGCGAGCGCG CGAAGAAGAC GGCGACGGAT
GTTTTCGAGG CGCTGGGCGC GGCGGAAGCG AAGATCCACA ACGTACCGGT AGAGACGATC
CACTTCCACG AGGTGGGCGC GGTAGATGCG ATCGTGGATA TCGTGTGCGC GGCCGTGGGC
GCCGAGGCGC TGGATGTCGA GCGCTTCGTG GTATCGCCAC TAAATGTGGG CGGCGGCACG
GTGAAATGCG CGCATGGCGT GTTCCCTGTA CCGGCACCTG CAACCGTTGA GTTGCTCAAG
GGCGCGCCGG TGTACGCGGG CGAAATCCAG AAAGAACTGG TTACGCCGAC GGGCGCGGCA
CTGGTCAAAG TGCTGGCGCA CAGCTTCGGG CAGATGCCGG CGATGACCAT CGCCAAGAGC
GGGTATGGCG CGGGGTCGCG CAACTTCCCT TCCCACGCAA ATGTGCTGCG CATCACTGTG
GGCGAGGCAG CGGCCGTGGA AGAATCGAAG GGTGATCTTC CGCTGGATGA AGTGATTGTG
CTCGAAGCGA ACATCGACGA CTTGAATCCG CAGCTTTTTG GCTACGTTGC CGAGCAGGCG
CTGGCCGCCG GCGCGCTCGA TGTTTTCGCC ACGCCGGTGC AGATGAAGAA GAGCCGCCCG
GGAACGCTGC TGACGTTGCT GGCAAAGCCT GAGGATGCGG AGCGAATTGC CCGGCTAGTG
TTCCGCGAGA CTTCGACGAT TGGGATACGC ACCCGCCGCG AGCAGCGCTA CGTGCTGCCG
CGCCGTCATG AAACGGTGCG CACGCAATGG GGCGAAGTGC GAATGAAGAT CGCGCAGATC
ACGGGGAGCA TCAGTAACTA TGCACCCGAA TATGAAGATT GCCGGCGAAT CGCCGAACAG
CATCATGTGC CGCTGAAGCA CGTGATGCAG GAAGCTATCA GGCTTTACCT GGAACACACG
AATGTCTAA
 
Protein sequence
MRIAYIECFS GISGDMFLGA LVDAGVSAEL LRQTVRGLNL GAELQVARVD RCGITSTKVD 
VVVNGEPDRP RENEQPVHHV HSHQHGHEHQ HEHHHADGTV HAHSHAHDDE PGHTHEHPHE
HEHEDKEDKH DHAHGRHLSE IKTIIAGSAI SERAKKTATD VFEALGAAEA KIHNVPVETI
HFHEVGAVDA IVDIVCAAVG AEALDVERFV VSPLNVGGGT VKCAHGVFPV PAPATVELLK
GAPVYAGEIQ KELVTPTGAA LVKVLAHSFG QMPAMTIAKS GYGAGSRNFP SHANVLRITV
GEAAAVEESK GDLPLDEVIV LEANIDDLNP QLFGYVAEQA LAAGALDVFA TPVQMKKSRP
GTLLTLLAKP EDAERIARLV FRETSTIGIR TRREQRYVLP RRHETVRTQW GEVRMKIAQI
TGSISNYAPE YEDCRRIAEQ HHVPLKHVMQ EAIRLYLEHT NV