Gene Acid345_4012 details

Gene Information

Locus tag: Acid345_4012
Symbol:
ID: 4071148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4739525
End bp: 4741330
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 58%
IMG OID: 637986039
Product: hypothetical protein
Protein accession: YP_593086
Protein GI: 94971038
COG category:
COG ID:
TIGRFAM ID: [TIGR03436] VWFA-related Acidobacterial domain
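
The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4739525, 4741330)
print(length)                           # 1806
print(expected_protein_length(length))  # 601
```

Both results match the Gene Length and Protein Length fields of this record.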


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.679014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.800681
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCACC GATGTCTCGT TTGGTTTAGT TCCCTCCTCG TCATTCCTTT CGCTTATTCC 
CAAACTGCTG CGCCCAATGC GAGCCCTGCG CCCACGTTCG AATCCAAAGT GCGTGCCGTG
CTGGTGGACG TGGTCGTAAT TGGCGGGAAG GGGGAGCCAG TTCTCGGCTT GCACAAGCAG
GACTTTCGGG TGACGGAAGA CGGCAAGCCG CAGACCATTT CCTCGTTCGA GGAGCATACG
GGCGTACCGC CGACTGAGGT CAAACTGCCC CCGATGCCGC CCGGGGTGTT TACAAACTTT
CCTGCGCTGC TGAAGGCCGA TACGGTGAAT GTGCTGCTCG TGGACGCGCT GAACACGCAG
ACGCAAGACC AGTCCTTCAG TCGTTCGCAG ATGATCAAGT ACTTGAAGAC GATTCCGCCG
GGCGCACACA TCGCGATTTT TACGTTGACG TCGCAACTGC GGATGCTTCA GGAATTCACG
ACCGATTCTT CGGTATTGTT GGCGGCGCTG AATGACCCTG CGGTCGCTGG TCCCCATCAG
TCGCCGTTAC TCCAGTCCCA AGTGGAAAAG GACGCTTACG AGCGGCTGGG AGCAATGGTC
GCCCTCGCAC CGAGCGCACC GATGCAGAAC TTGGCCAAGG AGGCGGTCAA CCCGGCGCTC
GCAGTGAAGC AGATGCTGGA GGAAACCGTT GTGCGGATCA CCGAGTCGCG GGTCCAGATC
ACGCTGCGAG CAATGCAGCA ACTGGGCCGC TATCTCGGGA GCTTTCCGGG CCGCAAAAAT
GTGATCTGGA TCTCCGGCTC GTTTCCCATC AACTTTATGG CGGATCCCAG TTTGCCGGAT
CCGAACGCCG TGGTACGGGG ATTTCAAGGC GAAATTCAAA GGACGGCTGA CCTTCTTACC
GCGGCGCAGG TGGCAGTTTA TCCGGTCGGG GCGGCAGGCC TGAGAGTGGA CGCGCTCTAC
CAAGCCAACG CAAAAGAGAT TGGGTTTTAC AGCACGGGCG GGTTCGTTCA GGACCAGGTG
CAAGGGCTGC ATGCGGGAAT TGACGAGCGG GCTGGCAACG ATCTGACGAT GGAGGAGATG
GCCAAGGACA CCGGAGGTCA GGCTTTCTAC AACAGCAACG GGATTAACGA TGTTCTGACC
CGTATTACGA ACAACGGTAT GCGCTACTAC GAGATCAGCT ATACGTCGAC CAACACGAAG
GTGGACGGGA GTTACCGGCA TATCTCCGTG GAGCTGCTCA AAGGAAAGCA CAAGCTCTCT
TATCGTCGCG GATACTACGC GCTGGATGCT GCGGCCGTTC GGCAATCGGA ACTTGAGGCC
GCACCCGATC CTTTGCTGCC CCTGGTGGGA TTCGCCGTGC CTGATGTTGC GCAGATCCTC
TATAAGCTGC GCGTGTTGCC GTCGAGCCCG CAACCGGCAG TTGATGCTAC CCCTGCGGGG
AGCAACCGCG ACTTGAAAGG GCCAGTGACA CGCTACGACG TTGACTTTGC CGTCGCGCCG
GATGACCTCA AGTACGACAT TGGTCCGGAC GGTACTCGGC ACGGCGACGT CGAAGTGAAA
CTTGTCGCAT ATGATTCCAG CGGGAAGCCC GTGAACATGG TGAGTGGGAG GAAGGCGATG
TCCCTGGATC CGCAAACGTA CGCTACTTTG CAGAAGGTGG GGCTTCAGAT CCACGAACAG
ATCGATGTTC CGAGCAAGGG CGATTTTCAC CTTCGCACAG GCATTTACGA TTTGAAGTCG
AGCAACGCCG GAACCCTCGG AATCAAAATG AAAGATGTGG CCGCGTCGCA GCAAGCGACG
AAGTAG
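
The gene sequence above is laid out in blocks of 10 bases, so it needs light cleaning before any analysis. The sketch below shows one possible way to strip the layout, compute a GC fraction comparable to the GC content field, and check that the sequence ends in a stop codon of translation table 11 (TAA, TAG, or TGA). The helper names are hypothetical, and the short string at the bottom is a toy input, not this record's sequence.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same blocked layout as the record.
toy = clean_sequence("ATGTCCCACC GATGTAG")
print(toy)                           # ATGTCCCACCGATGTAG
print(looks_like_complete_cds(toy))  # False: 17 bases is not a multiple of 3
```

Pasting the full gene sequence of this record into `clean_sequence` would allow the same checks to be run against the Gene Length and GC content fields.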
 
Protein sequence
MSHRCLVWFS SLLVIPFAYS QTAAPNASPA PTFESKVRAV LVDVVVIGGK GEPVLGLHKQ 
DFRVTEDGKP QTISSFEEHT GVPPTEVKLP PMPPGVFTNF PALLKADTVN VLLVDALNTQ
TQDQSFSRSQ MIKYLKTIPP GAHIAIFTLT SQLRMLQEFT TDSSVLLAAL NDPAVAGPHQ
SPLLQSQVEK DAYERLGAMV ALAPSAPMQN LAKEAVNPAL AVKQMLEETV VRITESRVQI
TLRAMQQLGR YLGSFPGRKN VIWISGSFPI NFMADPSLPD PNAVVRGFQG EIQRTADLLT
AAQVAVYPVG AAGLRVDALY QANAKEIGFY STGGFVQDQV QGLHAGIDER AGNDLTMEEM
AKDTGGQAFY NSNGINDVLT RITNNGMRYY EISYTSTNTK VDGSYRHISV ELLKGKHKLS
YRRGYYALDA AAVRQSELEA APDPLLPLVG FAVPDVAQIL YKLRVLPSSP QPAVDATPAG
SNRDLKGPVT RYDVDFAVAP DDLKYDIGPD GTRHGDVEVK LVAYDSSGKP VNMVSGRKAM
SLDPQTYATL QKVGLQIHEQ IDVPSKGDFH LRTGIYDLKS SNAGTLGIKM KDVAASQQAT
K