Gene Acid345_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1143 
Symbol 
ID4069914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1423782 
End bp1426862 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content59% 
IMG OID637983153 
Producthypothetical protein 
Protein accessionYP_590220 
Protein GI94968172 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0637346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAGT TCGCCTTCCT GCTGTTGGTG TGCTCATTCT TTCTGGTTGC GACGAATGAA 
GCTGGTGCGA GTGCGATGAC GACTGTCGTT CTGGAAGAGC CGGGATTTCC GACGGCGGAT
GCGGCGGCGC CGGACATGGC GCGATTACAT GCGCTCATAT CCGACGCGAA GTTTGTGGTG
GCGGACCAAC TGGTTGCTTC GTTAGCGGAC AGGGCGACCA CGCTGCTGGT GCTGCCCTAT
GGGTCGGCGT TTCCGGAAGC GGTGTGGCCG GCAATTGATA GCTATTTGTA TCGCGGCGGG
AACCTGTTGG TGATTGGCGG GCGACCGTTT ACGCGTGCGG CGTTTCGCGG AAAATCAGGA
TGGGAGTTGC GCGAGTACAG CGTGCGCTTC GTGCTCGGAC TGAATATCGA TCAATACCAG
GAGACGCCGG GCAGCGACGG AATGCAGTTC GAGGCGAATC CGGACGTGAT GGTGAAGGCG
TCGCAGTTTA GGTGGAAGCG CGGATTTAGC CCGGTGATCC GGCTGTCATC CAGCGATGTT
TATAAGCGGC AGGGATCGGC GGGAGAACTC GACGCGCGGC TCGATGCACT GGCGTGGGGA
CTGCGTGATG GACGAAAGTT CGCAGCGCCG GCAATTGGGA TTGATCATGT GCGCGGCAAG
TGGGGCGGTG GGCGCTGGGT GTTCGTGAAT TCGGAGATCA CGTCATCGGT ATATGCAGGC
GATCTCATTC GCGATCTTGT GGAGTACACG GAGCGCGGAG CGCAGGAGTT CACGGCGCGT
CCGACGCTGC CACTCTATGC GGAAGGCGAG CCGGTGCAGG TGGAATTGAA CTGGTCGGCA
AACTCTTCTT CGGCCTCGTT GAGAGCAGAA GTGTCGATCG CGCCGGAAGA CAAGCCGGAG
CAGAAGGTGG CTCGCACCGC GACACTCGCA AATGGCGGCG CAGTGGTGGA GTTCCCGCCG
GTGCAGGAAA AAGGATTGTA TCGAATTGAA TCACGATTGT TCGATGGCGA TCGAACTGTC
GCGGCGTATC ACTCTGGCTT TTGGATGCGT GACCTCGAGT ATCTTCGCTC GGGGCCGAAA
CTTGGCGTCA ATAAAGACTA CTTTGAACTC GATGGAAGGC CCTTGGCGGT TGTCGGCACG
ACTTACATGG CGAGCGACGT GCAGCGGTTG TTCTTCGATC ATCCGAATGT GTATGTGTGG
GACAAAGAGT TGGGGCAGAT CAGCGGTGCC GGCTTGAACA TGATTCGCAG CGGCTGGTGG
ACGGGCTGGG ACAAGCTGTG CGACGAGACG GGACGTCCGT ATGAGCGGAC GCTGCGAACG
CTGGAAGCCT ATCTGATGAC CGCGCGTAAA CATGGACTGC CGGTGCAGTG GAACTTCCTC
GCATTTCTCC CCGAGGTGCT CGGTGGAGAG AACCCATATC TTGACCCCGT GGCGGTGCGG
CGGCAGAAGG CGTTCTATTC TGGCGTGGCC GCGCGCTTCC ACGATGTGCC TTTCGTGGCG
TGGGACCTGA TTAACGAGCC GAGCATTTCA CAGTTCGTGT GGAAGACGCG GCCGAACCAG
GATTGGATCG AACTGCAGCA GTGGAACGAG TGGTTGAAGC AAAAGTACCC GGATCGCGCT
GCGCTGGCGG ATGCATGGAA CATGCCGCAA CTTGGCGACA CCGCGCCGGT CCCGACAGAG
TCTGAGTTTG CGCCGCGCGC CATGTACGCT GGCCCGAATT CGCTCAAGAT TTACGACTTT
TATGTGTTTG CGCAGGAGAA GTTCGCGGGA TGGGCGCAGC AGATGCGGGA GGCCATCCGC
GCCACCGGCG CACAGCAACC GATCGTGGTG GGACAGGATG AGGGTGGATA CAACGATCGT
CCGAACCCAG CGTTCTTCGG CAACGCCGTG GACTTCACGG CGAACCATTC GTGGTGGGAG
AACGATTCGC TGTTGTGGGA TTCGCTCGTG GCGAAGCAGC CGGGGAAAGC GATGCTGATC
CAGGAGACTG GATTGCAGCG TGAACTGAAC ATGGATCAGA CCGCGCGGCG CACAGTTGAA
AGCGAAGGCG CACTTTTCGA GAGGAAGATG GCGCTCTCGT TCGCGCAGGG AAGCGGTGCG
ATCCAGTGGC TGTGGAACAC CAACACCTAC ATGACCGAAG GCAACGAAGC GCCGATTGGT
GCGTTACGCG GAGATGCGAC GGAAAAACCG GAAGCCACAG TGCTGCGCAA CTTCGGAACG
TTTGTAGCCA AGGCGCGTGA GGTGCTTCGC AATCCGGTGC AGCCGGATGT GGCGATCGTG
ACGTCGCAGG CAATGCAGTT CTCAGTGATC GGCGATGCAC AACTGGAGGC CCAACGGAAA
GCGGTGCGTG CGCTGGCGTA TGCAAACCAC GTGGCGCCGT ATGTAATTGC TGAGAACCAG
ATTGCGAAGC TGGGCAATCC GAAGCTGGTC GTGCTGCCTT CGCCACAGGC GCTGAACGAG
AAGACATGGC AGACGCTGGT GGCGTACGTG AAAAACGGCG GAAGTTTGCT CGTTACAGGT
GGCGTGGGAC GTGACGAACA TTGGCACGTT GTCGATCGTT TTAACGCGCT CGGCATCAAG
GGGGCGACGG AGCCGCTGAC GTACAAAACT GCCTCCGTGA AGCTTGGCGC TAACGAAGTC
CGCATGAGCT TCGATCAGAC CAAGCAGGCG TGGTTGGAGA CGGCGCGCTT TGCGGACGAA
AAGAAAATTA GCGAAGCCTC GTTGGGTCGC GGAAAGATCT ACTGGGTTGC ATATCCGGTA
GAGCTAGCTG AGGGACTCGA TGCAGCGGCG CAGGTTTACA AGTATGCGTT GGCGCAAGCG
GGAGTGCAGC CGCTGTATGG CGTCGAAGGT GTAGTTTCTC CGGGTGTATT GATCTATGCG
ACGGTGCTCG AGGATGCCGT AGCCTACCTA TTCGTGTCGG ATGACGCGGC GGATACGAAT
ATCGCGGTGC GCGATCGCAC GACCGGAGCA CGCTTGCAGG TGACGTTGCC GTCGCAACGG
GCGGCGATTC GGATCATCCG CAAGAAAGAT AAGTCGGTGG TTGCGGAGTA TTCAAACGGC
GGCGTTCTAG AGGATGAGTA A
 
Protein sequence
MMKFAFLLLV CSFFLVATNE AGASAMTTVV LEEPGFPTAD AAAPDMARLH ALISDAKFVV 
ADQLVASLAD RATTLLVLPY GSAFPEAVWP AIDSYLYRGG NLLVIGGRPF TRAAFRGKSG
WELREYSVRF VLGLNIDQYQ ETPGSDGMQF EANPDVMVKA SQFRWKRGFS PVIRLSSSDV
YKRQGSAGEL DARLDALAWG LRDGRKFAAP AIGIDHVRGK WGGGRWVFVN SEITSSVYAG
DLIRDLVEYT ERGAQEFTAR PTLPLYAEGE PVQVELNWSA NSSSASLRAE VSIAPEDKPE
QKVARTATLA NGGAVVEFPP VQEKGLYRIE SRLFDGDRTV AAYHSGFWMR DLEYLRSGPK
LGVNKDYFEL DGRPLAVVGT TYMASDVQRL FFDHPNVYVW DKELGQISGA GLNMIRSGWW
TGWDKLCDET GRPYERTLRT LEAYLMTARK HGLPVQWNFL AFLPEVLGGE NPYLDPVAVR
RQKAFYSGVA ARFHDVPFVA WDLINEPSIS QFVWKTRPNQ DWIELQQWNE WLKQKYPDRA
ALADAWNMPQ LGDTAPVPTE SEFAPRAMYA GPNSLKIYDF YVFAQEKFAG WAQQMREAIR
ATGAQQPIVV GQDEGGYNDR PNPAFFGNAV DFTANHSWWE NDSLLWDSLV AKQPGKAMLI
QETGLQRELN MDQTARRTVE SEGALFERKM ALSFAQGSGA IQWLWNTNTY MTEGNEAPIG
ALRGDATEKP EATVLRNFGT FVAKAREVLR NPVQPDVAIV TSQAMQFSVI GDAQLEAQRK
AVRALAYANH VAPYVIAENQ IAKLGNPKLV VLPSPQALNE KTWQTLVAYV KNGGSLLVTG
GVGRDEHWHV VDRFNALGIK GATEPLTYKT ASVKLGANEV RMSFDQTKQA WLETARFADE
KKISEASLGR GKIYWVAYPV ELAEGLDAAA QVYKYALAQA GVQPLYGVEG VVSPGVLIYA
TVLEDAVAYL FVSDDAADTN IAVRDRTTGA RLQVTLPSQR AAIRIIRKKD KSVVAEYSNG
GVLEDE