Gene Acid345_2008 details


Gene Information

Locus tag: Acid345_2008
Symbol:
ID: 4070914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2407604
End bp: 2409529
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 57%
IMG OID: 637984022
Product: ABC transporter, ATPase subunit
Protein accession: YP_591083
Protein GI: 94969035
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
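
The coordinates and lengths listed above are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2407604, 2409529)
print(length)                  # 1926
print(protein_length(length))  # 641
```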


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.198836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGT TGTCTGCCGC CGGAAAGCGT TTCGGCCCCA AGAGTCTGTT TCAAAATCTC 
GACTGGCTGA TCACGCCGCA GGACCGCGTC GGCCTGGTTG GCGCCAACGG CACCGGCAAA
TCCACGCTGC TCAAAGTGCT TGGCGGCATC GAGTCGCTCG ACTACGGCTC GCTGCAATCT
ACCAAGGGCA TCACCCAGGG CTACCTGCCA CAGGATGGCC TGCGGCTGTC CGGACGAAGC
GTATTCGCCG AGTGTCTGAG CGTCTTCGAA GACCTCAAAG CGATGGAGAA GGAAATCGAA
ACGCTCACGC GGCGCATGAG CGAAATCGAT CACGAGTCGA TTGAGTACAA GCAGATCTCC
GATCGCTTCC ACCGCATCGA CGGCGAATTT CGCATCCGCG ATGGATACGC CCTGGAATCG
CAGGTCGGCA CCGTGCTTTC CGGACTCGGT TTCAGCAAAG AGGATTGGCA GCGGGACACC
GGTGAGTTCT CGGGTGGCTG GCAAATGCGC ATTGCGCTGG GGAAACTGCT ACTCGCCAGG
CCGAATCTTC TGCTTCTCGA CGAGCCGACT AACCACCTCG ACCTCGAAAC CCGCAACTGG
CTCGAGGAGT ACCTCACGCA CTATCCATTC GCCTACATCC TGATATCACA CGATCGGTAT
TTCCTCGATG TTACGGTCAA CAAGATCGTT GAGATATGGA ACAAGGAAGT GCACTTCTAC
GCGGGCAATT ACGACAAGTA TCTCGCGCAA AAAGAAATGC GCCGGACGCA GATCGTGAGC
GCCTACAAGA ACCAGCGCGA GCGGATCGAG CAGCTCGAAG CCTTCATCAA CCGATTTCGA
TACCAGGCAA CCAAGGCGAA ACAAGTCCAG AGTCGCATCA AAGAGCTGGA AAAGATCGAG
CGTATCGAAA TTCCGCCGGA TGAAAAGACC ATTCACTTCA CATTCCCGCA GCCGAAGCCC
AGCGGACGTC AGGTGGTCGA GTCCAAGGGG CTGGCGAAGA GCTACGGTGA GAAGCAGGTC
CTCAACAACG TAGATTTCTT TATTGAGCGC GGCGATCGGG TAGCGCTTGT CGGTGTGAAC
GGTGCCGGCA AATCCACGCT TATCAAGCTG CTCGCCGGTC TTGAGCCGCT CACCGGTGGA
GAGTTGCGGC TCGGGCACAA CGTTGAAGTG GACTACTTTG CCCAGGACCA ATACAAGGAA
CTCGATCCGA ACGCGCGCAT GCTCGACGAC ATTGCTGAGA TTTCGCCGCG TTCCACCCAG
ACGGAACTAC GCAGCCTGCT TGGTTGCTTC CTTTTCTCGG AAGAAGATGT CTTCAAGACG
CTGGGCGTGC TCTCCGGCGG CGAGCGTAAT CGTTATGCGC TGGCGCGCAT GCTTCTGCAT
CCGTCGAACT TCCTGCTGCT CGATGAGCCG ACCAACCACC TCGACATGCG AGCCAAGGAC
GTGCTGCTCG AATCGCTGGA GAAGTTCCAG GGAACCGTCG TCTTTGTGTC GCACGACCGC
TACTTTATCG ACAAGCTCGC GACACGAGTC TTCGAAGTTG CCGATGGCGG CGTGCAGGTA
TATCCGGGGA ACTACGAAGA ATACCTGCGC TCGAAGGCCG GCGTCACGAC CGCAGTTGAT
CTCGAGGCGG TAAGAGCTGA GGCCCCCAAA AGCAATGGTG ACGGGGCTAC GACGGCTGGC
GAAAAAGCAA AGCGTCTCAA TCCTATCAAG CTGCGCCAGT ATGAAGAACG CATGCGCGAG
CTCGAAGAAC TTGTCGAACG CACCGAAACC GGCATCGTGG AGTGTGAGAA TTCTCTCGGC
AATTTCGTGA GCGTGGAAGA GACGAAGCGC CAGACGGAGC TGCTCGAACA GCGCCGCGCC
GAGCTGGAAA CTCTGATGGT CGAGTGGGAA CAACTCACGC AGGCTCTCGA AGAAGCGAAG
GCATAG
 
Protein sequence
MIQLSAAGKR FGPKSLFQNL DWLITPQDRV GLVGANGTGK STLLKVLGGI ESLDYGSLQS 
TKGITQGYLP QDGLRLSGRS VFAECLSVFE DLKAMEKEIE TLTRRMSEID HESIEYKQIS
DRFHRIDGEF RIRDGYALES QVGTVLSGLG FSKEDWQRDT GEFSGGWQMR IALGKLLLAR
PNLLLLDEPT NHLDLETRNW LEEYLTHYPF AYILISHDRY FLDVTVNKIV EIWNKEVHFY
AGNYDKYLAQ KEMRRTQIVS AYKNQRERIE QLEAFINRFR YQATKAKQVQ SRIKELEKIE
RIEIPPDEKT IHFTFPQPKP SGRQVVESKG LAKSYGEKQV LNNVDFFIER GDRVALVGVN
GAGKSTLIKL LAGLEPLTGG ELRLGHNVEV DYFAQDQYKE LDPNARMLDD IAEISPRSTQ
TELRSLLGCF LFSEEDVFKT LGVLSGGERN RYALARMLLH PSNFLLLDEP TNHLDMRAKD
VLLESLEKFQ GTVVFVSHDR YFIDKLATRV FEVADGGVQV YPGNYEEYLR SKAGVTTAVD
LEAVRAEAPK SNGDGATTAG EKAKRLNPIK LRQYEERMRE LEELVERTET GIVECENSLG
NFVSVEETKR QTELLEQRRA ELETLMVEWE QLTQALEEAK A
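
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is illustrative only: it assumes the sequence is read 5' to 3' exactly as printed, strips the spacing used in the 10-base groups, and uses the codon assignments of translation table 11, which match the standard code for elongation codons. Table 11 also allows alternative start codons, which are not handled here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence give the first ten residues.
print(translate("ATGATCCAGT TGTCTGCCGC CGGAAAGCGT"))  # MIQLSAAGKR
```

Running `translate` and `gc_content` on the full gene sequence gives values that can be compared with the protein sequence and the GC content reported in the Gene Information section.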