Gene Acid345_2371 details


Gene Information

Locus tag: Acid345_2371
Symbol:
ID: 4069183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2801319
End bp: 2803187
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 58%
IMG OID: 637984387
Product: ABC transporter, ATPase subunit
Protein accession: YP_591446
Protein GI: 94969398
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
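
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 2801319
END_BP = 2803187
GENE_LENGTH_BP = 1869
PROTEIN_LENGTH_AA = 622


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is self-consistent: 2803187 - 2801319 + 1 = 1869 bp, and 1869 / 3 - 1 = 622 aa.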


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.684572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAAC ACGAGGAAGA ACAACTAGGC AAGGCGTACG ACAGTCGCTT AATGAAGCGC 
CTGCTCGGCT ATTTGCGTCC ATATCAGTGG CAGGTCTATG TCGCGATCGC GGCCATCATT
CTAAAGGCCG GTGCGGACGT AGTCGGGCCT GTCCTGACCC AAGTCGCGAT TGATAAATAC
CTCGCCAAGA GGTCCGTCTC TCATCTGTTC GACCGCTTCC TGAGCGTGAA CCCGCTCACC
GGTATCGGAC AGATTGCCGG CCTCTACGTC ATTCTGCTCC TCCTGAGTTT CGCCTTCGAA
TACGCGCAAA CATATCTCAT GCAGTGGGTC GGACAGAAAG CCATGTTCGA CATGCGGGCC
GAGATCTTCC GCCATCTCCA GCGCCTGCAC ATTGGCTTCT TCGACAAGAA CCCCGTCGGC
CGCCTCGTGA CCCGCGTCAC CAGCGACGTT GATGCTCTGA ACGAGATGTT CACTTCGGGA
GTGGTCTCGA TCGTTGAGGA CGTCTTCGTG CTCTCCGGGA TCATGTACGT CATGCTCCGG
ATGAACTGGC GCCTGGCCTT GTTGGTCTTC GCCGTGCTGC CAATCATCAC ATGGGCGACC
GGTATATTCC GGCGGGCGGT GCGCGATGCC TATCGCAAGA TTCGCGTAGC CATCGCGCGT
ATTAATTCGT ACCTCCAGGA ACACATAACC GGCATCGTCG TCCTCCAGCT CTTCAACCGT
GAGAAGCGTT CGTACGACAA GTTCGAAGAA GTAAACCGCG CACACATGGA CGCCTTCAAA
GACGCGATCA TGGCGCACGC CGTCTATTAC CCCATCGTCG AGTTCCTGTC GGCAGTGGCA
ATTGCAAGCG TCATCTGGTT CGGTGGCGGA CAATTTCTCC GCAATGCACC CGGAATCAGC
ATCGGCATCC TGGTCGCGTT TATTCAGTAC TCGCAGCGCT TCTTTCGTCC TATCCAGGAT
TTCAGCGAAA AGTACAACAT CGTGCAGCAG GCCATGGCCG CTTGCGAGCG CATCTTCAAG
CTCCTGGACA CCCCGGTTGA GATCGACGAG TTGGCCCAGC CGAAATCCGC TACCGGTTCG
GGCCGCATAG AGTTTGACCA TGTGTGGTTC GCCTATCGCA AGGACCCGAA TAAGAACGAC
GAATGGGATT GGGTTCTGCG CGACGTGAGC TTCACGCTAG AACCCGGGGA AACCGTGGCT
GTCGTCGGCC ACACCGGGGC TGGCAAGACA ACGCTCATTT CCCTGCTCCT GCGCTTTTAC
GACGTGCAGC AAGGCGCGAT CAAGATTGAC GGCATCGACA TTCGCGAAAT GTCGTTAGCC
GACCTTCGCC GCCGTTACGG TGTGGTCCTG CAGGATCCGT TCCTCTTCAC GGGCACCATC
GGCGAGAATA TTCGCCTGGG TACCGAGTGG ATCACCGACG AGCAAATGAT GAAGGCCGCC
GAAGAAGTGA ACGTCGCCGA GTTCATCCGC TCGCAACCGG CGGGATTCGA GCAGGGTGTG
CATGAACGCG GCAGCACGCT ATCCACGGGA CAAAAGCAGC TCATCTCCTT CGCGCGAGCG
TTGGCGCACA ATCCGAAGAT CCTCATCCTC GACGAAGCCA CGTCGAGCGT AGACACCGAG
ACAGAGTTTC GCGTGCGCGC TGCACTGTCT AAGCTCGTCG AAGGACGCAC TTCGCTCATC
ATCGCGCACC GACTTTCGAC TATTCAGCGC GCCGACAAGA TCATCGTGAT GCACAAGGGC
AAGGTCCGCG AGATGGGCAG CCATCAGCAA CTCCTCGCAC AACGAGGCAT CTATTACAAG
CTCTACCAGT TGCAGTACAA GGACCAGGAA TTGCCGCTGG CGCCCACGCT TAGCCCTGCT
ACCGACTAG
 
Protein sequence
MAQHEEEQLG KAYDSRLMKR LLGYLRPYQW QVYVAIAAII LKAGADVVGP VLTQVAIDKY 
LAKRSVSHLF DRFLSVNPLT GIGQIAGLYV ILLLLSFAFE YAQTYLMQWV GQKAMFDMRA
EIFRHLQRLH IGFFDKNPVG RLVTRVTSDV DALNEMFTSG VVSIVEDVFV LSGIMYVMLR
MNWRLALLVF AVLPIITWAT GIFRRAVRDA YRKIRVAIAR INSYLQEHIT GIVVLQLFNR
EKRSYDKFEE VNRAHMDAFK DAIMAHAVYY PIVEFLSAVA IASVIWFGGG QFLRNAPGIS
IGILVAFIQY SQRFFRPIQD FSEKYNIVQQ AMAACERIFK LLDTPVEIDE LAQPKSATGS
GRIEFDHVWF AYRKDPNKND EWDWVLRDVS FTLEPGETVA VVGHTGAGKT TLISLLLRFY
DVQQGAIKID GIDIREMSLA DLRRRYGVVL QDPFLFTGTI GENIRLGTEW ITDEQMMKAA
EEVNVAEFIR SQPAGFEQGV HERGSTLSTG QKQLISFARA LAHNPKILIL DEATSSVDTE
TEFRVRAALS KLVEGRTSLI IAHRLSTIQR ADKIIVMHKG KVREMGSHQQ LLAQRGIYYK
LYQLQYKDQE LPLAPTLSPA TD