Gene Acid345_2435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2435 
Symbol 
ID4072869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2875494 
End bp2877488 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content60% 
IMG OID637984451 
Productoligopeptide transporter, OPT 
Protein accessionYP_591510 
Protein GI94969462 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.498464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCGC CGCAAACCAC CCTCCCCACC AGCGCCTACC AAACGCTGAA ACCCGGCGAG 
AAATATCCGC CCATCATCTC GCCGTCGCAG AACATTCCCG AACTCACACC GCGTTCGGTG
ATCTGGGGCA TCGCGCTCTG CGTGCTTTTC ACCGTCGCAT CAGCGTATTC GGGATTGAAG
GTGGGGCAGG TGATGGAGTC GGCAATCCCG ATCTCGATCC TCGCTATAGG CCTTGCGCGC
GCCTACAGCC GGCGTTCCAC CATCCTCGAG AACGTGATCA TGACCGGCAT TGGCGGCGTC
GCCGGATCGG TCGTCGCGGG CGCGATCTTC ACCTTGCCGG CGCTCTACAT CCTGAAGCTT
GATCCACATC CGATGCAAAC GGTCTTCATC TGCCTCGCCG GCGGATGCAT GGGCGTGCTC
TTCGTCATAC CGCTGCGGCG CTACTTCGTC CGCGATATGC ACGGACTGCT TCCCTACCCT
GAAGCGACGG CCATTACCGA GGTGCTCGTC ACCGGCGAAA AGGGTGGTTC GCAGGCGCGG
CTGCTGCTGG AAGCTACGGC AATCTCCGGT GTGTATGACT TCTTCGTGAC GACCTTCAGC
GTGTGGAAGG AATACGTCAA CTTCCAGTTC GTGCCGATTA TGCAAAAGCT GAATGACAAA
GCCCGCATGG TCGTCAGCTT CGATGCCATC AGCTTCATCC TCGGCCTCGG CTACGTAATC
GGCCTGCGCA GCTCGATGAT CCTGTGCGCT GGAGGCATGC TCTCGAACCT CGTGCTGGTG
CCGTTGATCT GGTACGTGGG GAGCCATCTT GATTCTGCCG TCTATCCGGC ACTGATTCCG
ATTTCCAAGA TGACCGCAGT TCAGATCTAT CGCAACTACG TGCGTTTCAT CGGCGTCGGG
ACCATCGCGA CGGCGGGCAT TTTCGGCATC ATCAAGTCGC TACGGATTGT CGCTAGTTCA
TTCGGCATCG CGCTCAAGGT CTTCCAGCGC TCGGAGTCCG TGGCACAACC GGAGCGCACC
GACCGTGATA TCTCCATGGT GACAATCGTT CTCGGCATCG TCGCCAGCGG CATCGGCGTG
GCCGTGTTCA TGGGCTTCCT GAAGCCATCG TGGCTGGTGG TGATGATCGG CATGCTGCTG
ACGGTCGTCT TCTCGTTCTT CTTTACGTCC GTCGCGGCGA ATGCCATCGC GACCACGGCA
CGCAATCCCG TCAGCGGCAT GACGATGTTG ACGATCATCA TTTCGTCGGC AGTGCTGTTG
CAATTCGGGC TCTCAGGTAC CATCGGCATG TTCTTCGTGA TGGCGATTGC CGGCATGGTT
TGCACCGCTT TATCGGTCAG CGGCCAGGCC ATTACTGACT TCAAGGCCGG CTACTGGATC
GGCTCGACCC CGTCGGCGCA GGAGAAGGTG AAGTTCTTTG GCGTGATCGC CGCTGCGATC
GCGGCATCGC TAACTATCGT CATGCTGGCG CGTGGCTTCC AGTTCGGCGA AGCTGCGCTT
GGCGATCTGC GGCCAGTACT GCCGTCACCG CAGGCTTCCA TCATGAAGGC ACTGGTCGAA
GGATTGATGA GCCATCAACC GGTGGCGTAT ATGCTCTTCG GCGTAGGTGC GGTGATCGCC
GTTATCCTCG AGATGCTGGG CATGCCGTCG CTACTCTTCG CGCTCGGCAT GTATTTGCCA
CTGGAACTGA ACACACCCGC GCTGGTGGGC GGCATGATTT CGCACTTCGT GAATAAGCGT
GCCGAAAAGA CCGGCGGCGA TGCCGGCCGC AGCGTGCGCG AGCGTGGCGT TATCATCGCC
TCAGGACTGA TGGCGGGCGG TGCCCTCGGG GGCGTGCTCG GCGCGGCCTT GCGACTGCTG
CCCAAATATC GTGAAGATTT GATTCACACC CCGTTCTTCA ACAACGAGCC GGTGTCGCAA
ACGGTCTCGG CGGTGCTGTT CATCGGCCTC TGTCTTTATA TGTGGTTCCG GTCACTCAGG
AAGGAAGCGG AATAA
 
Protein sequence
MAAPQTTLPT SAYQTLKPGE KYPPIISPSQ NIPELTPRSV IWGIALCVLF TVASAYSGLK 
VGQVMESAIP ISILAIGLAR AYSRRSTILE NVIMTGIGGV AGSVVAGAIF TLPALYILKL
DPHPMQTVFI CLAGGCMGVL FVIPLRRYFV RDMHGLLPYP EATAITEVLV TGEKGGSQAR
LLLEATAISG VYDFFVTTFS VWKEYVNFQF VPIMQKLNDK ARMVVSFDAI SFILGLGYVI
GLRSSMILCA GGMLSNLVLV PLIWYVGSHL DSAVYPALIP ISKMTAVQIY RNYVRFIGVG
TIATAGIFGI IKSLRIVASS FGIALKVFQR SESVAQPERT DRDISMVTIV LGIVASGIGV
AVFMGFLKPS WLVVMIGMLL TVVFSFFFTS VAANAIATTA RNPVSGMTML TIIISSAVLL
QFGLSGTIGM FFVMAIAGMV CTALSVSGQA ITDFKAGYWI GSTPSAQEKV KFFGVIAAAI
AASLTIVMLA RGFQFGEAAL GDLRPVLPSP QASIMKALVE GLMSHQPVAY MLFGVGAVIA
VILEMLGMPS LLFALGMYLP LELNTPALVG GMISHFVNKR AEKTGGDAGR SVRERGVIIA
SGLMAGGALG GVLGAALRLL PKYREDLIHT PFFNNEPVSQ TVSAVLFIGL CLYMWFRSLR
KEAE