Gene Acid345_1988 details


Gene Information

Locus tag: Acid345_1988
Symbol:
ID: 4069375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2381467
End bp: 2382984
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 57%
IMG OID: 637984002
Product: ABC peptide transporter, periplasmic ligand binding protein
Protein accession: YP_591063
Protein GI: 94969015
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
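
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2381467
end_bp = 2382984
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so a CDS of N codons
# encodes N - 1 residues.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinates agree with gene length
print(gene_length_bp % 3 == 0)        # length is a whole number of codons
print(residues == protein_length_aa)  # codon count agrees with protein length
```

Under these assumptions all three checks hold for this record.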


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.693495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.584075
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAGC TGCTCTTCGC GCTGGTTGCT ATGCTGCTGT TTTCTCTTTC CTGTGCGCTC 
AGCAAGCCCG ACCCCAACAC GCTAGTGATG ATCATCGAGA GCAGCCCAAC GAATCTCGAT
CCCCGCATCG GCCTCGACGC GCAATCCGAA CGCGTCGACG AACTTCTTTT CGACGCCCTC
GTCCGTCGCG ACGAACACTT CAACCTCCAA CCCTCTCTCG CCGAACGATG GGAAATCCCA
GACCCGCTGA CGTACATCTT TCATCTGCGA CACGATGTCC GCTTCCACAA CGGTCAATTG
CTCACTTCAC GGGATGTGAA GTGGACTCTC GACTCTCTCA TTTCCGGCAA GCTCATCAGC
CCCAAGGCCA GCACGTACAA CAGCATCTCG TCCGTCGAAG CGCCTGATGA CTGGACCGTC
ATCCTTCATC TCAAGCAAGC ATATTCGTCC TTGCTTTGGA ACCTGAGCGA CGGCGCCTTC
GGGGTCGTCC CCAACGGCAG CACCGGCAGC TTCTCGCAAC AGCCGATTGG TTCTGGTCCG
TTCAAGTTCG TCTCAGCGCG CCAGGACCGC GATCTGGTGA TCGAGCGCAA CGATCAGTAC
TGGGGCGAGA AACCTCACCT GCAGCGCGTG CGCTTCATCG TGGTGCCCGA CGAAACCACG
CAGGCCCTCG AACTCCGTAA AGGCAGCGCC GATGCTGTGA TCAACGCTCT GAAAGCCGAC
ACCGTTCGCG TGCTCCAGAA CGAAAAGAGC ATCCAGATCG ACGTCTCACC GGGCAGCAAC
TACGCCTACC TCGCCTTCAA CATGCGCGAC CCGATCCTCA AAGACGTGCG CGTGCGGCAG
GCAATCGCAT ATGCCATTGA TCGCCAGGAG ATCATCCACT ATCTGATGAA CGACATGGCG
AAGCCTGCGA ACAGCGTGCT TCCGCCGCAA AGCTGGGCCA GCAATCCGGG CGCGAAGGGT
TATCCCCATG ATCCTGACGC GGCCAATCGC CTGCTGGATA GCGCTGGCTA TATGAAGAAA
AACGGCAACC GTTTTCAATT GGCGATGAAA ACATCCACCG ATGGCGCCAC TCGTGTGCTC
GCCGCCGCGT TGCAACAACA GATGCGAGCG ATCGGCATTG ATCTGCAGAT CCGCACCTTT
GAGTTCGCCA CTTTCTATTC CGACGTAACC AAGGGCGCCT ATCAGATCCA CTCGTTGCGA
TGGATCGGCG GTAATCAAGA CCCCGACATC TTCGAGAGCA TCTTCGCCAC CGATAGTTTC
GCGCCGAAGC GCCAGAACCG CACCTTCTAC TCCAATCCTC GCGTGGACGA ACTCATCAAG
ATCGGTCGCA GCACGGTGAA GCAAGAGGAT CGCAAGCGCG CCTATTTCGA ACTGCAAGAG
ATCGTGAATC GCGATGTCCC TTACGTGAAC CTCTGGTACT TCGACAACGT CCTCGTCCAT
ACAGTTCGAG TGCGCAACCT GCATGTCTCT GCGGCCGGCA ATTACGATTT TCTGCTGACG
GCCGAACTCG CAAATTAG
 
Protein sequence
MRKLLFALVA MLLFSLSCAL SKPDPNTLVM IIESSPTNLD PRIGLDAQSE RVDELLFDAL 
VRRDEHFNLQ PSLAERWEIP DPLTYIFHLR HDVRFHNGQL LTSRDVKWTL DSLISGKLIS
PKASTYNSIS SVEAPDDWTV ILHLKQAYSS LLWNLSDGAF GVVPNGSTGS FSQQPIGSGP
FKFVSARQDR DLVIERNDQY WGEKPHLQRV RFIVVPDETT QALELRKGSA DAVINALKAD
TVRVLQNEKS IQIDVSPGSN YAYLAFNMRD PILKDVRVRQ AIAYAIDRQE IIHYLMNDMA
KPANSVLPPQ SWASNPGAKG YPHDPDAANR LLDSAGYMKK NGNRFQLAMK TSTDGATRVL
AAALQQQMRA IGIDLQIRTF EFATFYSDVT KGAYQIHSLR WIGGNQDPDI FESIFATDSF
APKRQNRTFY SNPRVDELIK IGRSTVKQED RKRAYFELQE IVNRDVPYVN LWYFDNVLVH
TVRVRNLHVS AAGNYDFLLT AELAN
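
The gene and protein sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The sketch below shows one way to strip the grouping and translate the DNA codon by codon. It relies on the fact that translation table 11 assigns the same amino acids as the standard code (the tables differ in permitted start codons); the helper names `clean` and `translate` are illustrative assumptions, and only the first three and last two blocks of the gene sequence are used as input.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same assignments as the standard code; it differs only
# in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGCGTAAGC TGCTCTTCGC GCTGGTTGCT"
last_blocks = "GCCGAACTCG CAAATTAG"

print(translate(first_blocks))  # MRKLLFALVA
print(translate(last_blocks))   # AELAN
```

Both results match the first block and the final five residues of the protein sequence listed in the record. Passing the full gene sequence through the same function is a way to compare it with the full protein sequence.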