Gene Acid345_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2040 
Symbol 
ID4073209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2443875 
End bp2445611 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content61% 
IMG OID637984054 
Productpeptidase M28 
Protein accessionYP_591115 
Protein GI94969067 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.194253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT ACGCATGCCT CGCCACATGT CTCCTTGCAC TTTCATTCCC GCTCTGCGCC 
CAAAAACCCG AAGCCCTTGA CTACAACATG TACCAACGCA TCCGCTCCGA GGGATTCGAC
CACTCGCACA TCATGGAATA CGCCTCCGCC CTCATGGACG GTATCGGCCC GCGTCTCACC
GGCTCACCGA ATCTCAAGCA CGCCAATGAA TGGACGCGCG ACCAGTTCAC CTCCATGGGC
TGCAGCAACG CTCACCTCGA AGACTGGGGC GAGTTCGGAC TGGGCTGGCG CCAGATCAAT
ACCTGGGTGC GCATGTCCGC CCCTGACAAC GCCGTCTTCA TCGCGCAAGC GCTCCCGTGG
TCTCCCGCCA CCAGCGGCCC CATCAACGGC CAGGCCATCT GGATCGAAGC CAAAGACGAA
AAAGATCTCG AGAAGTACAA AGGTAAGCTG ACCGGCAAGA TCATCTTCTT CGGCCCCATG
CGCGACGTGA AGCCCGTCGA GAAGCCGCTC ACCAAACGCA ACGAAGATGC CGACCTCAAG
AAGATCGAAG ACTTCCCTGT CCGAGTCGGT GAACAGCACG AAGACTTCCT CGCCGGCTTC
ATTAAGGAAC TCACCTTCCG CGAAAAGGCC GGCAAGTTCT TCGCGGATGA GCACGCCGCC
GCCATCGTCG TCCCTTCCCG CGATGGCCGC GACAACGGCG GCTCTGGCGG CACCATCTTC
GACGACGGTG GCACCGGCAT GGGCTGGTTT ACCTACCAGC GTGAGCACGC CGAGAAGCTT
CCCATCGTCG TCACCGCCAT CGAAAACTAC GGCCGCGTCT ATCGCCTCTT GAAAGCCAAC
GTCCCCGTCT CCATCGAAAT GGACGTTCGC ACCGAGTTCA CCGGCGACCA CGAACACGGC
TTCGATACCA TCGCTGAAAT CCCCGGCACC GATCCCGCTC TTAAAGATCA AGTCGTGATG
GTCGGCGGCC ACCTCGACTC CTGGGCCTCC GGCACCGGCG CCACCGACAA CGGCGCAGGC
ACCGTCGTCG CTATGGAGGT CATGCGAATC CTCAACGCGC TCCACGTGCA GCCTCGCCGC
ACCATCCGCG TCGCTCTCTG GACTGGTGAG GAAGAAGGCG AGTTCGGCTC CTACGGCTAC
GTCAAAAACC ATTTCGGATT CGCGCCGCTC TCCACCGCCC CCGACCAGCT CGCGCTTCCT
GAATTCGTGC GCAAGCCCGG TGGTCCCATC CAGATCAAGC CCGAGCATGC CAAAATTTCC
GGCTACTTCA ACGTAGATAA CGGTTCCGGC AAAATCCGCG GCATCTACCT CCAGGGCAAC
GCGCAACTGG CGCCGCTCTT CAAGGAGTGG ATCGCGCCTC TCTCTGATCT CGGAGTCAAC
ACCATCTCCG TTCGCAACAC CGGCGGCACC GACCACGAAG CCTTTGACTC GGTCGGCATC
CCCGGCTTCC AGTTCATCCA AGACCCGCTC GACTACAGCT CCCGCACCCA CCACAGCAAC
ATGGATCTCT ACGAGCGTCT CCAACCCGCC GACCTCGCGC AAGCCGCCGT TGTCGAAGCC
ATCTTCGTCT ACAACACCGC CATGCGCGAC CAGATGCTCC CGCGCAAACC GCTCCCCCAC
CCCGAGCTAG ACGAGCCCGG CAAAGCCCCG CTCAAGAACG TAATGCCCGG TGTAGTCGCC
GCCGCCGAAG AACAAAAGAA AGACGCGACA CCAGAAAAGA CGCCCGAGAA AAAGTAG
 
Protein sequence
MKKYACLATC LLALSFPLCA QKPEALDYNM YQRIRSEGFD HSHIMEYASA LMDGIGPRLT 
GSPNLKHANE WTRDQFTSMG CSNAHLEDWG EFGLGWRQIN TWVRMSAPDN AVFIAQALPW
SPATSGPING QAIWIEAKDE KDLEKYKGKL TGKIIFFGPM RDVKPVEKPL TKRNEDADLK
KIEDFPVRVG EQHEDFLAGF IKELTFREKA GKFFADEHAA AIVVPSRDGR DNGGSGGTIF
DDGGTGMGWF TYQREHAEKL PIVVTAIENY GRVYRLLKAN VPVSIEMDVR TEFTGDHEHG
FDTIAEIPGT DPALKDQVVM VGGHLDSWAS GTGATDNGAG TVVAMEVMRI LNALHVQPRR
TIRVALWTGE EEGEFGSYGY VKNHFGFAPL STAPDQLALP EFVRKPGGPI QIKPEHAKIS
GYFNVDNGSG KIRGIYLQGN AQLAPLFKEW IAPLSDLGVN TISVRNTGGT DHEAFDSVGI
PGFQFIQDPL DYSSRTHHSN MDLYERLQPA DLAQAAVVEA IFVYNTAMRD QMLPRKPLPH
PELDEPGKAP LKNVMPGVVA AAEEQKKDAT PEKTPEKK