Gene Acid345_4338 details

Gene Information

Locus tag: Acid345_4338
Symbol:
ID: 4071756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5146227
End bp: 5147816
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 59%
IMG OID: 637986371
Product: peptidase M28
Protein accession: YP_593412
Protein GI: 94971364
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
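
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that does not contribute an amino acid; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5146227
end_bp = 5147816

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1590 bp
print(protein_length, "aa")   # 529 aa
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields reported in the record.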


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.387318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.135899
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAG AGATATTGCT CTCTTCTTTA TTGGTAGCGA TTGCGGCGGG TGGACAGCAG 
ACTGGGCCGG TTGCGTTCGA TGGCAAGCAG TGGTGGACAT ACGTCAAGGT ACTTGCGGAC
GACAACATGG AAGGGCGAAA CACCGGCAGC GATGGCGAGA AGCGTGCCGA AGCGTACGTA
GTGGAGCAGG CGAAGGCGAG CGGTCTGCAA GCGGCCGGAA CGAATGGTTT TTATCAGCCG
GTGAAGCTGG TAGAGAGCAA GCTGGATGAG GCGGGTTCAA GCTTTGCACT GGTGAAGGAT
GGAAAGAGTG AGTCGTTGAC GTTGGGCGAT GACTTGACTC TAAGTGCGCG GCTGGACGGC
GGCGACGTGG AAGCACCGCT GGTGTTCGTG GGATACGGGC TGACGATTCC GGAGAAAAAC
TACGATGACC TCGCGGGACT CGACCTGAAG GGGAAGGTCG CGGTGATCTT CAGTGGATCG
CCGGCTTCGA TTCCGACGGA GTTGGCCTCG CATGCGCAGT CTGCCGCCGA ACGGTGGAAG
GCGTTGAAGG CGGCGGGAGT GGTGGGAGTG ATCTCGATCC CGAACCCAAA GGCGATGGAT
ATTCCGTGGG AACGCATTAA GGGGAACCGG CTACAGGCGT CGATGCGGCT GGTGGAACTG
AATGAGACGG CGGACGAGAA GCTGGGGGGA TATTTCAATC CGGCGTCGGC ACAGAAGCTT
TTCGAAGGAT CAGGACATAG CTTCGACGAA ATTGCCGCGC TGGGAGCGAA CCGGGAGGCG
TTGCCGCATT TCGCGCTCAA GGTTTCGGTG AAGGCCAAGA CGACGATCGA ACGCAAGGAG
ATCGAATCGG CAAACGTGGT CGCGAAGCTG GTGGGATCCG ACGCAAAGTT GAAGAACGAG
TATGTGGTGG TGTCGGCGCA TATAGACCAC CTGGGAATGG GAGAGCCGGT GAACGGCGAT
CGCGTATACA ACGGCGCGAT GGACAATGGC TCAGGCAGTG CACTGCTGCT CGACCTGGCG
CGGTCGTTCA AGGAGCATCC GGAGAACCTG AAGCGGTCGG TGCTGTTTGT GTGGGTGACG
GGCGAAGAGA AAGGTCTGCT CGGTTCGCGC TATTTCGGAC TGCATCCGAC GGTATCGCGG
CGGGCGATGG TGGCGGACAT TAATACAGAT ATGTTCCTGC CGATTGAGCC GATGAAAGTG
ATTACAGCGT TCGGATTGAA TGAGACAACG CTGGGCGACG CATTGAAGAA GTTGGCGGGC
GAGCGGAACG TGCAGGTACA ACCCGACCCG CAGCCGCTGC GGAACATTTT TATTCGCAGC
GATCAGTACA GCTTCGTGCG AGTGGGAGTG CCTTCGATCA TGTTCATGGG GGGAAGTCCG
GCCGATCCGG TACTGGAGCA GTGGTTGAAA GAGCGATACC ACGGGCGCAG CGATGACACG
AACCAGCCGG TGGACTTGGT TGCGGCGGGA GAGTTCGAGG CGATATCGCG GGCGCTGGTG
GTGGATGTGG CGAATGCGAG CGCAAAGCCG GAGTGGAAGG CGGAGAGCTT CTTTAAGAGA
TATGTGGCGG AGCCGGATTT GGGTAAGTAG
 
Protein sequence
MRKEILLSSL LVAIAAGGQQ TGPVAFDGKQ WWTYVKVLAD DNMEGRNTGS DGEKRAEAYV 
VEQAKASGLQ AAGTNGFYQP VKLVESKLDE AGSSFALVKD GKSESLTLGD DLTLSARLDG
GDVEAPLVFV GYGLTIPEKN YDDLAGLDLK GKVAVIFSGS PASIPTELAS HAQSAAERWK
ALKAAGVVGV ISIPNPKAMD IPWERIKGNR LQASMRLVEL NETADEKLGG YFNPASAQKL
FEGSGHSFDE IAALGANREA LPHFALKVSV KAKTTIERKE IESANVVAKL VGSDAKLKNE
YVVVSAHIDH LGMGEPVNGD RVYNGAMDNG SGSALLLDLA RSFKEHPENL KRSVLFVWVT
GEEKGLLGSR YFGLHPTVSR RAMVADINTD MFLPIEPMKV ITAFGLNETT LGDALKKLAG
ERNVQVQPDP QPLRNIFIRS DQYSFVRVGV PSIMFMGGSP ADPVLEQWLK ERYHGRSDDT
NQPVDLVAAG EFEAISRALV VDVANASAKP EWKAESFFKR YVAEPDLGK
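
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ only in which codons may act as start codons), and it assumes the pasted sequence is the coding strand in frame, with whitespace used only for grouping. The function names `translate` and `gc_fraction` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# assignments with the standard code; only start-codon usage differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGTAAAG AGATATTGCT CTCTTCTTTA"))  # MRKEILLSSL
```

The ten residues printed match the first block of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence and the GC content field. Note that this sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the initiation position; this gene begins with ATG, so that case does not arise here.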