Gene Acid345_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1050 
Symbol 
ID4073137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1316549 
End bp1319182 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content60% 
IMG OID637983057 
Productpeptidase M1, membrane alanine aminopeptidase 
Protein accessionYP_590127 
Protein GI94968079 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0939422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.722063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCCGA AAGGCCTGAT GAAGAAGCTT CTGTTGCTGT TCGCTCTTAT GACTGCGGCG 
ACGTTCTGCT CCGCGCAGCG TCTGCCCGGC AACGTTGTTC CCGATCATTA TTCGCTGAAG
TTCGCGCCCG ATTTTTCCAG CAGCACTTTC CAGGGCGACG AGACCATTGA CGTGCGGGTT
CTCTCCGCAA CCGATGCCAT TGTCCTCAAC GCTCTTGAAC TGGAAATCAA ATCTGCGACA
GTGACGGTTG CCGGCAAGGA GCTCACAGCC AGCGTCACCG CGGATGCCGA GAACGAGACC
GTCACGCTGC ACGTTCCCTC GCAACTCACG GTAGGCTCAG CCACGATCCA CATCGGATAT
ACCGGCCGAC TGAACGACAA GCTCCGCGGT CTCTATCGCA GCGAAGCCAA TAATCGCCGC
TACGCCGTGA GCCAGTTCGA GGCCGTGGAT GCCCGGGTCG CCTTCCCATC CTTCGACGAG
CCTTCTTATA AGGCGACTTT CGACATCACG ACCGTCGTGG ACCAAGGCGA CACTGCCATC
TCGAATGGCC GCATCGTGAG CGATGAGCCC GGTCCGGCAG GGAAGCACAC CATCAAGTTC
TCGACCACGC CGAAGATGTC GAGCTATCTC GTAGCGCTCA CCGTCGGTGA CTGGAAGTGC
ATCTCTGGCG AGCAAGACGG CATCGCGCTT CGCATTTGCT CGGTGCCCGG CAAGGAACAG
CAGGGAGCGT TCGCTCTCGA GGCGACCAAG GCGATTCTGC ATTTCTACAA TCAGTACTTC
GGGATCAAGT ATCCCTATGG CAAGCTCGAC CAGATCGCAG CTCCTGACTT CGAAGCCGGC
GCGATGGAGA ACACCGCCGC CATCGTCTAT CGAGAGAGCG CGCTGCTGCT CGATCCTGCA
AAAGCATCGG TTAACGATCA GAAGGAAATT TCATCGGTCA TCGCGCATGA GATGGCGCAC
CAGTGGTTCG GCGATCTCGT CACCATGAAG TGGTGGAACG ACATCTGGCT GAATGAGGGT
TTTGCCACTT GGATGGAGAG CAAGCCGGTC GCAGCGTGGA AGCCCGAGTG GCAAATCTCG
CAGGACGATG TTCTCGGCTC CAGCTCCGCG CTCAACACCG ACTCAACGCA GAACACCCGC
CCCATCCGCC AGCAAGCGGA AACCCGCAAC GAGATCAACG CGCTGTTCGA CGGCATCGCT
TATGGCAAGA CTGCCGCCGT CCTGCGGATG CTCGAGGGCT ACATCGGTCC AGAAGTCTTC
CGCAAAGGCG TAAACAGCTA TCTGGAAGCG CACAAGTACG GCAACGCTAC CGCCGAAGAT
TTCTGGGGCG CGATGGCGAA GGCCTCAGGT CGACCGATCG ACAAGCTCAT GCCGACGTTC
GTCACGCAGC CCGGAGCGGC ATACATCGCG CTAACGGACA AGTGCGAGAA CAACGAGACT
GTCGGTACGA TCACCCAGCA ACGCTTCTAC TCCTCGCCGA AGCTCATGCA GAGCACGTCT
GACCAACTCT GGCAGGTGCC GGTCTGCAGC AACGAAATCG GCGGAGCAGG AACCGCCAGT
TGCGAATTGC TGACGCAGAA GCAGCAGTCG TTCAAGCTCA AGGGCTGCGG CCATGGCGTT
ATGGGGAACT CGAAGGGTAG TGGCTACTAT CGTTATAGCT TCGATCCCGC CGAGTACCAG
TCGCCGAACT TCAAGATGGA AGAACTGCCT GAAGAAGACC AGGTCTCGCT GGTTGGCAAC
GAAGGCGCAC TGCTCGCCGC CGGGGTTCAC CACATCCAGG ACTTCATGGC CATGACGGAG
AAGTTCCGCG GCGTCCCCAC GTACGGCGCA GTGAGCGAAC TCGCGGACCA CCTGACTTTC
GCCGACCGGC ATCTGGTCTC CGACCAGGAC CGCGAACAAT TCCGGTCGTG GGTGCGCAGC
GTGTTCAAGC CGACTCTCGA GAAGGTCGGC ACGGTTTCGT CTAAATCTGA CACTCCGGGC
CAGCGCAACA CGCGAGCGAC GCTCGTAGAA CTTCTCGGCA ACGTCGGCGA AGATCCCGAT
GCCATCGCCC TCGCGAAGCA GACGGTCAAC GCCTACATGC AGGACCCTGC GTCGGTGGAT
ACCACCCTCG TGGACGCGTC GTTCCCAGTT GCTGCCGCAC ACGGCGATGC CTCGCTCTAC
GACATCTTCC TCGCGAAAAC GAAGCAGGCT TCGTCACCGC AGGACTACTA CCGTTACGTT
CACGCACTTC GCGACTTCCG CGATCCGGTG CTGCTCAAAC GCACCCTCGA GTGGACCCTC
GGTCCCGAAG TTCGCAACCA GGACCTTCGG GGTCTCGTCG GCGTGCTCTC GAATCCCGCC
GGCCAGCAAT TGACCTGGGA TTTCATTCGA CAACGCTGGA GTGACATTCA GAACAAGGCC
GGACAAAGCA TCGTGGGCGC GCAATTGGCG TATTACGCGA TTGGGGTCTT CTGCGATGCG
GAGCATGCGA AGGAAGCGCA GTCCTTCATT GATCAGCACC GCGTTCAGGG TTTGGATCGC
ATCGCGCGCC AGCAGATGGA GCGCGTTGGT CAGTGCATTG ACCTCCGCCA GCGCGAAGAA
CCCAATCTCG CGCGCTTCCT TCAGAAGTCA GGGAACGGCA GCGGGCAGCA CTAG
 
Protein sequence
MHPKGLMKKL LLLFALMTAA TFCSAQRLPG NVVPDHYSLK FAPDFSSSTF QGDETIDVRV 
LSATDAIVLN ALELEIKSAT VTVAGKELTA SVTADAENET VTLHVPSQLT VGSATIHIGY
TGRLNDKLRG LYRSEANNRR YAVSQFEAVD ARVAFPSFDE PSYKATFDIT TVVDQGDTAI
SNGRIVSDEP GPAGKHTIKF STTPKMSSYL VALTVGDWKC ISGEQDGIAL RICSVPGKEQ
QGAFALEATK AILHFYNQYF GIKYPYGKLD QIAAPDFEAG AMENTAAIVY RESALLLDPA
KASVNDQKEI SSVIAHEMAH QWFGDLVTMK WWNDIWLNEG FATWMESKPV AAWKPEWQIS
QDDVLGSSSA LNTDSTQNTR PIRQQAETRN EINALFDGIA YGKTAAVLRM LEGYIGPEVF
RKGVNSYLEA HKYGNATAED FWGAMAKASG RPIDKLMPTF VTQPGAAYIA LTDKCENNET
VGTITQQRFY SSPKLMQSTS DQLWQVPVCS NEIGGAGTAS CELLTQKQQS FKLKGCGHGV
MGNSKGSGYY RYSFDPAEYQ SPNFKMEELP EEDQVSLVGN EGALLAAGVH HIQDFMAMTE
KFRGVPTYGA VSELADHLTF ADRHLVSDQD REQFRSWVRS VFKPTLEKVG TVSSKSDTPG
QRNTRATLVE LLGNVGEDPD AIALAKQTVN AYMQDPASVD TTLVDASFPV AAAHGDASLY
DIFLAKTKQA SSPQDYYRYV HALRDFRDPV LLKRTLEWTL GPEVRNQDLR GLVGVLSNPA
GQQLTWDFIR QRWSDIQNKA GQSIVGAQLA YYAIGVFCDA EHAKEAQSFI DQHRVQGLDR
IARQQMERVG QCIDLRQREE PNLARFLQKS GNGSGQH