Gene Acid345_2102 details

Gene Information

Locus tag: Acid345_2102
Symbol:
ID: 4069701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2514108
End bp: 2515646
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 61%
IMG OID: 637984117
Product: leucyl aminopeptidase
Protein accession: YP_591177
Protein GI: 94969129
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
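
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2514108
END_BP = 2515646
GENE_LENGTH_BP = 1539
PROTEIN_LENGTH_AA = 512


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1539
    print(coding_length_aa(GENE_LENGTH_BP))  # 512
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.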


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCC ATCTCTCGAC CCTCGATCCC GCGCAGTTGG AAACCGACGC TCTCATCGTG 
CTCGCCATTG ACGGCGGCGA CAAAGACAAC AACAAGCCGC AACTCCAGGC CAAGTCCGAC
GCATTCGCGA AAGCTGCCGC CGACCTCATC GCCTCAAAAG AGATCACCGG CAAGCTGCTC
GAGATCGCGA CTCTGCATAA GCCCGAGGGC GTGAAGGCCA AGCGCTTGAT CGTCGTGGGG
GTTGGCAAAG CCAAGAGCTT TACGTCGTAC GAATTGCGCA AAGCTGCAGG TGCCGCCGTG
CGCGCACTGA AGAAGAGCGT GAAGTCCGCG GCCATCGTTG CCCCCGAAAA CTGGGGCGGA
GCCGCCGATC CCTCCACGAC CAGCACGTTG ATGTTTGAAC GCGGCGGATT ACCTGAAGCG
GTGAAGGCGA TTGCGGAAGG CGCGGTGGTT GCCAACCCGG ATTACAACTA TTACCACTCC
GATCGCAAGA CCTACGAACT CGACGAGCTC ACCATCCTCG TCCCTGCCAA TGGCCACGCC
AACGATCTTG AAGCGGCCAT GAAGGAAGGC CATGTCATCG GCGAGTCGCA AAACTTCACT
CGCGATCTCG TCAACGAGCC CGGCAACCGC ATGACGCCAA CCATCCTCGG CCAGCGCGCC
AAGAAGATGG CGGAAGAAGT CGGCATCCAG TGCGATGTCT ACAGCACCGA CTTCCTGCAC
GAAAAGAAGA TGGGCGCGTT CTGGAGCGTC TCGCAAGGTT CCGAGGAACC GCCCGCGCTG
ATCGTGATGA AGTACGAGCC AGCAGGCGCG CCGCAATCGC CGGTTCTCGG TCTCGTCGGT
AAAGGCATTA CCTTCGACAC CGGCGGCATC TCCATCAAGC CCGCCGACGG CATGGAGAAG
ATGAAGTACG ACATGGCCGG CGGTGCCGCG ATGATCGGCG CTATGCGCGC CATCGCGCTG
CTCAAGCCGA ATGTGCGCGT GATCGGCGTC GTTTGCGCGG CCGAGAACAT GCCCAGTGGC
AAAGCGCAGA AGCCCGGCGA TGTCCAGATC GCGATGAGCG GCAAATCCAT CGAGATCATC
AACACCGACG CTGAGGGCCG CCTCGTCCTC GCCGACGGCC TGCACTACGC CAAACAACTC
GGCGCAACTC ATCTCATCGA CGCCGCCACC CTCACCGGCG CCTGCATGGT CGCCCTCGGC
GGCATCAACG CCGGTGTCTT CGCCAACGAC GAAGACTACT TCAACCGCTT CGCCGAGGCC
TTGAAGAAAT CGGGCGAGAA GATGTGGCGG CTGCCCATCG ACGACGACTA CAAAGAACTC
ATCAAGTCGC CCATCGCCGA CATCAAGAAC ACCGGCGGCC GTTACGGCGG CGCCATCACG
GCAGCAATGT TCCTGAAAGA ATTCGTCGGC GAAACCCCAT GGATCCACCT CGACATCGCC
GGCGTTGCCT GGCAGGAAGA AGCGGTGCCG TTCCTCGCGA AAGGGCCGTC GGGGATTGCG
GTGCGATCGA TTATTGAGTT GGTGCAGAGC TTCGGATAG
 
Protein sequence
MKIHLSTLDP AQLETDALIV LAIDGGDKDN NKPQLQAKSD AFAKAAADLI ASKEITGKLL 
EIATLHKPEG VKAKRLIVVG VGKAKSFTSY ELRKAAGAAV RALKKSVKSA AIVAPENWGG
AADPSTTSTL MFERGGLPEA VKAIAEGAVV ANPDYNYYHS DRKTYELDEL TILVPANGHA
NDLEAAMKEG HVIGESQNFT RDLVNEPGNR MTPTILGQRA KKMAEEVGIQ CDVYSTDFLH
EKKMGAFWSV SQGSEEPPAL IVMKYEPAGA PQSPVLGLVG KGITFDTGGI SIKPADGMEK
MKYDMAGGAA MIGAMRAIAL LKPNVRVIGV VCAAENMPSG KAQKPGDVQI AMSGKSIEII
NTDAEGRLVL ADGLHYAKQL GATHLIDAAT LTGACMVALG GINAGVFAND EDYFNRFAEA
LKKSGEKMWR LPIDDDYKEL IKSPIADIKN TGGRYGGAIT AAMFLKEFVG ETPWIHLDIA
GVAWQEEAVP FLAKGPSGIA VRSIIELVQS FG
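
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the method used to produce the record: the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for all internal codons (table 11 differs only in the start codons it permits, which does not matter here because the gene begins with ATG).

```python
# Standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGAAAATCC ATCTCTCGAC CCTCGATCCC"))  # MKIHLSTLDP
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field listed in this record.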