Gene Acid345_0995 details


Gene Information

Locus tag: Acid345_0995
Symbol:
ID: 4069760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1259183
End bp: 1260346
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 61%
IMG OID: 637983002
Product: aminopeptidase DmpA
Protein accession: YP_590072
Protein GI: 94968024
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
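
The start, end, gene length, and protein length fields above are consistent with one another. The short sketch below reproduces the arithmetic, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1259183
end_bp = 1260346

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1164
print(protein_length_aa)  # 387
```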


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0120323
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAAG TCGTGCTCGC GACTGCTCTC ACTCTCCTAA GCGCCCTCGG CATCGCCCAG 
AACCAAACCT CGAACGAACG GCCCCGCGCC GCTGACCTGG GAATCAACGT CGGCGTGCTT
CCGCGTGGAC CGCTCAACGC CATTACCGAT GTGGATGGAG TGCTGGTTGG CCAGACGACC
ATCATCCGCG GCGACAACAT TCGCACCGGC GTGACCGCGA TCCTGCCGCA CGCAGGCAAT
CTTTATCGCG AGAAGGTGCC CGGCGCGGTG TTCGTGGGGA ACGGCTACGG CAAGATGACT
GGCTCTACAC AGGTAGAGGA ACTCGGCGAG ATCGAAGGGC CGGTGCTGCT GACCAACACA
ACGAGTGTTT CGCAGGTCGC CGATGCGCTG ACCACGTACA TGATCGGCTT GCCCGGGAAC
GAAGACGTGC TCTCGTTCAA TCCGGTGGTC GGTGAAACCA ACGACGGCTA TTTGAACGAC
ATCCGCGGAC GCCACGTTTC GCCGGACGAT GTATTTGCCG CGATCAAGAG CGCAAAGGGC
GGCCCGGTCG CAGAAGGCGC CGTTGGAGCG GGCACCGGAA CCGTGGCGTT CGGATGGAAG
GGCGGCATTG GCACGGCGTC GCGACGATTG CCGGCGAACC TTGGCGGCTA CACCGTCGGG
GTGCTGGTGC AGACGAATTT CGGCGGCGTG CTGACCATCG CAGGCGCACC GGTAGGGCAG
GAACTTGGCC AGTACTACCT GCGCGAAGAG CTGCAAAAGG CCGGTAATGG AAGAGACCGC
GCTGATGGCT CGTGCATGAT CGTCGTAGCC ACCGACGCCC CGATGGACTC GCGCAATTTG
AAGCGCCTCG CAGCACGAGC GATCCTGGGG CTGGCCCGTA CCGGTAGCGC GTCATCGAAC
GGCAGCGGCG ATTATGCGAT CGCGTTCTCC ACCGCTTCGC AGAACCGGAT TCGTACTGCC
GACAAAGCGC TAACGCGGCA CACTGAAACA GTGACCAACG ACACGATGTC ACCGCTGTTC
GAAGCCACGA TTGAAGCGAC CGAGGAAGCG ATTTACAACT CGATGCTGAA GGCCACGACG
ACCACGGGAA ACGGGCATAC GGTAAAGGCG TTGCCGATCG AGGAGACAAA GGGGGTTCTT
AAGAAGTATG GGGTGATTCG ATAA
 
Protein sequence
MHKVVLATAL TLLSALGIAQ NQTSNERPRA ADLGINVGVL PRGPLNAITD VDGVLVGQTT 
IIRGDNIRTG VTAILPHAGN LYREKVPGAV FVGNGYGKMT GSTQVEELGE IEGPVLLTNT
TSVSQVADAL TTYMIGLPGN EDVLSFNPVV GETNDGYLND IRGRHVSPDD VFAAIKSAKG
GPVAEGAVGA GTGTVAFGWK GGIGTASRRL PANLGGYTVG VLVQTNFGGV LTIAGAPVGQ
ELGQYYLREE LQKAGNGRDR ADGSCMIVVA TDAPMDSRNL KRLAARAILG LARTGSASSN
GSGDYAIAFS TASQNRIRTA DKALTRHTET VTNDTMSPLF EATIEATEEA IYNSMLKATT
TTGNGHTVKA LPIEETKGVL KKYGVIR
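
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the sequence is given in coding orientation, strips the spaces that separate the 10-character blocks, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle specially. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCATAAAG TCGTGCTCGC GACTGCTCTC"
print(translate(first_block))
print(gc_fraction(first_block))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and the GC content field.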