Gene Acid345_1522 details


Gene Information

Locus tag: Acid345_1522
Symbol:
ID: 4073010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1858061
End bp: 1859167
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 59%
IMG OID: 637983531
Product: Zn-dependent dipeptidase
Protein accession: YP_590598
Protein GI: 94968550
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
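
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1858061
end_bp = 1859167

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the listed Gene Length
print(protein_length, "aa")   # compare with the listed Protein Length
```

Under these assumptions the computed values agree with the listed 1107 bp and 368 aa.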


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.329262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.479778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGCC GCCAGTTTGT TTCCTTCGCT GCGCAAGCGT CTGCGCTCGC CTTCATCGCT 
CCGCACTCAT TCGCACAAAC TGCGACCCCC GACATCGCCG CACTCTATAA GAACGCCCTT
GTCATCGACA CCCTCTGCGC CCCCTTCGCC ACCGACGACT TCCCGCCCGC CGACAACGCC
CTCCAGCAGG TTCGCGGCTC CGGCTTCACT GCGATCAACA CCACCATCTC AGACCGCACC
TATGAAGGAA CGATCCAGAC GCTCGCCCGA ATTCACTCCT ACGTCGAGCG CTATCCTGAA
CTTTTCTCGA TCGTCATCAA GCGCTCCGAC ATCGACCGTG CCAAGCGTGA GAATAAGGTC
TGCATCATGC TGGGATTCCA ATACACCTCG TTTTTCGAAG AAGATGTTTC TCGCATCGAG
GTTTTTCGCG ATCTCAGTGT GCGCATCATG CAGCTCACCT ACAACCTGCG CAGTACGTTC
GGCGACGGCT GTCTCGAGTC CGAAAACTCA GGGCTGAGCA GGGCCGGACA CGACCTGGTC
AAGAAGATGA ACGCGATCGG TATCGCCGTT GATGCCAGTC ACAGCGGCTA CCGCACTACT
TCCGACGCCA TAGCCGGTTC CGCGAAGCCC ATCCTCATCT CGCATTCGGG ATGTGCCGCG
GTCAGCGCGC ATCCACGCAA CAAGCCAGAT GAAATCCTTA AAGCACTCGC CGATCGCGGC
GGCTATTTCG GTGTCTATCT CATGCCCTAC CTCGTCGCTT CCCCGACCGT CCCGACGCGC
GAACACGTCA TGGCGCATCT GCTTCACGCC ATCAATGTCT GCGGCGCAGA CCACGTCGGC
ATTGGCTCCG ATGGCAGCAT TGAGGCGGTC CATCTCACCG ACGAGCAGAA GAAAGCTTTC
GATGAGGACA TCGCCCGGCG TAAGAAGCTC GGCATCGGCG CACCCGGCGA AGACCGCTAT
CCCTACGTTC CCGACGTCAA CGGTCCCAAC CACATGGAAC TGATCGCCAG CGAATTACAG
AAGCGCGGCC AGCCCAGTTC GGTCATCGAG AAGGTACTCG GCGCGAACTT CTATCGCGTT
ATTGGCGATA TCTGGGGCAC AGCTTAA
 
Protein sequence
MDRRQFVSFA AQASALAFIA PHSFAQTATP DIAALYKNAL VIDTLCAPFA TDDFPPADNA 
LQQVRGSGFT AINTTISDRT YEGTIQTLAR IHSYVERYPE LFSIVIKRSD IDRAKRENKV
CIMLGFQYTS FFEEDVSRIE VFRDLSVRIM QLTYNLRSTF GDGCLESENS GLSRAGHDLV
KKMNAIGIAV DASHSGYRTT SDAIAGSAKP ILISHSGCAA VSAHPRNKPD EILKALADRG
GYFGVYLMPY LVASPTVPTR EHVMAHLLHA INVCGADHVG IGSDGSIEAV HLTDEQKKAF
DEDIARRKKL GIGAPGEDRY PYVPDVNGPN HMELIASELQ KRGQPSSVIE KVLGANFYRV
IGDIWGTA
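
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons are accepted as starts), and it only processes the first and last rows of the gene sequence shown above. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last rows of the gene sequence listed above.
first_block = "ATGGATCGCC GCCAGTTTGT TTCCTTCGCT GCGCAAGCGT CTGCGCTCGC CTTCATCGCT"
last_block = "ATTGGCGATA TCTGGGGCAC AGCTTAA"

print(translate(first_block))  # compare with the start of the protein sequence
print(translate(last_block))   # compare with the end of the protein sequence
```

Passing the complete gene sequence to `translate` and `gc_fraction` would allow a comparison against the full protein sequence and the listed GC content. Splitting by rows works here because each full row holds 60 bases, a multiple of three, so every row starts in frame.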