Gene Acid345_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1504 
Symbol 
ID4069251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1835543 
End bp1837003 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID637983513 
Productpeptidase M20C, Xaa-His dipeptidase 
Protein accessionYP_590580 
Protein GI94968532 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.774685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTT CTACTGCCGT GGCGGTAGCC GAACTGGAAC CGAAAGCCAT CTGGAAGCAC 
TTCGAGGCGC TGACCAAGAT TCCTCGCCCC TCGACGAAGG AAGATGCCGC CCGCAAGTAC
GTGATCGGTA TCGCCGAAAA GCACGGTCTG AAACATGTGA TGGATGCGGC GGGGAACCTC
GTGGTGAGCA AGCCGGCGAC GAAGGGCCGC GAACATGCCC CGATGGCCGC GCTGCAAGGG
CACCTCGACA TGGTGTGCGA GAAGAACGAA GGCACGCACT TTGACTTCGA TAAGGACGCG
ATCAAAGTCA TCCGCGGCGA AGATTGGCTC TACGCCGACG GCACCACGCT CGGTTCCGAC
AACGGCGTAG GCGTCGCGAC TGCGCTGGCT GTGATGGAGA GCAAGGACAT CGCGCATGGG
CCGCTGGAAT TCGTGTTCAC GATTGACGAA GAGACCGGTT TGACCGGCGC TTCGGACTTC
CAGCCGGGGC TGCTGAAGTC GAAGTACTTC CTCAACCTCG ACGGCGAAGA GAAGGGAACG
CTTTGCATTG GTTGCGCGGG CGGTGTGAAC ACTCCGGCGC ACCGCAAGGT GACGAAGAAA
GCCGCACCCG CCGGTACGGC GCTGCGCGTG AAAGTTTCCG GACTGAAGGG CGGCCACTCC
GGCGTGGACA TCCACCTTGG ACGCGGCAAC GCGGTGCGCA TTCTGGGGCG CGTGTTGGAA
ACTCTGCTGC GCGACGAACA CGCGAATCTC GCCGACATCA AGGGCGGCAG CGCGCACAAC
GCGATTCCGC GCGAGGCCTA CGCGGTTGTA GTGATCGATC CGAAGCGCGA AGAAGAAGTG
AAGACCGTGG TGGCACGCAT CGCGGAGGAC GTGAAGGCGG AGTTGGGCGC GTTCGATCCC
GACGTGAAGA TCTCGGTGGA AAACGTGGCG GCACCGAAGG AGATCATGGA GCACGCGGAC
GCGACGAAAG TTGCGGACCT GCTGGCGACG GTCCATCACG GCGTGCTTGC AATGAGCCCG
GACATCAAGG GGCTGGTGCA GACGTCCACG AATCTTGCGA CGGTTTCCCT GAATGGCGAT
ACGGTTGAGG TTGTCACCAG CCAGCGCAGT TCGATCGAGA GCAGCAAGAA CGCGATTGCA
CGCATGGTCG CTGCGCTCTG CAAAAACACG GGATTCCACG CGGAGCACAC CACGGGGTAT
CCGGGATGGA AGCCGGAGCC GAACAGCGAC ATCGTAAAGA TCTCGCGCAA GGTGCACGAG
GAGGTCCTCG GCAAGGACCC GGAACTGGTA GCGATGCATG CTGGCCTGGA GTGCGGCGTG
ATCGGCGAAA AGCACCACGG CATGCAGATG ATTTCGTTCG GGCCGCAGAT CGAGAACCCG
CACAGCCCGA ATGAACGCGT CCAGATTTCC TCGGTTGAGA GCTTCTGGAA GTTCCTGCGC
GTGCTGCTCG AGCGGATTTA G
 
Protein sequence
MSSSTAVAVA ELEPKAIWKH FEALTKIPRP STKEDAARKY VIGIAEKHGL KHVMDAAGNL 
VVSKPATKGR EHAPMAALQG HLDMVCEKNE GTHFDFDKDA IKVIRGEDWL YADGTTLGSD
NGVGVATALA VMESKDIAHG PLEFVFTIDE ETGLTGASDF QPGLLKSKYF LNLDGEEKGT
LCIGCAGGVN TPAHRKVTKK AAPAGTALRV KVSGLKGGHS GVDIHLGRGN AVRILGRVLE
TLLRDEHANL ADIKGGSAHN AIPREAYAVV VIDPKREEEV KTVVARIAED VKAELGAFDP
DVKISVENVA APKEIMEHAD ATKVADLLAT VHHGVLAMSP DIKGLVQTST NLATVSLNGD
TVEVVTSQRS SIESSKNAIA RMVAALCKNT GFHAEHTTGY PGWKPEPNSD IVKISRKVHE
EVLGKDPELV AMHAGLECGV IGEKHHGMQM ISFGPQIENP HSPNERVQIS SVESFWKFLR
VLLERI