Gene Acid345_0237 details


Gene Information

Locus tag: Acid345_0237
Symbol:
ID: 4073087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 249625
End bp: 250791
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 62%
IMG OID: 637982238
Product: peptidase M20
Protein accession: YP_589316
Protein GI: 94967268
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
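
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 249625
end_bp = 250791

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1167 388
```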


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAAA GACCTTCGTC TGACCCTAGA ACCTCGCATC GTGACCTGTT TGCCCACTTG 
AAAGACCAGC AATCGGCCAT GCAATCCATG CTCCGCCGCA TCGTGGAACA CGAGTCCCCG
AGCGACGATA AACCTTCCTG CGACGCCATG GCCGAAATGC TAGCGCAGGA TTTCGGCGCC
ATCGGCGGCC GCACGAAACT CCATCGCGAC AAGATCACCG GCAATCACCT GCAAGTGGAT
TTCGGCGGAC CGCGCGGCGC GAAGCCGGTG CTGATCGTCG GACATTACGA CACGGTCTAT
CCGCTCGGCA CTCTGAAGCA GATGCCGTAC CGCGAGAGCA AAGGTCGCAT CAGCGGTCCA
GGTGTGCTCG ACATGAAGGG CGGCATCGTC CAAATTCACG GCGCCATCGC CGCACTGCAG
ACCCAAGGCG GGTTGCCGCG ACCAGTGACG ATCATCCTCG TCTCCGACGA GGAGACCGGC
AGCGAGAGTT CCCGGTCGAT CACCGAGAAG CTCGCGAAAC AATCCGCAGC AGCGTTTGTC
TGTGAGCCGG CCGCTGGCAC CGACGGCGCG CTGAAGACCG CGCGCAAAGG CGTCGGCGAC
TATTTGTTGC GTATCACTGG CGTGGCTTCC CACTCGGGAC TCGATTTCGA GAAGGGCCAG
AGCGCGGTGC TGGAACTTGC GCGACAGATT GAGAAAATCG CCGGCTTCAC CGATCTCAAG
CGGGGAACCA CGGTCAATCC CGGCGTCATT CGCGGTGGAA CCCGCAGCAA CGTGATCGCT
GCGTCGGCGG AAGCGGAAAT CGACGTGCGC GTGAGCACCA AGCGCGATGT CGAACGCGTT
TCGAAGCTGT TTGCACGGTT GAAACCGATC AATAGGCGCT GCTCGCTAGC TCTTTCCGGC
GGAGTCAACC GTCCGCCGAT GGAGCGCACT CCGGGCACCG CAGCTTTATT TGCGCAGGCG
CAGCTGATCG CTGCGGAACT CAGCTTTTCC CTAGCGGAGA GAATGGTGGG GGGCGGTTCC
GACGGCAACT TCACCGGGGC TATCGTGCCG ACTTTAGACG GGTTAGGCGC GGTCGGCGAC
GGAGCCCACG CGGTTCACGA ATATATTTTC GCCACCGAAA TGCCGAAGCG CGCAGCACTG
CTGGCCGGAT TAATTCGGGC GTTATAG
 
Protein sequence
MPQRPSSDPR TSHRDLFAHL KDQQSAMQSM LRRIVEHESP SDDKPSCDAM AEMLAQDFGA 
IGGRTKLHRD KITGNHLQVD FGGPRGAKPV LIVGHYDTVY PLGTLKQMPY RESKGRISGP
GVLDMKGGIV QIHGAIAALQ TQGGLPRPVT IILVSDEETG SESSRSITEK LAKQSAAAFV
CEPAAGTDGA LKTARKGVGD YLLRITGVAS HSGLDFEKGQ SAVLELARQI EKIAGFTDLK
RGTTVNPGVI RGGTRSNVIA ASAEAEIDVR VSTKRDVERV SKLFARLKPI NRRCSLALSG
GVNRPPMERT PGTAALFAQA QLIAAELSFS LAERMVGGGS DGNFTGAIVP TLDGLGAVGD
GAHAVHEYIF ATEMPKRAAL LAGLIRAL
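
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below builds the codon table for translation table 11 from the standard NCBI amino-acid string, translates the first 30 bases of the gene sequence, and defines a GC-percentage helper. The function names (`clean`, `translate`, `gc_percent`) are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that case does not arise here.

```python
# Amino-acid string for NCBI translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCCCCAAA GACCTTCGTC TGACCCTAGA"
print(translate(first_blocks))  # MPQRPSSDPR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content field listed above.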