Gene Acid345_3440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3440 
Symbol 
ID4070324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4058869 
End bp4060152 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content61% 
IMG OID637985462 
Productdiaminopimelate decarboxylase 
Protein accessionYP_592515 
Protein GI94970467 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACCAT CGCTGCCGTC GCGGCCTCCG GGCTTCGTCT ACAAATCCAA CAAGCTGCAC 
GTGGAGAAGA TCGCCGCATC GGATCTGGCG CGCAGGTTCG GCACGCCGCT CTACACTTAC
TCCGCGACGA CGATCCGCGA GCGTTTCGAC ATCTTCAACC GTGCCTTCGG CAAGGTGCCG
CACACAATCT GCTATTCGGT GAAGGCGAAC TCGAACCTCA CCATCCTCAA AATGCTGGTG
AAGCTCGGCT CCGGCTTCGA TGTCGTCTCC GGTGGCGAAC TTGAGCGAGT ACTGGTGGTC
GACAAGAAAG CGGCGAAGAA GGTCGTCTTC TCCGGGGTTG GCAAGACCGC GCCCGAGATG
GACCTGGCGA TCAAGTCGGA CATTCTGCTG TTCAATATCG AGAGCGAATC TGAGCTGGAG
TTGCTCGCCT CGCGGGCGTC AAAGCTGCGG AAAAAGCCGC GCATCGCGTT CCGCGTGAAT
CCCGACGTCC CGGCAGAAAC GCATCCGTAT ATTTCCACGG GGCTGCGCGA GCACAAGTTC
GGTGTACCGA TCACAGACGC TGCCCTGCTC TATGCGCAGG CGGCATCGAC AGAGTGGCTG
GAGCCTGCCG GCGTCAGCGT GCACATTGGT TCGCAAATCA CAGACGTCGC GCCATTCAAC
GCCACCATGG AACGCGTGGC CGATCTCGTC CGCGGGTTGC GCGCCCAGGG CCACAATATT
CGTCTCATCG ACGGCGGCGG TGGCCTCGGG ATCGCCTACC GCGATGGTGA CACCAGCCTC
AAGGCCTTCG AGGCTTCCGC GAAAAACTAC AGCGCGGCTC TCTTGCCGCC GCTCAACAGC
CTTGGCATTC ATCTTTTGCT GGAACCGGGT CGCGCGATCA TTGGTCCTGC CGGTGCACTG
CTCACCCGGG TGGTCTATTC GAAGAAGAAC GGTCGCAAGA CATTCGTAAT CGTGGATGCG
GGCATGAACG ACCTGATCCG GCCCGCGCTC TACGGCGCTT ACCATGAGAT TGTCGCCGCC
GAGCCGCGCG GCACTGGCGC GGAAACGGTG GACGTGGTCG GTCCCGTCTG TGAAACCGGC
GATTTCTTCG CCCGCGATCG CGAGCTTACG CACCTTGAAG AAGGCGATCT CGTCGCCATC
CTCGACGCCG GTGCCTATGG CATGTCGCAG ACTTCGAACT ACAACACGCG ACCCAGACCA
GCCGAGGTTT TGGTCGAAGG ATCGCGCGCA AAAGTCGTGC GACGACGTGA AACCATCAAG
GACCTGCTGG GCCCCGAGCG GTAG
 
Protein sequence
MKPSLPSRPP GFVYKSNKLH VEKIAASDLA RRFGTPLYTY SATTIRERFD IFNRAFGKVP 
HTICYSVKAN SNLTILKMLV KLGSGFDVVS GGELERVLVV DKKAAKKVVF SGVGKTAPEM
DLAIKSDILL FNIESESELE LLASRASKLR KKPRIAFRVN PDVPAETHPY ISTGLREHKF
GVPITDAALL YAQAASTEWL EPAGVSVHIG SQITDVAPFN ATMERVADLV RGLRAQGHNI
RLIDGGGGLG IAYRDGDTSL KAFEASAKNY SAALLPPLNS LGIHLLLEPG RAIIGPAGAL
LTRVVYSKKN GRKTFVIVDA GMNDLIRPAL YGAYHEIVAA EPRGTGAETV DVVGPVCETG
DFFARDRELT HLEEGDLVAI LDAGAYGMSQ TSNYNTRPRP AEVLVEGSRA KVVRRRETIK
DLLGPER