Gene Acid345_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0934 
Symbol 
ID4070586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1187429 
End bp1188829 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID637982941 
Productpyridoxal-dependent decarboxylase 
Protein accessionYP_590011 
Protein GI94967963 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0076] Glutamate decarboxylase and related PLP-dependent proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000218274 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000132275 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCCCTC TCGACCTCAG CCCCGCAGAT TTCCGCGCCC TTTCCCGCAA AATCTCCGAT 
TTCACCGCCG ATTACCTGGA ACGTCTCCCG AATCTACCTG CCTTCCCGCT AAACGTCTCC
GGCGAGGCGG TAAATGCTCT CTTTTCCGCG GAAGTCCCGA TCGCACCTAT GGGCGAACGC
GCCTTCGATC CGCTGGCGGA CGTATTCGCC TTGTCGAGGC CAAACTCCCC GCGTTTCTTC
GGATACGTCT TCGGTTCCGG CCTCCCGATC GCCGCGCTTG GTGACTTCGC CGCAAGCGTT
TTGAACCAGA ACGTCACCGC CTGGCGCTCC GGTCCAGCCG CCGTGACCAT CGAACGCACC
GTCGTTGGCT GGCTCGCCGA AGCCATCGGT TGTTCTGGGT TTTCCGGCAG CCTCACCGGC
GGAGGCTCAC AAGCCAACCT CATGGCTCTT TGCATGGCCC GCGAAGCGAA AGCGCCCGCC
AACGAAAACG GAGCCCAAGG TGGAGTGATC TATTGCTCCG ACGAAGCTCA CATGTCCATG
CCGAAAGCCG CGATGATGCT CGGCCTTGGT CAGAAGAATG TCCGCCGTAT CCCAGTGAAT
GATCGCTTCC AGATGGACAT CAGTCATCTA CGTGACGCAA TCATGCGTGA TCTCCGGGAA
GGGAATCGTC CCATCGCCGT TGTCGCCAGC GCTGGAACCG TTGCTACCGG CAGTATCGAT
CCTCTGCCCG AGATTGCCGA CATCTGCTCC GAACACAACC TCTGGATGCA CGTGGACGGC
GCCTACGGCG CACTCGCTGC AATGACAGTT CCCGAAAAAT TCGTTGGACT GAATCGTGCT
GACTCGCTCT CCCTCGACCC GCATAAGTGG CTCTACCAGC CTGCGGGTTG CGGATGTCTC
CTCTACCGCG ATCCTGCCGC CGCGCAACGC GCGTTCTCGC ATACCGAAGA CTACGCACGC
TCCCTTTCGA CTGACCCCAT CGAAAGCTTC GCGTTCTTCG AATCGTCCAT GGAACTTTCG
CGGCCGTTTC GCGCGTTGAA GATATGGCTT TCGCTCCGCT ACTTCGGGCT TCAGGCATTC
CAGCAGCGCA TCGCCGAAGA CCTTCGCCTT GCCCGCATTC TCGCCGACTC CGTTTCCGCC
GAGCCGCAAC TCGAACTTCT CGCCCCCGTT GAGCTAAGCG CTGTTTGTTT TCGCTATGTG
AGGAAAAATG CCGATCTCGA CCACCTGAAC CTCGAGATTC TTCAGCGCAT CATTCAACGA
GGGAAGGTCT GCATCTCGAA CGCAACCATT CGTGGCCAGT TCGCTCTCCG CGCCTGCGTC
GTGAATCATC GCAGCACGGA GGAAGACGTT AAGGCTGTCG TAAGTGAGGT CCTACATGCT
GCGAATGAAG TGAGCGGATG A
 
Protein sequence
MAPLDLSPAD FRALSRKISD FTADYLERLP NLPAFPLNVS GEAVNALFSA EVPIAPMGER 
AFDPLADVFA LSRPNSPRFF GYVFGSGLPI AALGDFAASV LNQNVTAWRS GPAAVTIERT
VVGWLAEAIG CSGFSGSLTG GGSQANLMAL CMAREAKAPA NENGAQGGVI YCSDEAHMSM
PKAAMMLGLG QKNVRRIPVN DRFQMDISHL RDAIMRDLRE GNRPIAVVAS AGTVATGSID
PLPEIADICS EHNLWMHVDG AYGALAAMTV PEKFVGLNRA DSLSLDPHKW LYQPAGCGCL
LYRDPAAAQR AFSHTEDYAR SLSTDPIESF AFFESSMELS RPFRALKIWL SLRYFGLQAF
QQRIAEDLRL ARILADSVSA EPQLELLAPV ELSAVCFRYV RKNADLDHLN LEILQRIIQR
GKVCISNATI RGQFALRACV VNHRSTEEDV KAVVSEVLHA ANEVSG