Gene Acid345_2966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2966 
Symbol 
ID4068867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3512089 
End bp3513498 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content60% 
IMG OID637984985 
ProductMmgE/PrpD 
Protein accessionYP_592041 
Protein GI94969993 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACTG CTACGAAGCC GAGCGCAAAG ACTGCCGCAC CAACGATTAC CGCCCACATG 
GCCGAATGGG CCGCGGACGT GCAGTTCGAA GACCTTTCCA AAGACGCCGT ATACCAGGCG
AAGCGCTTCC TGCTGGATTC CGTGGGCTGC GCGCTAGGCG GCTATCAGCA GCACGACGTG
AAGATCGCCC TGGAAGTGAT GGCGGAGATC GCGGGGCCGG GATCGTGCAC GGTGATGGGT
ACGGGCGAGC AACTGGATGC AGTGTCGGCA TCGTTGCTTA ACGGGCTGAT GATCCGCTGC
ATGGATTACA ACGACATCTA CTGGAAACAG GATCCGTGCC ATCCATCGGA CATTTTCCCT
GCGGCGCTGG CGGGTGCAGA ACGGGCGGGA ACGGGCGGAC GCGAGTTGAT CGTCGGATTG
GTGCTTGGAC ACGAATTTGA GCAGCGTCTA TGTGAAGCGG CGTTCCCGGG GATTCGTGAG
CGCGGTTGGC ACCATGCGAC GTTGACGGCG TTCGTTTCGC CGATCGTCGC AGGACGGATG
TTGAAGCTCT CGGCAGAGCA GATGCAGCAC GCGATCGGCA TTTCAGCTTC GCCACGTTGC
ACGCTAGGCG CCGTGACGGC CGGCAAGTTG ACGATGATGA AGAACACCGT GGATCCGCTG
GCGACGCAGT CAGGCGTATT CGCGGCGCTG CTGGCGGAGA AGGGCTACAC CGGTCCGGAG
CACGTGATTG ACGGCAAGGA AGGTCTGGTG CACGTCTTCG GTCCTGACTG GAAGCTCAAC
ATCCTGACCG ACGGACTGGG CGAGAGCTGG CGGATTACGC AGTGCGGCAT GAAGGCGTTC
CCGACCGAAG CGCTGACACA CACGCCGATT TCAGCGGTGC TGGGCATCGT GAAAGACCAG
AATCTGAAAC CGGAAGAGAT TCTGCAGGTG CACATTCGGA CGACCGCGCG CGGTGCGGAC
ATCCTGAGCG ATCCGAGCAA GTACGCTCCG CACACGAAAG AGACGGCCGA CCACTCACTG
CCGTATGTGG TTGCTGCGGC AATTGCGGAG AGGCAGGTAA CGCCGCTGCA GTTCGAGATG
AAGAAGATCC TTGATCCGCG GATTCGTGAA CAACTGCACA AGATCGTCGT CGTCGCCGCT
GCGGAAATCG AGAAGTGCTT CCCGGCGTTG CAACGAGTGA TTGTGAAGAT CACGACGACG
GACGGGCGTT CGTTCGAGAA GCAGTTGGAT TATCCGAAGG GCGACCCGCG GAACCCGCTG
ACCGATAAAG ATGTGGAAGA GAAGTTCGAG GCGCTTGCCG GGCCGGTGAT GACCAAGGCC
GCGCGGCAAC GCGTGATTGA TGCGACGTGG AAGCTGGAGT CATTCACGAA CACGACGGAG
TACATGCAGT TGCTGAAGGC GGACCGCTAG
 
Protein sequence
MATATKPSAK TAAPTITAHM AEWAADVQFE DLSKDAVYQA KRFLLDSVGC ALGGYQQHDV 
KIALEVMAEI AGPGSCTVMG TGEQLDAVSA SLLNGLMIRC MDYNDIYWKQ DPCHPSDIFP
AALAGAERAG TGGRELIVGL VLGHEFEQRL CEAAFPGIRE RGWHHATLTA FVSPIVAGRM
LKLSAEQMQH AIGISASPRC TLGAVTAGKL TMMKNTVDPL ATQSGVFAAL LAEKGYTGPE
HVIDGKEGLV HVFGPDWKLN ILTDGLGESW RITQCGMKAF PTEALTHTPI SAVLGIVKDQ
NLKPEEILQV HIRTTARGAD ILSDPSKYAP HTKETADHSL PYVVAAAIAE RQVTPLQFEM
KKILDPRIRE QLHKIVVVAA AEIEKCFPAL QRVIVKITTT DGRSFEKQLD YPKGDPRNPL
TDKDVEEKFE ALAGPVMTKA ARQRVIDATW KLESFTNTTE YMQLLKADR