Gene Acid345_0693 details


Gene Information

Locus tag: Acid345_0693
Symbol:
ID: 4071338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 854692
End bp: 855747
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 62%
IMG OID: 637982699
Product: uroporphyrinogen decarboxylase
Protein accession: YP_589772
Protein GI: 94967724
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
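
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 854692
END_BP = 855747
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus the stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1056
print(expected_protein_length(GENE_LENGTH_BP))  # 351
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.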


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCC CTAACTCCCT CTTTGTCCGC GCCTGCAAGC GTCAGCCCGT TGAGCGCACC 
CCCGTTTGGT TTATGCGCCA GGCCGGACGC TATATGTCCG AATACCGCGC CGTCCGTGAG
AAGTATTCGC TGGTCGAAAT CTGCAAGAAG CCTGACGTCG CCGCCGAAGT GACCATCACC
GCCGCCGAAG CCCTCAACGT GGACGCGGCC ATCATCTTCG CCGACTTGCT TCTTCCTCTC
GAAGTCATGG GACTTCCGTT CCACTTCTCG CCCGGCGAAG GCCCGGTGAT CGAGGTGCCG
ATCCGCGATG CCGCGCACAT CACGCGCCTC CGCACCGATC GCGCCCACGA TCTTGGCTAC
GTCGCGGAAG CCGTGAAGAA AGTCAGCGAT CACTTCGGCT CGAAGCTGCC GGTCATCGGC
TTCTGCGGTG CGCCATTCAC TCTAGCCAGC TACATGATTG AAGGCGGCGG CTCGCGCAAC
TACCTCGAAG TCAAAAAACT GATGTACAAC TCGCCCCACG CGTGGGACGA GCTTCTCGGC
AAACTTGTCG CCGTCCTCAG CGAATACACC GCCGACCAGG TAAAAGCTGG CGCCGACGTC
ATTCAGATCT TCGATAGCTG GGTCGGCTGC CTCAGCGTCG AAGACTATCG CCGCTACGTC
CTGCCACGCA CCACCGAACT GGTGAAGGCT CTCCAGCACT CGACGAGAGT TCCGATCATC
TACTTCGGTA CTGACAGCGC CACGCTGCTG CCCTCCATGC GCGAAACCGG CGCGGAGGTG
ATCGGCCTCG ACTGGCGTGT GCCGCTCGAC CAAGGCTGGA GCCTCCTCGA ACACAGCGTC
GCGATACAGG GAAATCTCGA TCCGGTGCTG CTCTTCGCCG AGTGGCCCGA GCTCAAGTCG
CGCGCCGAGC GCATCCTCAA GCAGGCCGAC GGGCGCCCCG GTCATATCTT CAATCTCGGA
CACGGCATCC TTCCGCACAC GCCCGTTCAG AACGTGAAGG ACCTTGCCAA GTTCGTGCAT
GAATATTCTT CGCAGTTGGT AGGCCCGCGC GAATGA
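
The GC content field of this record could be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in whitespace-separated blocks of 10 bases. `gc_content` is a hypothetical helper name, and the example call uses only the first block of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence (3 G + 4 C out of 10 bases).
print(gc_content("ATGGCAGCCC"))  # 0.7
```

Passing the whole gene sequence to the same function gives the per-gene value to compare against the reported percentage.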
 
Protein sequence
MAAPNSLFVR ACKRQPVERT PVWFMRQAGR YMSEYRAVRE KYSLVEICKK PDVAAEVTIT 
AAEALNVDAA IIFADLLLPL EVMGLPFHFS PGEGPVIEVP IRDAAHITRL RTDRAHDLGY
VAEAVKKVSD HFGSKLPVIG FCGAPFTLAS YMIEGGGSRN YLEVKKLMYN SPHAWDELLG
KLVAVLSEYT ADQVKAGADV IQIFDSWVGC LSVEDYRRYV LPRTTELVKA LQHSTRVPII
YFGTDSATLL PSMRETGAEV IGLDWRVPLD QGWSLLEHSV AIQGNLDPVL LFAEWPELKS
RAERILKQAD GRPGHIFNLG HGILPHTPVQ NVKDLAKFVH EYSSQLVGPR E
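
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCAGCCC CTAACTCCCT CTTTGTCCGC"))  # MAAPNSLFVR
# Last two codons of the gene sequence: glutamate followed by a stop.
print(translate("GAATGA"))  # E
```

The two outputs match the first block and the final residue of the protein sequence listed above.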