Gene Acid345_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2366 
Symbol 
ID4069178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2795820 
End bp2797052 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content59% 
IMG OID637984382 
Productputative oxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_591441 
Protein GI94969393 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCTCG GCATTTATAT CTCTGTCCCG TTTTGCCGCA CTAAGTGCTC TTTCTGCAAT 
TTCGCGTCAG GGGTGTTCTC GCGGGAGCTT TTCGACCGCT ACATCAATAT CGTGGCCGAA
GACATTGCGC GGGCCGAACA GATTGCGCCG GGCGCTCAGT TCGAGAGCGA GGTGGACTCG
ATCTATCTCG GTGGGGGGAC TCCGAGTGTA TTGGCCCCCG ACCAACTGGA GCGACTTTTC
CTCGCGGTTC ACGAAAAGTT CAAAGTGAGC AACAACACCG AAGTCACAGT GGAATGCGCG
CCGGGAACAC TGACCCGCGA AATGCTGAAC GCGCTGGTGG ATTTCGGTGT GAACCGCGTG
AGCCTTGGGG TGCAGTCGTT TGTAGATGAA GAGAGCCGTT CGGTCGGACG ACTCCATACG
CGCGAAATCA CCTTCGCCGA CATCAATTCC CTGCGCCAAC ATGGGATTGA AAACATCAGC
GTGGACCTGA TCGCCGGACT TCCGCACCAA ACGCCCAAGA GTTGGCGTGA GTCGCTGCAG
GACGTGGTGG ACTCGCAGGT GCCGCACGTA AGTGTGTACA TGCTTGAGGT CGATGAGGAC
TCCCGCTTGG GACGCGAACT GATGGCGGGC GGCACGCGCT ACCATGCCCA TTTTGTCCCC
GATGACGACA CCACCGCGGA CCTCTATCAG CAGGCGTGCG ACACACTGAA CAAGGCTGGA
GTAAGGCAAT ACGAAATCTC GAATTTTGCA CGCCCAGAGA GCGAATCGCG GCATAACTTA
AAGTATTGGT TGAGGCAGCC TTATCTTGGA TTTGGGGTGG ATGCTCACTC CATGCTGGCT
TCGCAGAATT GCGAGGGACT GCGCTTCTCC ACTGCCGACG ACCTGGATCA ATTCCTTGCC
GGAGCGCCGC GAACGTTTCG CCATGTGAAC CGGGCCGCAG CGGAAGACGA AGCATTCTTC
CTCGGCTTGC GACTGAACCG CGGCGTCGAC CTGAGCGCGA TCGAGCAGGA GTCCGGGAAA
GACCCCGTCG CACGTCGCGG CGCCGCACTC GAAGAACTCA CAGAGGCCGA ACTCCTCCAA
CGCGATCACT GCACCATACG TCTCACCGAT CGCGGGCGGT TGCTCTCCAA TGAAGTCTTT
GAGCGGTTAA CGCTTGCCGC GCCTCCCGAA GAGGAACGGC GGAACGAGTC GGCGAATCCG
CCACTCATCG CAATTTCTCC GCCACCCAAG TAA
 
Protein sequence
MALGIYISVP FCRTKCSFCN FASGVFSREL FDRYINIVAE DIARAEQIAP GAQFESEVDS 
IYLGGGTPSV LAPDQLERLF LAVHEKFKVS NNTEVTVECA PGTLTREMLN ALVDFGVNRV
SLGVQSFVDE ESRSVGRLHT REITFADINS LRQHGIENIS VDLIAGLPHQ TPKSWRESLQ
DVVDSQVPHV SVYMLEVDED SRLGRELMAG GTRYHAHFVP DDDTTADLYQ QACDTLNKAG
VRQYEISNFA RPESESRHNL KYWLRQPYLG FGVDAHSMLA SQNCEGLRFS TADDLDQFLA
GAPRTFRHVN RAAAEDEAFF LGLRLNRGVD LSAIEQESGK DPVARRGAAL EELTEAELLQ
RDHCTIRLTD RGRLLSNEVF ERLTLAAPPE EERRNESANP PLIAISPPPK