Gene Acid345_2185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2185 
Symbol 
ID4071437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2607869 
End bp2609086 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID637984201 
ProductMazG family protein 
Protein accessionYP_591260 
Protein GI94969212 
COG category[R] General function prediction only 
COG ID[COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain 
TIGRFAM ID[TIGR00444] MazG family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTTG GCATAATCCA GAGCGCGCGT TTAAAACCGC GGAAGCTGAA TTCGACCGCA 
TTGCGATTGG CGCGGATGCC GACGGTGGAG AGAACAATCC AGGCGAAAGC GGCGAGCGAG
AGCCAGTCTT GGAGCGGGCG CGGCGACCAC AGCGCCAGCA TGATCAGCGT AAAAGTGAAA
GCGACCTCGA ACACAGGCCG CGCGGTCTGG CTCACATCCT GCTGCACAAC TGCTCATTAT
GCCCGAAATC GAGGCGGGCG AGGCGGCGAT TGGCGATTTC TTACAACCCC GAACGTGCAC
GAGGAATTAC ACTGTGGTGC AATGACGACG ACAGGGGATC GCTTCGAACG CGCCGTGGCG
ATCATGAAGC GGCTACGCGC GCCGGGCGGG TGTCCATGGG ACCGCGAGCA GACCTTCGAC
ACGATCAAGC CTTTCACGCT GGAAGAGACT TACGAGGTCA TCGAAGCGAT TGACAATCGC
GACTGGGACG AACTGCCGGG AGAACTCGGA GATCTGCTGC TGCAGATCCT TTTTTATGCG
CAAATGGCAG CCGAGGAAGG GAAGTTCACG ATTGACGAAG TCGTCGAGAC ACTCTCGAAC
AAGCTGGTGA ACCGCCATCC GCACGTGTTT GGAACTGTCG AAGCCAACAC CGCCGCCGAG
GTCGTGCGCA ATTGGGAGAT GCTGAAGGCG CAGGAACGCG CGGCCAGCGG CAAGCACCAG
GCGAAAGAGG CAGCGAACAA ATCGCTGCTG GCCGGCGTTT CGCTGGGACT GCCAGGCCTG
ATCGAAGCCG GTAAGATCAG CAACAAGGCT GCGGGCGTTG GCTTCGAGTG GCCGAAGATC
GAGGGACTGT TCGAAAAGCT GACGGAAGAA ACCGTCGAGC TGCGCGAGGA ACTAAAGCAT
TTTCCCGGAG GCGAGCCGAC ACCCCCGAGC CAGCACGGAA TTGCGAGTTC CACCGGCGAA
GAAGCTCTGG AAGATGGGCT ACGGGCGCGG CTTGAGGATG AGATTGGCGA TTTATTATTT
ACGGTGGTGA ACCTGGCGCG ATATGTTCGA GTGGACCCGG AATCGGCGTT GAAGCGCACC
AACCGTAAAT TCCGCGCGCG TTTTCTCGCG ATGGAACAAA CAGCAGAACA GCAAGGCAGG
AAGTTGGACG GCCTCTCGCT GGAGGAGTTG GAAGAACTGT GGCAGCAGTC GAAACGAGTA
GAGCGGGACG GGAAATGA
 
Protein sequence
MAVGIIQSAR LKPRKLNSTA LRLARMPTVE RTIQAKAASE SQSWSGRGDH SASMISVKVK 
ATSNTGRAVW LTSCCTTAHY ARNRGGRGGD WRFLTTPNVH EELHCGAMTT TGDRFERAVA
IMKRLRAPGG CPWDREQTFD TIKPFTLEET YEVIEAIDNR DWDELPGELG DLLLQILFYA
QMAAEEGKFT IDEVVETLSN KLVNRHPHVF GTVEANTAAE VVRNWEMLKA QERAASGKHQ
AKEAANKSLL AGVSLGLPGL IEAGKISNKA AGVGFEWPKI EGLFEKLTEE TVELREELKH
FPGGEPTPPS QHGIASSTGE EALEDGLRAR LEDEIGDLLF TVVNLARYVR VDPESALKRT
NRKFRARFLA MEQTAEQQGR KLDGLSLEEL EELWQQSKRV ERDGK