Gene Acid345_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3233 
Symbol 
ID4072568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3828598 
End bp3830139 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content61% 
IMG OID637985254 
ProductMg chelatase-related protein 
Protein accessionYP_592308 
Protein GI94970260 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID[TIGR00368] Mg chelatase-related protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.424185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0608426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTC GCACCAAGAG CGCCGCGGTG TACGGTATCG ACGCGCACAT CATCGATGTT 
GAGGTGGATG TAGCCGCGCA ACTCTCAGAA AAACCACTGT TTACTACCGT CGGGCTGCCC
GATGCCGCAG TGCGCGAAAG CCGCGAACGG GTGCGCGCCG CTCTAAGGAA TGCGGGTTAC
GACGTTCCTA ATACCACCAT CACCGTGAAC CTGGCGCCCG CCGATATAAA GAAAGAAGGC
TCTGGCTTCG ATCTCCCGAT GGCTGCCGGA ATCTTGGGCG CCTACGGTGG CCTGAATAAG
CAGACGCTGG ACGATGTGGT GATGGTCGGC GAACTGTCGC TCGATGGCTC GATTCGCGGC
GTACGGGGGA CGTTATCCAC CGCAGTCGCC GCCCGGGCAG CGAAGATCAA GCGGCTTTTC
GTTCCGGTGG AGAATGCGCG GGAAGCCGCT GTGGTGGACG GGATCGAGAT CTACCCGGTC
AAGACGCTGA TGGACGTAGT GCACCTGATC AACACCGGCA ATGGCATTGA ACCGGTGAAG
GTTGATTCGC AGGCCGTCCT GAATGAAGCG CAACACTTCA ACGTGGATTT CAAAGACGTG
CGCGGGCAAC AGACCGCGAA GCGCGCGCTG GAAGTCGCGT GCGCAGGCGG GCACAACATC
CTGATGATCG GACCTCCGGG CAGCGGCAAG ACCATGCTCG CGAAACGCAT TCCGACCATC
ATGCCCCCGC TGACATTTGA AGAAGCGCTA GAGACCACCA AGATCCACAG CGTTGCGGGC
GTTCTCGATT CCCGTGCCGG CTTAGTGGGC GTGCGCCCGT TTCGCTCTCC GCACCACTCG
GTTTCCGACG CGGGATTGAT TGGTGGTGGC GCGGTGCCGC GTCCCGGCGA AGTCTCGCTC
GCGCACCATG GCGTGTTGTT CCTCGACGAA CTGCCGGAGT TCCCGCGCAA TGTGCTGGAG
GTGATGCGGC AGCCGCTGGA AGACGGGACG GTGACGATTT CTCGCGCGGC GATGTCGCTG
ACGTTTCCAG CGCGCTTTAT GCTGGCCGCC GCGATGAACC CGTGCCCGTG CGGGTATTTC
AACGACCGCA CTCGCGAGTG CAAATGCAGC CAGCCGATGA TCCAACGGTA CGTGGCGAAG
ATCTCCGGTC CGTTGCTCGA TCGCATTGAT ATCCACATTG ATGTGCCGGC AGTAAATTAC
AAGGAACTCC GCTCCGGCCA GGCACCCGAA GGATCGACGC AGATCCGCGA GCGAGTGCTG
CATGCGCGTG AGATCCAGTT GAACCGTTTC GCCAAAGAAC GGATTTACGC CAACGCGCAA
ATGAGTTCGC GCCAAATCCG CACCTACTGC GAGTTGTCGA GCGAGGGCGA ACACATGCTG
GAGCGTGCAA TGTCGCAACG GGGCCTGAGC GCCCGCGCCC ACGACCGCAT TCTCAAAGTC
GCGCGCACCA TCGCCGATCT CGATGCGGCA GCGCAGATCG AGTCGCGGCA CATCGCGGAG
GCGATTCAGT ATCGGGCGTT GGACCGTACG TTCTGGGCTT AG
 
Protein sequence
MLFRTKSAAV YGIDAHIIDV EVDVAAQLSE KPLFTTVGLP DAAVRESRER VRAALRNAGY 
DVPNTTITVN LAPADIKKEG SGFDLPMAAG ILGAYGGLNK QTLDDVVMVG ELSLDGSIRG
VRGTLSTAVA ARAAKIKRLF VPVENAREAA VVDGIEIYPV KTLMDVVHLI NTGNGIEPVK
VDSQAVLNEA QHFNVDFKDV RGQQTAKRAL EVACAGGHNI LMIGPPGSGK TMLAKRIPTI
MPPLTFEEAL ETTKIHSVAG VLDSRAGLVG VRPFRSPHHS VSDAGLIGGG AVPRPGEVSL
AHHGVLFLDE LPEFPRNVLE VMRQPLEDGT VTISRAAMSL TFPARFMLAA AMNPCPCGYF
NDRTRECKCS QPMIQRYVAK ISGPLLDRID IHIDVPAVNY KELRSGQAPE GSTQIRERVL
HAREIQLNRF AKERIYANAQ MSSRQIRTYC ELSSEGEHML ERAMSQRGLS ARAHDRILKV
ARTIADLDAA AQIESRHIAE AIQYRALDRT FWA