Gene Acid345_2319 details

Gene Information

Locus tag: Acid345_2319
Symbol:
ID: 4071473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2748765
End bp: 2750270
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 59%
IMG OID: 637984335
Product: radical SAM family Fe-S protein
Protein accession: YP_591394
Protein GI: 94969346
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR03471] hopanoid biosynthesis associated radical SAM protein HpnJ
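
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2748765
END_BP = 2750270
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1506
print(protein_length(GENE_LENGTH_BP))   # 501
```

Both results agree with the Gene Length and Protein Length fields of the record.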


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.518236
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0137531
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTGA AGACACTGTT TCTGAATCCG CCATCGTTTG AGAATTACGA CGGAGGCGCC 
AGTTCGCGCT GGCCAGCCAC CCGTGAGATC GAGTCCTACT GGTATCCGGT ATGGCTCGCG
TATCCCGCTG GGATGCTGGA GGGGTCGCGG TTGCTCGATG CACCGCCGCA CCACGTGTCG
TTCGACGAGA CGATCCAGAT CAGCAAGGAC TACGACTTCG TAGTGCTGTT CACCAGCACT
CCCGGCTTCC CGGGCGACCT GAAGATCGCC AAGGCGATGA AGACCGCGAA CCCGAATGTA
AAGATTGCGT TCGTCGGGCC GCATGTGACG ACCCTGCCGG AGCGTTCCCT GGCGGAGGGT
CCGGAGATTG ATTTCATCGT CCGCCGCGAG TTCGACTACG CGGTAGTGGA GTATGCCAAC
GGCAAGCCGC TGAACGAGAT TACCGGTGTC AGTTATCGCG GCGCGGATGG CAAGGTCGTG
CACAATCCCG ACCGGCCGCC GGTTTCGAAC CTCGACGAGA TGCCGGATGT GATCGACGTT
TACAAGCGCG ATCTCGACGT GAAGCGCTAC AACGTTCCCT TCCTGCTACA TCCGTACATC
GCGCTGTACA CCACGCGTGG ATGCCCAGCG CAATGCACGT TCTGCCTGTG GCCGCAGACC
CTCAGTGGGC ATCCGTGGCG CAAGCGTTCC AGCGATGCGG TAGCGTCGGA GATGAAGCGG
GCCATGGAAC TCTTCCCTTG GGTGCGCGAG TTCTTCTTCG ACGACGATAC CTTCAACATC
CAGAAGGCAC GCACCATCGA GCTTTGCGAA AAGCTGAAGC CGCTGGGGAT GACCTGGAGC
TGCACCTCGC GCGTGACCAC CGACTACGAG ACGCTGAAGG CGATGAAGGA AGCCGGATGC
CGGCTGCTGA TCGTTGGTTA CGAATCGGGC GATCCGCAGA TTTTGAAAAA CATTAAGAAG
GGCGCCACCG TCGAGCGCGC GCGGGCCTTT ACCAAGGATT GCCACAAACT CGGCCTGAAG
GTGCACGGCG ATTTCATCCT CGGCCTGCCG GGCGAGACGA AAGAGACGAT CCGCCGCACC
ATGGATTTCG CGAAAGAACT TGACGTCGAG ACGATCCAGG TTTCAATCGC GCACGCGTAT
CCGGGAACGG AGCTTTACGA CTACGCGAAG GCCAATGGCT TCATCGTGCA GGAAGGCGCT
GCAATGGTGG ACGACCAGGG CCACCAGGTC GCGATGATCG AATACCCCGG CCTGCCCCGC
GATTACGTGA TGGAGATGGT GCACAAGTTC TACGACGAAT ACTATTTCCG CCCGAAGGCG
ATCTTCCGCA TCGTGCGCAA GGCTGTATTC AATAATGTGG AGCGCAAGCG TCTCTACAAA
GAAGCCAAGG ACTTCATGAA GCTGCGCAGC GTACGCAATA AGGCCGTGAA GGTCGCGCGT
GAACAGCAGG CGAAGTCGAA CATCAATCCG AAAAAGAATG AGCCGGCAGA GCCGGTGAAT
GTGTAG
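
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGCCTTTGA AGACACTGTT TCTGAATCCG CCATCGTTTG AGAATTACGA CGGAGGCGCC"
)
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.2%}")
```

Passing the full sequence block instead of `first_line` gives the value for the whole gene.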
 
Protein sequence
MPLKTLFLNP PSFENYDGGA SSRWPATREI ESYWYPVWLA YPAGMLEGSR LLDAPPHHVS 
FDETIQISKD YDFVVLFTST PGFPGDLKIA KAMKTANPNV KIAFVGPHVT TLPERSLAEG
PEIDFIVRRE FDYAVVEYAN GKPLNEITGV SYRGADGKVV HNPDRPPVSN LDEMPDVIDV
YKRDLDVKRY NVPFLLHPYI ALYTTRGCPA QCTFCLWPQT LSGHPWRKRS SDAVASEMKR
AMELFPWVRE FFFDDDTFNI QKARTIELCE KLKPLGMTWS CTSRVTTDYE TLKAMKEAGC
RLLIVGYESG DPQILKNIKK GATVERARAF TKDCHKLGLK VHGDFILGLP GETKETIRRT
MDFAKELDVE TIQVSIAHAY PGTELYDYAK ANGFIVQEGA AMVDDQGHQV AMIEYPGLPR
DYVMEMVHKF YDEYYFRPKA IFRIVRKAVF NNVERKRLYK EAKDFMKLRS VRNKAVKVAR
EQQAKSNINP KKNEPAEPVN V
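
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal pure-Python illustration, not the pipeline used to produce this record. It relies on the fact that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The name `translate` is illustrative.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last five codons of the gene sequence.
print(translate("ATGCCTTTGA AGACACTG"))  # MPLKTL
print(translate("CCGGTGAATG TGTAG"))     # PVNV
```

The two fragments match the beginning (MPLKTL) and the end (PVNV) of the protein sequence listed above.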