Gene Acid345_3135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3135 
Symbol 
ID4070250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3727800 
End bp3729272 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content60% 
IMG OID637985155 
Productradical SAM family Fe-S protein 
Protein accessionYP_592210 
Protein GI94970162 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.183959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCGC TAGGGAAGCG TCCTCTGAAC GTGAAGTTCA TCCTGCCGGC GCTGAAGGAA 
GCGACCGATC CGTATTGGCG GCCGATCAAG TACTCGCTCT TTCCGCCGCT CGGGCTGGCG
CAACTTGCCG CGTATCTCTC GCCTGACGAT TATGTCGTGC TCACCGATGA ACATGTCGAA
CCTCTCACGC TGGAAGACAA TCCGGATCTT GTCGTAATCC AGGTGTACAT CACCAATGCT
TACCGCGCGT ATCGCATTGC GGACCACTAT CGGAAGCGCG GGGCGTTCGT CTGTCTCGGC
GGATTGCATG TGACTTCCAT GCCGCACGAA GCTGCAGAAC ACGCCGACTC GATCTTCCTC
GGACCAGGCG AACAGATCTT CCCGCAGTTC CTGACCGACT TCCGCGCGGG AAATCCGCAG
CGACTTTATG CCTCGACGAG CGGGCGCACG TTGGAGCGCG CTCCGTCGCC GCGACGCGAC
CTGATCAAGC GCCATTGCTA TCTGGTGCCG AACTCGATCG TGGTGACGCG CGGCTGCCCG
CAGCACTGCG ACTTCTGCTA CAAGGACGCG TTCTACCAGG GCGGCAAGAC CTTTTACACG
CAGCGAGTGG ACGAGGCGCT GGCCGAGATC TCGCGGCTTC CCGGACGTCA CGTGTACTTC
CTTGACGACC ACATGCTCGG CGATCGCCGT TTTGCCGAAG GGCTCTTCGA CGGCATGAAA
GGAATGCGGC GTTTGTTCCA GGGCGCTGCG ACGGTTGATT CCATCCTGCG CGGAAACCTG
ATCGAACGCG CGGCGGAAGC GGGGCTGCGC AGTATCTTCG TCGGCTTCGA GACGCTCGCG
CCCGCGAACC TGAAGCAGTG CAACAAGCGG CAGAACCTCG GCCGCGACTA CAAGGCGGTG
ACCGATCGCC TGCACTCGCT CGGCATCATG ATCAACGGCA GCTTTGTCTT TGGCATGGAC
GACGACGGGC CTGACGTCTT TCGGCGCACC GTGGATTGGG CCGTCGAGCA CGGCGTTACG
ACGGCGACGT TCCACATTCA AACACCGTAT CCGGGAACCG GGCTGCATGC GCGCATGGAG
CGCGAAGGGC GCATGACGAC GCGCGACTGG AACCTCTATG ACACGCGGCA CGTGGTCTAT
CGTCCGGCGA GGCTTACGGC GGAACAACTG AAGACCGGTT ACGACTGGGC CTACGAAGAG
TTCTACACCT GGAGCAACAT TGCGAAGGCA TCGCTGCACC ACGGGACGCT GAAGCACCAG
GCCAAGCATT TCTTCTATGC GTCGGGGTGG AAGAAATTCG AGGCGGTTTG GGATTTCATC
ATTCGCACCC GGCAATTGAA TCGGACCACG AAGATTCTCG AGAGCGTGCT GTCGAAGGTG
ACAGGCAAGA AGGAAGACCA TACTTTCGTC CCGCCGATTC CGTCGCCGCA AAATGCAGAG
TTGGTGACGA TTTCGACAGA GCAAGTTTCA TGA
 
Protein sequence
MVALGKRPLN VKFILPALKE ATDPYWRPIK YSLFPPLGLA QLAAYLSPDD YVVLTDEHVE 
PLTLEDNPDL VVIQVYITNA YRAYRIADHY RKRGAFVCLG GLHVTSMPHE AAEHADSIFL
GPGEQIFPQF LTDFRAGNPQ RLYASTSGRT LERAPSPRRD LIKRHCYLVP NSIVVTRGCP
QHCDFCYKDA FYQGGKTFYT QRVDEALAEI SRLPGRHVYF LDDHMLGDRR FAEGLFDGMK
GMRRLFQGAA TVDSILRGNL IERAAEAGLR SIFVGFETLA PANLKQCNKR QNLGRDYKAV
TDRLHSLGIM INGSFVFGMD DDGPDVFRRT VDWAVEHGVT TATFHIQTPY PGTGLHARME
REGRMTTRDW NLYDTRHVVY RPARLTAEQL KTGYDWAYEE FYTWSNIAKA SLHHGTLKHQ
AKHFFYASGW KKFEAVWDFI IRTRQLNRTT KILESVLSKV TGKKEDHTFV PPIPSPQNAE
LVTISTEQVS