Gene Acid345_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1567 
Symbol 
ID4068676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1915802 
End bp1916842 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content61% 
IMG OID637983576 
Producthypothetical protein 
Protein accessionYP_590643 
Protein GI94968595 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00430195 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00703244 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGCGTC TTGGCCAGCC CGCCTACCGC TCCCGGCAGC TTTGGCAAGG CCTTTACCGC 
GACCGAATCG CTTCACTCGA CCAGTTCACC ACCCTCCCCA TCCCCCTCCG CGAGGAGCTC
AAATCCTCAG GTTGGGCCAT CGCTTTTCCC TTCGTCCAGA AGCGTTTCAC CTCCACCGAC
GGCACCGTAC GTTACTTATT GCAGTTCTCC GACGGCCAAT CCGTCGAGAC CGTCTGGATG
CCCGAGGGCG ACGGTGGCGA GCAAGGCGAC GGCTCCGAAG ACGGCCCCTC CTACGACCGA
GCCACCATCT GCGTCTCCAG CCAGGTCGGC TGCGCCGTTG ATTGCCAGTT CTGCATGACC
GCCTTGCTCG GCCTTCTCCG TAATCTTTCC GCCGGAGAAA TCGTTGGCCA AATCCTCGCC
GTGCTCAAAG ATGAGAACGT GGATGTCGAG AAAAGCCGCA TCAATCTCGT CTTCATGGGC
CAGGGCGAGC CCTTCCTGAA CTTCGACAAC TTCGTGAAGG CTGTCACGCT TCTTGCTGAA
GCCGTTGGGA TTCCCGAATC CCGCATGACC GTCTCGACCT CCGGTATCGT CCCGCGCATC
GTCGATTTCG GTCAGCTCGC GATCCGTCCC AAACTAGCAA TCTCGCTCAA CGCCTCCAAC
GACGAATCCC GCCGCGAACT CATGCCGATC ACCAAGAAGT GGACGCTCGA AAAGCTGATG
TCCGCGGCGC GCGAGTTCCC TCTCCGCAAC CGCGAGCGCA TGACCTTCGA GTACGTTCTC
CTGGGTGGCG TCAACGACAG CGAGCAGAAT GCCCGCGAAG TGGTTCAACT GCTGCGCGGC
CTCCGCGCCA AGGTAAATCT CATCGCCTGG AACCCCGGCC CCGAGATCCC CTTCTCCACG
CCCGATCCCC AGCACGTGGA AGCCTTTCAA CAGATCCTCA TCGACGCCGG CATCCCCACA
TTCATCCGCA AGCCGCGTGG ACGAGACATC TTCGCCGCCT GCGGACAGTT GAAGCGCACG
GAACTCGTCA CTCTCAGCTA A
 
Protein sequence
MERLGQPAYR SRQLWQGLYR DRIASLDQFT TLPIPLREEL KSSGWAIAFP FVQKRFTSTD 
GTVRYLLQFS DGQSVETVWM PEGDGGEQGD GSEDGPSYDR ATICVSSQVG CAVDCQFCMT
ALLGLLRNLS AGEIVGQILA VLKDENVDVE KSRINLVFMG QGEPFLNFDN FVKAVTLLAE
AVGIPESRMT VSTSGIVPRI VDFGQLAIRP KLAISLNASN DESRRELMPI TKKWTLEKLM
SAAREFPLRN RERMTFEYVL LGGVNDSEQN AREVVQLLRG LRAKVNLIAW NPGPEIPFST
PDPQHVEAFQ QILIDAGIPT FIRKPRGRDI FAACGQLKRT ELVTLS