Gene Acid345_1171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1171 
Symbol 
ID4072912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1456292 
End bp1457347 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content59% 
IMG OID637983181 
Producthypothetical protein 
Protein accessionYP_590248 
Protein GI94968200 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.544586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTAA GCGAAGCAAA ACTGCACGAC TTTCTCGGCA AATTTGTCTC CGATCTTGGC 
GCCACGGCCC ATGCGGGCAT GGTGGTCATT GGAGAACGGC TCGGCCTCTA TAAAGCGCTG
GCCGGAAAGA ATCTCACTGC CGATGAACTC GCGCAAGCCA CGGGGACAGA CGCACGTTAC
GTACGCGAAT GGGCAGCCTC CCAGGCCGCC GGCGGATACC TAACTTACGA CGCACATACG
CAGAAATTCG GCATGACCGA GGAGCAAACT CTCACGCTCG CTCACGAGGA CGGACCCGCC
TATATTCCTG GAGCATTTGA ACTCGCTCTC GGATCGCTCG CTGCCGTCCC GCGCATCACC
GACGCTTTCC GCAGCGGCGC CGGCATGGGT TGGTACGAGC ACGATGAAGC GGTCTTTCAT
GGTTGCGAGA AGTTTTTCCG GCCCGGCTAC ACGATGAATC TCGTCAGCTC ATGGATCCCT
GCGCTAAGCG GCGTGAAGGA GAAACTCGAG CGTGGAGCGA AGGTCGCTGA TATTGGCTGC
GGAAAAGGCG CATCGACCAT CCTGCTCGCA AAGGCGTTCC CGAACTCCAA TATTTTCGGC
TTCGACTATC ACGATAAGTC CATCGAGATG GCTCGCGAAG CAGCGCGGCT CGAGGGCGTG
GATGATCGCG TCACCTTCCA GCTAGCCAGC GCGAAAGAGT TTCCCGGCGA CGGCTACGAT
TTTGTCTCCG TCTTCGACTG CTTGCATGAC ATGGGCGATC CCGTTGGTGC TGCTGCTCAC
GTGCGTCGCG CTCTCAAAGA CGATGGCTCA TGGATGATCG TGGAACCCTT CGCGCACGAC
GAGATGCACG ACAACTTCAA CCCGGTCGGA CGCGTGTACT ACTCGTTCTC CACCTTGCTC
TGTACGCCGT GCTCGCGTTC GCAGGAAGTG GGCGCGTGTC TTGGCGCGCA AGCTGGCGAG
AAGAGAGTGC GACAAGTCGT CGAATCTGCC GGCTTCAGCC AGTTCCGCCG CGCAACGGAA
ACGCCATTCA ACCTGATTTA CGAGGCCCGT CCGTAG
 
Protein sequence
MAVSEAKLHD FLGKFVSDLG ATAHAGMVVI GERLGLYKAL AGKNLTADEL AQATGTDARY 
VREWAASQAA GGYLTYDAHT QKFGMTEEQT LTLAHEDGPA YIPGAFELAL GSLAAVPRIT
DAFRSGAGMG WYEHDEAVFH GCEKFFRPGY TMNLVSSWIP ALSGVKEKLE RGAKVADIGC
GKGASTILLA KAFPNSNIFG FDYHDKSIEM AREAARLEGV DDRVTFQLAS AKEFPGDGYD
FVSVFDCLHD MGDPVGAAAH VRRALKDDGS WMIVEPFAHD EMHDNFNPVG RVYYSFSTLL
CTPCSRSQEV GACLGAQAGE KRVRQVVESA GFSQFRRATE TPFNLIYEAR P