Gene Acid345_2218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2218 
Symbol 
ID4072963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2636700 
End bp2637632 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content62% 
IMG OID637984234 
Productprotoheme IX farnesyltransferase 
Protein accessionYP_591293 
Protein GI94969245 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0109] Polyprenyltransferase (cytochrome oxidase assembly factor) 
TIGRFAM ID[TIGR01473] protoheme IX farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.446862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.810917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGG TCACACAGCC GCTTGCGGTT CCCCGCCCGG GTGTTGCGTC GCTTCTCCGC 
GACTACTCTG AATTGACCAA GGCCCGCGTC ACCACGCTCA TCGTGATGAC CGCCTGGACC
GGCGCATTCT TCGGCGCGGC CAAATCAGGA TTGCCGCTGG TGAGTTGGAC GCTGTTCCAT
GCGCTGCTCG GCATCGGTCT CGTCTCCGGC GGTACAGCCG CGATGAACGA AGTGATCGAG
CGTGAGTCCG ACGCGCGCAT GCGCCGCACC GCCATCCGTC CGCTAGTGAC CGGCAGCATG
AGCCTGGGCC ATGGCATGGT CGTCTCGCTG GTAATGATGA TCGGCGGCGC CGGCTATCTT
GGCCTCATGA CCAATTGGCT CACCGCTGCA CTCGCGCTGA TGACCAGCGT GGTCTACCTG
ATGGCGTATA CGCCGTTGAA GAAGATCCAC CCCATCTGCA CCACCATCGG CGCCTTCCCT
GGCGCGATGC CGCCGGTGCT CGGCTGGACC GCCATCCGCG GGCATCTCGG GTGGGAAGCA
GTTGCGTTGT TCGCCATTCT GTTCTTCTGG CAGTTCCCGC ACTTCCATTC CATCGCATGG
CTCTACCGTG ACGACTACGC GAATGCGGGA ATCCGCATGC TGCCCGTGGT CGAACGCGAT
GGCCGCTCTA CGGCGCGCGA GATCGTGATC TACGCAGCCT TCCTGTTGCC GATCACCCTG
ACACCGTTCC TGCTGCGTTT TGCCGGAAGA ATCTACTTCC TGGCGGCGCT GGTGCTTGGC
AGCATGCTTT TCTGGGTGAG TCTCCGCATG TTCACCATGA ACCTGGCGCC CAGCTCACCG
CATTCCAAGA AATACGCGCG ACAGTTGCTG CTGGCGTCCG TGACGTACCT GCCACTGCTA
TTCGCGGTGA TGATGCTGGA TCGGACGATC TAA
 
Protein sequence
MSSVTQPLAV PRPGVASLLR DYSELTKARV TTLIVMTAWT GAFFGAAKSG LPLVSWTLFH 
ALLGIGLVSG GTAAMNEVIE RESDARMRRT AIRPLVTGSM SLGHGMVVSL VMMIGGAGYL
GLMTNWLTAA LALMTSVVYL MAYTPLKKIH PICTTIGAFP GAMPPVLGWT AIRGHLGWEA
VALFAILFFW QFPHFHSIAW LYRDDYANAG IRMLPVVERD GRSTAREIVI YAAFLLPITL
TPFLLRFAGR IYFLAALVLG SMLFWVSLRM FTMNLAPSSP HSKKYARQLL LASVTYLPLL
FAVMMLDRTI