Gene Acid345_1739 details

Gene Information

Locus tag: Acid345_1739
Symbol:
ID: 4072006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2110629
End bp: 2111603
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 58%
IMG OID: 637983747
Product: 4-hydroxy-3-methylbut-2-enyl diphosphate reductase
Protein accession: YP_590814
Protein GI: 94968766
COG category: [I] Lipid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0761] Penicillin tolerance protein
TIGRFAM ID: [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming)
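
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not part of the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2110629, 2111603)
print(length)                     # 975
print(protein_length_aa(length))  # 324
```

Under these assumptions the record is internally consistent: 975 bp corresponds to 325 codons, one of which is the stop codon.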


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.739612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCG ATCAAACTGA CAAGACTTTA CTGTTGCTGA AGCCACGCGG CTTCTGCGCT 
GGGGTGGTTC GCGCCATCGA CATCGTTCGC ATTGCCCTTG AGGCGTTCGG GGCGCCGATT
TATGTGCGCA AAGAGATTGT GCACAACCGC TACGTCGTGG ACGACCTCGC CAGCAAGGGT
GCCATTTTCG TCGACGAAAT TGACGAAGTT CCCAATGGAC AGCGGGTGAT TTACTCCGCC
CACGGCGTGT CACCGGAAGT CCGCGAGGCA AGCAAGAAGC GCGGCCTGCG CGTGATCGAT
GCTACTTGTC CGCTGGTCAC CAAGGTCCAC GTAGAGGCGA TCAAATTTGC TAAGGAAGGG
CACTCGCTCG TCTTGATCGG CCACCACGAC CACGATGAGG TCGTCGGTAC GCTCGGCGAG
GCGCCGGATG TCACGTACGT CATCTCGACG CCGGAAGAGG TCCAGACGCT TGAGATTCCC
GATCCGAATC GCGTGGCCTA TCTGACGCAG ACCACTTTGA GCCTCGATGA AACCGTGCAC
GTGATCGCTG CGCTCAAGCA GAAGTTCCCC AACATTAAAG GACCGCACGC GCAGGATATC
TGCTACGCCA CCGAAAATCG CCAGACGGCG GTGAAGGATG TATCCGCTGA ATGCGACCTG
TTGCTGGTGG TGGGTTCGGA CAACAGCTCG AACTCGAACC GGCTGGTGGA AGTGGCGCGC
AATCTTCGAA CCAAGTCTCA TCTCATCGAG AACTTCAAGG CGATTCAGTC AGAGTGGCTG
GAAGGCGTCC GCACGGTTGG GGTGACCGCG GGTGCGTCGG CACCGGAGAT CTTGGTAGAG
CAGGTCGTCG AGTTCCTGAC CAGTAAGGGC TACACGAACC TGAAGGAAGT GGAAGTTATG
CCGGAGAATG TGCGCTTCGG ACTGCCGCCC GAGATTGTTG CAGCCATTGG GTCGGCCCCG
GCAGCGGCAC AGTAA
 
Protein sequence
MSFDQTDKTL LLLKPRGFCA GVVRAIDIVR IALEAFGAPI YVRKEIVHNR YVVDDLASKG 
AIFVDEIDEV PNGQRVIYSA HGVSPEVREA SKKRGLRVID ATCPLVTKVH VEAIKFAKEG
HSLVLIGHHD HDEVVGTLGE APDVTYVIST PEEVQTLEIP DPNRVAYLTQ TTLSLDETVH
VIAALKQKFP NIKGPHAQDI CYATENRQTA VKDVSAECDL LLVVGSDNSS NSNRLVEVAR
NLRTKSHLIE NFKAIQSEWL EGVRTVGVTA GASAPEILVE QVVEFLTSKG YTNLKEVEVM
PENVRFGLPP EIVAAIGSAP AAAQ
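
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the sequence uses only the letters A, C, G and T (any other symbol raises a KeyError), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names are hypothetical.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same codon-to-amino-acid assignments;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (for example GTG) would be emitted as V
    here rather than M; the gene above starts with ATG, so this does not matter.
    """
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases of the gene sequence above.
print(translate("ATGAGTTTCG ATCAAACTGA C"))  # MSFDQTD
```

Pasting the full gene sequence into `translate` and comparing the result with the protein sequence, or into `gc_fraction` and comparing with the GC content field, is one way to check the record for consistency.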